GitHub Chinese Community


©2025 GitHub Chinese Community



#vad


modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

conformer PyTorch speech-recognition paraformer punctuation speaker-diarization rnnt audio-visual-speech-recognition pretrained-model voice-activity-detection Whisper dfsmn vad speechgpt speechllm

Python 9.71 k

1 day ago

smacke / ffsubsync

Automatically synchronizes subtitles with video, making subtitle editing more efficient

subtitles Video audio FFmpeg vad fft synchronization sync subtitle captions vlc vlc-media-player srt srt-subtitles voice-activity-detection fast-fourier-transform alignment caption

Python 7.1 k

2 months ago

snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

voice-detection voice-recognition voice-commands PyTorch onnx voice-activity-detection voice-control onnx-runtime onnxruntime speech speech-processing vad

Python 5.55 k

19 days ago

CheshireCC / faster-whisper-GUI

faster_whisper GUI with PySide6

faster-whisper openai transcribe vad Whisper whisperx asr

Python 2.28 k

4 months ago

k2-fsa / sherpa-ncnn

Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, Lich...

Python speech-recognition C++ asr C C# Go Kotlin vad voice-activity-detection

C++ 1.26 k

3 months ago

jtkim-kaist / VAD

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

vad dnn lstm attention speech data voice-detection speech-recognition voice-activity-detection

MATLAB 854

4 years ago

amsehili / auditok

An audio/acoustic activity detection and audio segmentation tool

voice-detection vad voice-activity-detection

Python 770

4 months ago

DmitryRyumin / ICASSP-2023-24-Papers

#face-recognition# ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processi...

asr denoising domain-adaptation face-recognition language-modeling self-supervised-learning semantic-segmentation signal-processing speech-recognition vad generative-models image-generation music-generation multimodal-learning

Python 449

3 months ago

shashikg / WhisperS2T

#computer-science# An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines

asr deep-learning speech-recognition speech-to-text Whisper tensorrt-llm tensorrt vad voice-activity-detection

Jupyter Notebook 388

8 months ago

gtreshchev / RuntimeAudioImporter

Runtime Audio Importer plugin for Unreal Engine. Importing audio of various formats at runtime.

unreal-engine audio-files mp3 blueprints audio plugin ue4-plugin audio-player ue5 ue5-plugin unreal-engine-5 vad voice-activity-detection

C++ 382

2 months ago

filippogiruzzi / voice_activity_detection

#computer-science# Voice Activity Detection based on Deep Learning & TensorFlow

voice-activity-detection deep-learning speech Tensorflow time-series time-series-classification resnet speech-recognition Python machine-learning vad artificial-intelligence deep-neural-networks

Python 361

2 years ago

Baidu-AIP / speech-vad-demo

Integrates the WebRTC VAD to split audio files

WebRTC vad speech

C 341

5 years ago

gkonovalov / android-vad

#android# Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.

vad offline real-time audio-processing WebRTC Android dnn on-device-ai neural-networks voice-detection deep-neural-networks onnx-models voice-activity-detection

C 324

2 months ago

EtienneAb3d / WhisperHallu

Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts

asr sound-processing text-to-speech vad Whisper audio-processing vocals

Python 319

5 months ago

Picovoice / cobra

On-device voice activity detection (VAD) powered by deep learning

voice-activity-detection speech-recognition vad on-device

Python 206

2 days ago

eesungkim / Voice_Activity_Detector

A statistical model-based Voice Activity Detection

vad voice-detection voice-activity-detection

Jupyter Notebook 192

6 years ago

xiongyihui / python-webrtc-audio-processing

Python bindings of WebRTC Audio Processing

Python vad agc ns

C++ 188

7 months ago

voithru / voice-activity-detection

PyTorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021

voice-activity-detection vad

Python 153

3 years ago

Enumerate user mode shared memory mappings on Windows.

driver vad ntoskrnl shared-memory windows-kernel Windows

C 118

4 years ago

xia-chu / webrtc_apm

Extraction of the APM-related code from WebRTC, including AEC/NS/AGC/VAD, plus mp3/aac encoders and SoundTouch

WebRTC aec vad agc ns mp3 aac jni

C 99

2 years ago
