speaker-diarization · GitHub Topics

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

conformer PyTorch speech-recognition paraformer punctuation speaker-diarization rnnt audio-visual-speech-recognition pretrained-model voice-activity-detection Whisper dfsmn vad speechgpt speechllm

Python 11.47 k

9 天前

espnet / espnet

#计算机科学#End-to-End Speech Processing Toolkit

深度学习 end-to-end chainer PyTorch kaldi speech-recognition speech-synthesis speech-translation machine-translation voice-conversion speech-enhancement speech-separation singing-voice-synthesis speaker-diarization text-to-speech

Python 9.28 k

2 天前

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

PyTorch speech-processing speaker-diarization voice-activity-detection pretrained-models speaker-recognition speaker-verification

Jupyter Notebook 7.87 k

6 天前

MahmoudAshraf97 / whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

asr speaker-diarization speech speech-recognition speech-to-text Whisper

Jupyter Notebook 4.71 k

3 个月前

linto-ai / whisper-timestamped

#计算机科学#Multilingual Automatic Speech Recognition with word-level timestamps and confidence

深度学习 speech speech-recognition speech-to-text asr 机器学习 Python PyTorch attention-is-all-you-need attention-mechanism attention-model speaker-diarization speech-processing transformers Whisper

Python 2.5 k

3 个月前

Purfview / whisper-standalone-win

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

openai speech-to-text Whisper asr speech-recognition subtitles ctranslate2 faster-whisper whisperx uvr diarization speaker-diarization

2.25 k

3 个月前

modelscope / 3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

speaker-diarization speaker-verification language-identification modelscope

Python 2.19 k

1 个月前

wq2012 / awesome-diarization

#Awesome#A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

speaker-diarization Awesome Lists 机器学习 speech-recognition speech-processing 深度学习

1.77 k

9 个月前

google / uis-rnn

#计算机科学#This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

speaker-diarization uis-rnn speaker-recognition supervised-learning clustering supervised-clustering 机器学习

Python 1.58 k

10 个月前