WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
End-to-End Neural Diarization
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition
Variational Bayes HMM over x-vectors diarization
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Speaker diarization scripts, based on AaltoASR
Unsupervised Speaker Diarization using GMM and Clustering
Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data
A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems
Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.
A program to dub non-english media with modern AI speech synthesis, diarization, and voice cloning!
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
翻译 - 这是用于无界交错状态递归神经网络(UIS-RNN)算法的库,与论文《完全监督的说话人歧义》相对应。