Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
Speaker diarization scripts, based on AaltoASR
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
翻译 - 这是用于无界交错状态递归神经网络(UIS-RNN)算法的库,与论文《完全监督的说话人歧义》相对应。
How to use OpenAIs Whisper to transcribe and diarize audio files
Transcription with speaker diarization pipeline
Unsupervised Speaker Diarization using GMM and Clustering
Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data
A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems
Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Variational Bayes HMM over x-vectors diarization
Deep Speaker: an End-to-End Neural Speaker Embedding System.
🔈 Deep Learning & 3D Convolutional Neural Networks for Speaker Verification
Keras implementation of ‘’Deep Speaker: an End-to-End Neural Speaker Embedding System‘’ (speaker recognition)
Speaker materials from CppCon 2014
翻译 - CppCon 2014的扬声器材料
A Speaker Recognition System
🔈 Deep Learning & 3D Convolutional Neural Networks for Speaker Verification