A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
翻译 - 这是用于无界交错状态递归神经网络(UIS-RNN)算法的库,与论文《完全监督的说话人歧义》相对应。
#大语言模型# turnkey self-hosted offline transcription and diarization service with llm summary