#Computer Science#A PyTorch-based Speech Toolkit
#Computer Science#End-to-End Speech Processing Toolkit
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing, etc.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
#Computer Science#Multilingual Automatic Speech Recognition with word-level timestamps and confidence
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
#Awesome#A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
#Computer Science#This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
#Computer Science#A Python package to build AI-powered real-time audio applications
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
#Large Language Models#Turnkey self-hosted offline transcription and diarization service with LLM summary
#Computer Science#Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.
Speaker diarization with UIS-RNN and speaker embedding with vgg-speaker-recognition
Aims to create a comprehensive voice toolkit for training, testing, and deploying speaker verification systems.
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.
#Computer Science#End-to-End Neural Diarization
Open source inference code for Rev's model