”diarization“ 的搜索结果

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

asr speech speech-recognition speech-to-text Whisper

Python12.68 k

3 个月前

speaker-diarization uis-rnn whisper speaker-recognition speech-recognition speech-to-text speech supervised-clustering asr

whisper-diarization

@MahmoudAshraf97

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

asr speaker-diarization speech speech-recognition speech-to-text

Jupyter Notebook3.8 k

6 天前

pyannote-audio

@pyannote

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook6.44 k

3 天前

awesome-diarization

@wq2012

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

1.64 k

2 个月前

EEND

@hitachi-speech

End-to-End Neural Diarization

Python376

3 年前

dscore

@nryant

Diarization scoring tools.

Python221

2 年前

wespeaker

@wenet-e2e

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Python739

3 天前

Speaker-Diarization

@taylorlu

speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition

Python471

3 年前

VBx

@BUTSpeechFIT

Variational Bayes HMM over x-vectors diarization

Python254

1 年前

3D-Speaker

@modelscope

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python1.27 k

3 天前

speaker-diarization

@aalto-speech

Speaker diarization scripts, based on AaltoASR

Python191

6 年前

SpeakerDiarisation

@rvipandey

Unsupervised Speaker Diarization using GMM and Clustering

Jupyter Notebook6

4 年前

VBDiarization

@Jamiroquai88

Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data

Python95

1 年前

pyannote-metrics

@pyannote

A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems

Python187

2 年前

SpectralCluster

@wq2012

Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.

Python508

2 个月前

WeeaBlind

@FlorianEagox

A program to dub non-english media with modern AI speech synthesis, diarization, and voice cloning!

Python211

7 个月前

uis-rnn

谷歌公司@google

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

翻译 - 这是用于无界交错状态递归神经网络（UIS-RNN）算法的库，与论文《完全监督的说话人歧义》相对应。

speaker-diarization uis-rnn speaker-recognition supervised-clustering

Python1.56 k

2 个月前

编程语音

Python
Jupyter Notebook