speaker-verification · GitHub Topics

kaldi-asr / kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

kaldi C++ CUDA Shell speech-recognition speech-to-text speaker-verification speaker-id speech

Shell 14.76 k

3 months ago

speechbrain / speechbrain

#Computer Science# A PyTorch-based Speech Toolkit

speech-recognition speech-toolkit speaker-recognition speech-to-text speech-enhancement speech-separation audio audio-processing speech-processing speechrecognition asr voice-recognition speaker-diarization speaker-verification PyTorch huggingface transformers language-model deep-learning

Python 9.67 k

2 days ago

alphacep / vosk-api

#Android# Vosk is an offline speech recognition toolkit. It supports Python, Java, Node.JS, C#, and C++, and recognizes 20+ languages, including Chinese, English, and French.

speech-recognition asr voice-recognition speech-to-text Android iOS raspberry-pi deep-learning deep-neural-networks speech-to-text-android speaker-verification Python offline privacy kaldi deepspeech vosk stt

Jupyter Notebook 9.23 k

1 month ago

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

PyTorch speech-processing speaker-diarization voice-activity-detection pretrained-models speaker-recognition speaker-verification

Jupyter Notebook 7.25 k

4 days ago

zzw922cn / awesome-speech-recognition-speech-synthesis-papers

#Getting Started# Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

automatic-speech-recognition papers roadmap rnn cnn dnn attention-mechanism seq2seq timit-dataset tts language-model speaker-verification speech-recognition speech-synthesis neural-network diffusion-models singing-voice-synthesis voice-conversion

3.02 k

1 year ago

modelscope / 3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

speaker-diarization speaker-verification language-identification modelscope

Python 1.9 k

25 days ago

Delta-ML / delta

#Front-end Development# DELTA is a deep learning based natural language and speech processing platform. LF AI & DATA Projects: https://lfaidata.foundation/projects/delta/

nlp deep-learning Tensorflow speech sequence-to-sequence seq2seq speech-recognition text-classification speaker-verification nlu text-generation emotion-recognition tensorflow-lite inference asr serving front-end ops

Python 1.59 k

1 month ago

mravanelli / SincNet

#Computer Science# SincNet is a neural architecture for efficiently processing raw audio samples.

deep-learning audio waveform filtering cnn convolutional-neural-networks speaker-recognition speaker-verification speech-recognition asr audio-processing speech-processing digital-signal-processing signal-processing neural-networks artificial-intelligence timit PyTorch Python

Python 1.17 k

4 years ago

clovaai / voxceleb_trainer

In defence of metric learning for speaker recognition

speaker-recognition metric-learning speaker-verification

Python 1.09 k

1 year ago

wenet-e2e / wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

production-ready PyTorch resnet speaker-recognition speaker-verification speaker-diarization repvgg ssl dino wavlm

Python 875

2 months ago

TaoRuijie / ECAPA-TDNN

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 on Vox1_O when trained only on Vox2)

speaker-recognition speaker-verification

Python 667

1 year ago

markovka17 / dla

#Computer Science# Deep learning for audio processing

deep-learning speech-recognition tts signal-processing voice-conversion speaker-verification

Jupyter Notebook 636

4 months ago

HarryVolek / PyTorch_Speaker_Verification

PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.

PyTorch speaker-verification

Python 583

3 years ago

microsoft / UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

PyTorch speech-recognition speech-processing speech diarization speech-separation speaker-verification

Python 455

1 year ago

google / speaker-id

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

speaker-recognition source-separation speaker-diarization speaker-verification

Python 410

13 days ago

Jungjee / RawNet

Official repository for RawNet, RawNet2, and RawNet3

speaker-verification PyTorch

Python 374

1 year ago

speechbrain / speechbrain.github.io

#Computer Science# The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN a...

deep-learning speech-recognition speech-to-text speech speech-processing speaker-recognition speaker-verification speech-separation speechrecognition neural-network neural-networks timit speech-analysis

HTML 365

4 months ago