automatic-speech-recognition

wenet-e2e / wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

翻译 - 生产优先和生产就绪的端到端语音识别工具包

e2e-models PyTorch asr transformer conformer production-ready automatic-speech-recognition speech-recognition Whisper

Python 4.44 k

15 天前

zzw922cn / awesome-speech-recognition-speech-synthesis-papers

#新手入门#Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

automatic-speech-recognition papers 路线图 rnn cnn dnn attention-mechanism seq2seq timit-dataset tts language-model speaker-verification speech-recognition speech-synthesis 神经网络 diffusion-models singing-voice-synthesis voice-conversion

3.02 k

1 年前

zzw922cn / Automatic_Speech_Recognition

#计算机科学#End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow

翻译 - Tensorflow中的英语和英语的端到端自动语音识别

automatic-speech-recognition Tensorflow timit-dataset feature-vector phonemes data-preprocessing rnn audio 深度学习 lstm end-to-end cnn evaluation Bukkit speech-recognition chinese-speech-recognition

Python 2.84 k

2 年前

ahmetoner / whisper-asr-webservice

OpenAI Whisper ASR Webservice API

automatic-speech-recognition speech-recognition speech-to-text openai-whisper Docker asr speech

Python 2.52 k

2 个月前

coqui-ai / STT

#计算机科学#🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

翻译 - TSTT-用于语音转文本的深度学习工具包，在研发和生产中经过了实战测试

stt speech-to-text Tensorflow 深度学习 automatic-speech-recognition asr voice-recognition speech-recognition

C++ 2.41 k

1 年前

kakaobrain / pororo

#自然语言处理#PORORO: Platform Of neuRal mOdels for natuRal language prOcessing

翻译 - Pororo：基于深度学习的多语言自然语言处理库

深度学习自然语言处理 automatic-speech-recognition speech-synthesis neural-models

Python 1.3 k

3 年前

TensorSpeech / TensorFlowASR

⚡ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords

automatic-speech-recognition speech-recognition speech-to-text tensorflow2 rnn-transducer conformer tflite ctc Tensorflow

Python 965

1 天前

FireRedTeam / FireRedASR

#大语言模型#Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics recogn...

asr 大语言模型 Open Source speech-recognition automatic-speech-recognition conformer speechllm transformer

Python 873

17 天前