Production First and Production Ready End-to-End Speech Recognition Toolkit
翻译 - 生产优先和生产就绪的端到端语音识别工具包
#新手入门#Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
#计算机科学#End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
翻译 - Tensorflow中的英语和英语的端到端自动语音识别
OpenAI Whisper ASR Webservice API
#计算机科学#🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
翻译 - TSTT-用于语音转文本的深度学习工具包,在研发和生产中经过了实战测试
#自然语言处理#PORORO: Platform Of neuRal mOdels for natuRal language prOcessing
翻译 - Pororo:基于深度学习的多语言自然语言处理库
⚡ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
Open STT
翻译 - 俄罗斯开放式STT数据集
#大语言模型#Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics recogn...
Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
On-device streaming speech-to-text engine powered by deep learning
End-to-end ASR/LM implementation with PyTorch
#Awesome#This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
一个执着于让CPU\端侧-Model逼近GPU-Model性能的项目,CPU上的实时率(RTF)小于0.1
On-device speech-to-text engine powered by deep learning
HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools
🔉 Youtube Videos Transcription with OpenAI's Whisper