OpenAI Whisper语音识别模型,C++移植版本。
#计算机科学# DeepSpeech 是一款开源嵌入式(离线、设备上)语音识别引擎,最低可以在树莓派上运行
🧠 Leon is your open-source personal assistant.
翻译 - 🧠Leon是您的开源个人助理。
#计算机科学# Faster Whisper transcription with CTranslate2
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
A PyTorch-based Speech Toolkit
翻译 - 基于Pytorch的语音工具包
#计算机科学# End-to-End Speech Processing Toolkit
翻译 - 端到端语音处理工具包
#安卓# Vosk 是一个离线的语言识别工具。支持 Python, Java, Node.JS, C#, C++ ,能识别20+种语言,包括中文、英语、法语等。
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
#计算机科学# Facebook AI Research's Automatic Speech Recognition Toolkit
翻译 - Facebook AI Research的自动语音识别工具包
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
翻译 - Silero模型:经过预先训练的STT模型和基准测试非常简单
Production First and Production Ready End-to-End Speech Recognition Toolkit
翻译 - 生产优先和生产就绪的端到端语音识别工具包
A small speech recognizer
翻译 - PocketSphinx是一种轻量级的语音识别引擎,尽管在台式机上同样出色,但专为手持设备和移动设备进行了优化
#IOS# On-device Speech Recognition for Apple Silicon
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
#大语言模型# Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
#自然语言处理# Machine Learning Resources, Practice and Research
翻译 - 机器学习资源,实践与研究
#计算机科学# Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative