#计算机科学#DeepSpeech 是一款开源嵌入式(离线、设备上)语音识别引擎,最低可以在树莓派上运行
#安卓#Vosk 是一个离线的语言识别工具。支持 Python, Java, Node.JS, C#, C++ ,能识别20+种语言,包括中文、英语、法语等。
#计算机科学#Examples of how to use or integrate DeepSpeech
#计算机科学#基于PaddlePaddle实现的语音识别,中文语音识别。项目完善,识别效果好。支持Windows,Linux下训练和预测,支持Nvidia Jetson开发板预测。
#计算机科学#Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conformer、Squeezeformer、DeepSpeech2模型,支持多种数据增强方法。
#计算机科学#speech to text benchmark framework
A CLI script to generate subtitle files (SRT/VTT/TXT) for any video using either DeepSpeech or Coqui
#计算机科学#A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
#计算机科学#🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
A testing server for a speech to text service based on coqui.ai
Golang bindings for Mozilla's DeepSpeech speech-to-text library
Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments
#安卓#Android Speech Recognition Service using Vosk/Kaldi and Mozilla DeepSpeech
Tooling for producing Italian model (public release available) for DeepSpeech and text corpus
Traditional ASR (Signal & Cepstral Analysis, DTW, HMM) & DNNs (Custom Models + DeepSpeech) on Indian Accent Speech
An editor for speech-to-text transcripts such as AWS Transcribe and Mozilla DeepSpeech
A MXNet implementation of Baidu's DeepSpeech architecture