A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
#计算机科学#DeepSpeech 是一款开源嵌入式(离线、设备上)语音识别引擎,最低可以在树莓派上运行
Speech recognition module for Python, supporting several engines and APIs, online and offline.
翻译 - 适用于Python的语音识别模块,支持在线和离线的多个引擎和API。
A PyTorch-based Speech Toolkit
翻译 - 基于Pytorch的语音工具包
PaddleSpeech 是基于飞桨 PaddlePaddle 的语音方向的开源模型库,用于语音和音频中的各种关键任务的开发,典型的应用包括:语音识别、语音翻译、语音合成等
Speech To Speech: an effort for an open-sourced and modular GPT4-o
whisper 是一个通用语音识别模型
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
speech enhancement\speech seperation\sound source localization
You can find the speech algorithms you want here
A small speech recognizer
翻译 - PocketSphinx是一种轻量级的语音识别引擎,尽管在台式机上同样出色,但专为手持设备和移动设备进行了优化
Android speech recognition and text to speech made easy
Botium Speech Processing
翻译 - t语音处理
Alibaba speech technology
Speech Recognition using DeepSpeech2.
General Speech Restoration
Speech recognition
💬 Speech recognition for your site
翻译 - :speech_balloon:您网站的语音识别
A Vue2 Streaming Speech Recognition Speech to text with Google Cloud Speech
speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding