speech-to-text · GitHub Topics

ggml-org / whisper.cpp

OpenAI Whisper语音识别模型，C++移植版本。

openai speech-to-text transformer Whisper inference speech-recognition

C++ 39.18 k

1 天前

mozilla / DeepSpeech

#计算机科学#DeepSpeech 是一款开源嵌入式（离线、设备上）语音识别引擎，最低可以在树莓派上运行

深度学习机器学习 neural-networks Tensorflow speech-recognition speech-to-text deepspeech embedded on-device offline

C++ 26.22 k

7 个月前

leon-ai / leon

🧠 Leon is your open-source personal assistant.

翻译 - 🧠Leon是您的开源个人助理。

leon personal-assistant Node.js Python 人工智能 speech-to-text text-to-speech speech-recognition speech-synthesis flite assistant virtual-assistant 聊天机器人 Bot voice-assistant 自动化 offline 隐私 ai-assistant

TypeScript 16.15 k

9 小时前

SYSTRAN / faster-whisper

#计算机科学#Faster Whisper transcription with CTranslate2

深度学习 inference quantization speech-recognition speech-to-text transformer Whisper openai

Python 15.36 k

23 天前

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

asr speech speech-recognition speech-to-text Whisper

Python 14.93 k

10 小时前

kaldi-asr / kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

翻译 - 这是Kaldi项目的正式所在地。

kaldi C++CUDA Shell speech-recognition speech-to-text speaker-verification speaker-id speech

Shell 14.76 k

2 个月前

jianchang512 / pyvideotrans

Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言，同时支持语音识别转录、语音合成、字幕翻译。

text-to-speech video-transition speech-to-text

Python 12.46 k

5 天前

speechbrain / speechbrain

#计算机科学#A PyTorch-based Speech Toolkit

翻译 - 基于Pytorch的语音工具包

speech-recognition speech-toolkit speaker-recognition speech-to-text speech-enhancement speech-separation audio audio-processing speech-processing speechrecognition asr voice-recognition speaker-diarization speaker-verification PyTorch huggingface transformers language-model 深度学习

Python 9.67 k

1 天前

alphacep / vosk-api

#安卓#Vosk 是一个离线的语言识别工具。支持 Python, Java, Node.JS, C#, C++ ，能识别20+种语言，包括中文、英语、法语等。

speech-recognition asr voice-recognition speech-to-text Android iOS 树莓派深度学习深度神经网络 speech-to-text-android speaker-verification Python offline 隐私 kaldi deepspeech vosk stt

Jupyter Notebook 9.23 k

1 个月前

Uberi / speech_recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

翻译 - 适用于Python的语音识别模块，支持在线和离线的多个引擎和API。

Python audio speech-recognition speech-to-text

Python 8.68 k

19 天前

nl8590687 / ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

Tensorflow cnn ctc Python Keras speech-recognition speech-to-text chinese-speech-recognition asrt

Python 8.08 k

7 个月前

TalAter / annyang

💬 Speech recognition for your site

翻译 - ：speech_balloon：您网站的语音识别

speech-recognition speech speech-to-text voice

JavaScript 6.66 k

8 个月前

KoljaB / RealtimeSTT

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

Python realtime speech-to-text

Python 6.64 k

1 天前

k2-fsa / sherpa-onnx

#安卓#Sherpa-ONNX 是一个轻量级语音识别框架，基于 Kaldi 和 onnxruntime，无需联网即可实现语音转文本、文本转语音、说话人分离以及语音活动检测(VAD)。支持嵌入式系统、安卓、iOS、鸿蒙系统、树莓派、RISC-V、x86_64 服务器、WebSocket 服务器 / 客户端，以及 C/C++、Python、Kotlin、C#、Go、NodeJS、Java、Swift、Dart、JavaScript、Flutter、Object Pascal、Lazarus、Rust 等编程语言。