speech-recognition · GitHub Topics

#Natural Language Processing# State-of-the-art natural language processing for Jax, PyTorch, and TensorFlow

natural-language-processing PyTorch pytorch-transformers transformer model-hub pretrained-models speech-recognition Hacktoberfest Python machine-learning deep-learning audio deepseek gemma glm large-language-models qwen vlm

Python 146.86 k

7 hours ago

ggml-org / whisper.cpp

A C++ port of OpenAI's Whisper speech recognition model.

openai speech-to-text transformer Whisper inference speech-recognition

C++ 41.47 k

9 hours ago

mozilla / DeepSpeech

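#Computer Science# DeepSpeech is an open-source embedded (offline, on-device) speech recognition engine that can run on hardware as small as a Raspberry Pi.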

deep-learning machine-learning neural-networks Tensorflow speech-recognition speech-to-text deepspeech embedded on-device offline

C++ 26.53 k

24 days ago

SYSTRAN / faster-whisper

#Computer Science# Faster Whisper transcription with CTranslate2

deep-learning inference quantization speech-recognition speech-to-text transformer Whisper openai

Python 17 k

1 month ago

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

asr speech speech-recognition speech-to-text Whisper

Python 16.72 k

11 days ago

leon-ai / leon

🧠 Leon is your open-source personal assistant.

leon personal-assistant Node.js Python artificial-intelligence speech-to-text text-to-speech speech-recognition speech-synthesis flite assistant virtual-assistant chatbot Bot voice-assistant automation offline privacy ai-assistant

TypeScript 16.47 k

6 days ago

kaldi-asr / kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

kaldi C++ CUDA Shell speech-recognition speech-to-text speaker-verification speaker-id speech

Shell 14.97 k

2 months ago

NVIDIA / DeepLearningExamples

#Natural Language Processing# Deep learning examples

computer-vision deep-learning drug-discovery forecasting large-language-models mxnet paddlepaddle PyTorch recommender-systems speech-recognition speech-synthesis Tensorflow tensorflow2 translation natural-language-processing

Jupyter Notebook 14.39 k

1 year ago

alphacep / vosk-api

#Android# Vosk is an offline speech recognition toolkit. It supports Python, Java, Node.JS, C#, and C++, and recognizes 20+ languages, including Chinese, English, and French.

Jupyter Notebook 12.68 k

1 day ago

kmario23 / deep-learning-drizzle

#Natural Language Processing# Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!

machine-learning deep-learning deep-neural-networks pattern-recognition computer-vision optimization visual-recognition reinforcement-learning deep-reinforcement-learning natural-language-processing artificial-neural-networks artificial-intelligence-algorithms bayesian-statistics speech-recognition graph-neural-networks medical-imaging geometric-deep-learning explainable-ai probability

HTML 12.61 k

9 months ago

PaddlePaddle / PaddleSpeech

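PaddleSpeech is an open-source speech model library built on PaddlePaddle for developing key speech and audio tasks. Typical applications include speech recognition, speech translation, and speech synthesis.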

transformer conformer speech-translation streaming-asr speech-alignment punctuation-restoration streaming-tts speech-synthesis tts asr speech-recognition voice-cloning vocoder voice-recognition self-supervised-learning Whisper

Python 12.07 k

17 days ago

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

conformer PyTorch speech-recognition paraformer punctuation speaker-diarization rnnt audio-visual-speech-recognition pretrained-model voice-activity-detection Whisper dfsmn vad speechgpt speechllm

Python 11.47 k

9 days ago