GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub

编程语言

”speech“ 的搜索结果

ASRT_SpeechRecognition
@nl8590687

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

TensorflowcnnctcPythonKeras
Python8.16 k
9 个月前🇨🇳

相关主题

speech-recognitionspeech-to-textspeech-synthesisttsspeech-translationspeechtext-to-speechPythonasrspeech-enhancement

Google   Bing   GitHub

Mozilla
DeepSpeech存档
Mozilla@mozilla

#计算机科学#DeepSpeech 是一款开源嵌入式(离线、设备上)语音识别引擎,最低可以在树莓派上运行

深度学习机器学习neural-networksTensorflowspeech-recognition
C++26.5 k
14 天前
speech
@awni

A PyTorch Implementation of End-to-End Models for Speech-to-Text

Python759
2 年前
fish-speech
@fishaudio

SOTA Open Source TTS

llamatransformerttsvallevits
Python22.17 k
20 小时前
speechbrain
@speechbrain

#计算机科学#A PyTorch-based Speech Toolkit

speech-recognitionspeech-toolkitspeaker-recognitionspeech-to-textspeech-enhancement
Python10.09 k
19 小时前
Anthony Zhang
speech_recognition
Anthony Zhang@Uberi

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Pythonaudiospeech-recognitionspeech-to-text
Python8.78 k
1 个月前
飞桨
PaddleSpeech
@PaddlePaddle • 百度

PaddleSpeech 是基于飞桨 PaddlePaddle 的语音方向的开源模型库,用于语音和音频中的各种关键任务的开发,典型的应用包括:语音识别、语音翻译、语音合成等

transformerconformerspeech-translationstreaming-asrspeech-alignment
Python12.04 k
7 天前
Hugging Face
speech-to-speech
Hugging Face@huggingface

#计算机科学#Speech To Speech: an effort for an open-sourced and modular GPT4-o

人工智能assistantlanguage-model机器学习Python
Python4.09 k
3 个月前
OpenAI
whisper
OpenAI@openai

whisper 是一个通用语音识别模型

Python84.27 k
7 天前
ponyzhang
awesome-speech-recognition-speech-synthesis-papers
ponyzhang@zzw922cn

#新手入门#Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

automatic-speech-recognitionpapers路线图rnncnn
3.05 k
2 年前
mlx-audio
@Blaizzy

A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.

apple-siliconaudio-processingMLXmultimodalspeech-recognition
Python2.44 k
23 天前
VoiceCraft
@jasonppy

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook8.31 k
4 个月前
awesome-speech-enhancement
@WenzheLiu-Speech

speech enhancement\speech seperation\sound source localization

1.15 k
2 年前
csm
@SesameAILabs

A Conversational Speech Generation Model

Python13.61 k
1 个月前
NVIDIA Corporation
NeMo
NVIDIA Corporation@NVIDIA

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

machine-translationspeaker-recognitionasrttsgenerative-ai
Python14.98 k
13 小时前
ChatTTS
@2noise

#大语言模型#ChatTTS是专门为对话场景设计的文本转语音模型,例如LLM助手对话任务。它支持英文和中文两种语言

agenttext-to-speechchatChatGPTchattts
Python37.01 k
1 个月前
cmusphinx
pocketsphinx
cmusphinx@cmusphinx

A small speech recognizer

speech-recognitionPythonC
C4.14 k
17 天前
subsync存档
@sc0ty

Subtitle Speech Synchronizer

synchronizationsubtitlesspeech-recognition
C++1.37 k
9 个月前
Alex Gotev
android-speech
Alex Gotev@gotev

#安卓#Android speech recognition and text to speech made easy

Androidspeechrecognitiontts
Java524
4 年前
voicefixer
@haoheliu

General Speech Restoration

speech-processingspeech-synthesisspeech-enhancementspeech-analysisspeech
Python1.18 k
5 个月前
speech-aligner
@open-speech

speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription

speechkaldiC++
C++400
5 年前
StreamSpeech
@ictnlp

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

speechspeech-recognitionspeech-synthesis
Python1.11 k
4 天前
espnet
@espnet

#计算机科学#End-to-End Speech Processing Toolkit

深度学习end-to-endchainerPyTorchkaldi
Python9.26 k
1 天前
deepspeech.pytorch
@SeanNaren

Speech Recognition using DeepSpeech2.

Python2.12 k
3 年前
botium-speech-processing
@codeforequity-at

Botium Speech Processing

botiumspeech-to-texttext-to-speech
JavaScript943
3 天前
阿里巴巴
Alibaba-MIT-Speech
阿里巴巴@alibaba

Alibaba speech technology

915
7 年前
loading...