#

speech-synthesis

https://static.github-zh.com/github_avatars/NVIDIA-NeMo?size=40

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 15.9 k
10 小时前
https://static.github-zh.com/github_avatars/PaddlePaddle?size=40

PaddleSpeech 是基于飞桨 PaddlePaddle 的语音方向的开源模型库,用于语音和音频中的各种关键任务的开发,典型的应用包括:语音识别、语音翻译、语音合成等

Python 12.29 k
22 天前
https://static.github-zh.com/github_avatars/rhasspy?size=40

A fast, local neural text to speech system

C++ 10.15 k
2 个月前
open-mmlab/Amphion
https://static.github-zh.com/github_avatars/open-mmlab?size=40

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, ...

Python 9.44 k
5 个月前
https://static.github-zh.com/github_avatars/rany2?size=40

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Python 9.21 k
2 个月前
https://static.github-zh.com/github_avatars/jaywalnut310?size=40

#计算机科学#VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 7.71 k
2 年前
https://static.github-zh.com/github_avatars/yl4579?size=40
Python 6 k
1 年前
https://static.github-zh.com/github_avatars/espeak-ng?size=40

#安卓#eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

C 5.71 k
10 天前
https://static.github-zh.com/github_avatars/snakers4?size=40

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Jupyter Notebook 5.52 k
2 年前
abus-aikorea/voice-pro
https://static.github-zh.com/github_avatars/abus-aikorea?size=40

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isola...

Python 4.96 k
15 天前
https://static.github-zh.com/github_avatars/MoonInTheRiver?size=40
Python 4.63 k
7 个月前
https://static.github-zh.com/github_avatars/WhisperSpeech?size=40

An Open Source text-to-speech system built by inverting Whisper.

Jupyter Notebook 4.5 k
4 个月前
loading...
Website
Wikipedia