tts · GitHub Topics

CorentinJ / Real-Time-Voice-Cloning

#计算机科学#Real-Time-Voice-Cloning 是一个基于深度学习的语音合成工具，5秒内即可克隆一个声音。

深度学习 PyTorch Tensorflow tts 声音克隆 Python

Python 54.61 k

1 个月前

RVC-Boss / GPT-SoVITS

强大的少样本语音转换与语音合成Web用户界面。

text-to-speech tts vits voice-clone voice-cloneai 声音克隆

Python 48.25 k

4 天前

unslothai / unsloth

#大语言模型#Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.

finetuning fine-tuning llama 大语言模型 lora mistral qlora gemma llama3 unsloth deepseek deepseek-r1 gemma3 llama-4 llama4 text-to-speech tts qwen qwen3

Python 41.31 k

6 小时前

coqui-ai / TTS

#计算机科学#🐸💬 - 一个深度学习的 TTS 语言合成库

Python text-to-speech 深度学习 speech PyTorch tts vocoder tacotron glow-tts melgan speaker-encoder hifigan speaker-encodings multi-speaker-tts tts-model speech-synthesis 声音克隆 voice-synthesis voice-conversion

Python 41.06 k

10 个月前

2noise / ChatTTS

#大语言模型#ChatTTS是专门为对话场景设计的文本转语音模型，例如LLM助手对话任务。它支持英文和中文两种语言

agent text-to-speech chat ChatGPT chattts 中文 chinese-language english english-language gpt 大语言模型 llm-agent natural-language-inference Python torch tts

Python 36.98 k

1 个月前

babysor / MockingBird

#计算机科学#🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

人工智能 speech PyTorch 深度学习 text-to-speech tts

Python 36.38 k

8 个月前

mudler / LocalAI

#大语言模型#:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, tr...

llama rwkv 人工智能大语言模型 stable-diffusion API Kubernetes gpt4all tts musicgen mamba audio-generation image-generation text-generation gemma mistral llama3 rerank distributed libp2p

Go 33.57 k

1 天前

myshell-ai / OpenVoice

Instant voice cloning by MIT and MyShell. Audio foundation model.

text-to-speech tts voice-clone zero-shot-tts

Python 32.79 k

2 个月前

fishaudio / fish-speech

SOTA Open Source TTS

llama transformer tts valle vits vqgan vqvae

Python 22.13 k

18 天前

NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

machine-translation speaker-recognition asr tts generative-ai multimodal 深度学习 neural-networks speaker-diariazation speech-translation speech-synthesis large-language-models

Python 14.96 k

5 小时前

FunAudioLLM / CosyVoice

#大语言模型#Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

audio-generation gpt-4o text-to-speech tts cantonese 聊天机器人 ChatGPT 中文 english fine-grained fine-tuning japanese korean multi-lingual natural-language-generation Python cosyvoice cross-lingual 声音克隆

Python 14.87 k

2 天前

mastra-ai / mastra

#大语言模型#The TypeScript AI agent framework. ⚡ Assistants, RAG, observability. Supports any LLM: GPT-4, Claude, Gemini, Llama.

agents 人工智能 chatbots JavaScript 大语言模型 Next Node.js React TypeScript workflows evals mcp tts

TypeScript 14.59 k

2 天前

pot-app / pot-desktop

🌈一个跨平台的划词翻译和OCR软件 | A cross-platform software for text translation and recognition.

translation pot Tauri translate pot-app OCR Linux macOS Windows recognize tts

JavaScript 13.39 k

5 天前

PaddlePaddle / PaddleSpeech

PaddleSpeech 是基于飞桨 PaddlePaddle 的语音方向的开源模型库，用于语音和音频中的各种关键任务的开发，典型的应用包括：语音识别、语音翻译、语音合成等

transformer conformer speech-translation streaming-asr speech-alignment punctuation-restoration streaming-tts speech-synthesis tts asr speech-recognition 声音克隆 vocoder voice-recognition self-supervised-learning Whisper

Python 12.04 k

5 天前

DrewThomasson / ebook2audiobook

Generate audiobooks from e-books, voice cloning & 1107+ languages!

audiobooks Docker epub Linux macOS tts Windows xtts 声音克隆 gradio 中文 english multilingual colab-notebook kaggle audiobook

Python 10.38 k

5 小时前

mozilla / TTS

#计算机科学#:robot: 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

深度学习 text-to-speech Python PyTorch tacotron tts speaker-encoder dataset-analysis tacotron2 tensorflow2 vocoder melgan glow-tts speech

Jupyter Notebook 9.89 k

2 年前

rhasspy / piper

A fast, local neural text to speech system

speech-synthesis text-to-speech tts

C++ 9.49 k

5 天前

readest / readest

#安卓#Readest is a modern, feature-rich ebook reader designed for avid readers offering seamless cross-platform access, powerful tools, and an intuitive interface to elevate your reading experience.

ebook ebook-reader epub Next reader Tauri tts Android cross-platform iOS sync

TypeScript 8.98 k

3 天前