voice-conversion · GitHub Topics

#Computer Science# 🐸💬 - a deep learning toolkit for text-to-speech (TTS) synthesis

Python text-to-speech deep-learning speech PyTorch tts vocoder tacotron glow-tts melgan speaker-encoder hifigan speaker-encodings multi-speaker-tts tts-model speech-synthesis voice-cloning voice-synthesis voice-conversion

Python 39.27 k

8 months ago

RVC-Project / Retrieval-based-Voice-Conversion-WebUI

An easy-to-use voice conversion (voice changer) framework based on VITS

change sovits vits voice voice-conversion rvc audio-analysis conversational-ai conversion converter retrieval-model retrieve-data so-vits-svc vc voice-converter voiceconversion

Python 28.61 k

5 months ago

svc-develop-team / so-vits-svc

#Computer Science# SoftVC VITS Singing Voice Conversion

artificial-intelligence audio-analysis Generative Adversarial Network singing-voice-conversion so-vits-svc sovits variational-inference vc vits voice voice-conversion voiceconversion voice-changer flow deep-learning PyTorch speech

Python 26.89 k

1 year ago

espnet / espnet

#Computer Science# End-to-End Speech Processing Toolkit

deep-learning end-to-end chainer PyTorch kaldi speech-recognition speech-synthesis speech-translation machine-translation voice-conversion speech-enhancement speech-separation singing-voice-synthesis speaker-diarization text-to-speech

Python 8.99 k

5 hours ago

voicepaw / so-vits-svc-fork

#Computer Science# A fork of so-vits-svc4.0 (V1) that supports real-time inference and a graphical inference interface, and is compatible with its models.

sovits vits voice-conversion so-vits-svc hubert softvc realtime voice-changer deep-learning PyTorch speech-synthesis Generative Adversarial Network lightning pytorch-lightning Hacktoberfest

Python 8.96 k

5 days ago

open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, ...

audio-generation audio-synthesis audioldm music-generation naturalspeech2 singing-voice-conversion speech-synthesis text-to-audio text-to-speech vall-e voice-conversion audit fastspeech2 vits emilia maskgct vocoder

Python 8.93 k

10 hours ago

abus-aikorea / voice-pro

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isola...

faster-whisper tts Whisper gradio subtitles transcription translator webui speech-recognition speech-synthesis speech-to-text text-to-speech yt-dlp voice-cloning podcasts audiobook voice-conversion karaoke whisperx

Python 3.59 k

14 days ago

zzw922cn / awesome-speech-recognition-speech-synthesis-papers

#Getting Started# Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)