#计算机科学#基于 so-vits-svc4.0(V1)的一个分支,支持实时推理和图形化推理界面,且兼容其模型。
#计算机科学#End-to-End Speech Processing Toolkit
翻译 - 端到端语音处理工具包
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, ...
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2E, F5-TTS, CosyVoice), with Whisper audio processing, RVC voice changer, YouTube download,...
#新手入门#Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
A simple, high-quality voice conversion tool focused on ease of use and performance.
#数据仓库#🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
zero-shot voice conversion & singing voice conversion, with real-time support
#计算机科学#This is now the official location of the Merlin project.
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
#计算机科学#NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
可本地部署的AI语音工具箱 | A user-friendly audio toolkit for voice recognition, voice transcription, voice conversion etc.
The code for the bark-voicecloning model. Training and inference.
Unsupervised Speech Decomposition Via Triple Information Bottleneck
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
singing voice change based on whisper, and lora for singing voice clone