Instant voice cloning by MIT and MyShell. Audio foundation model.
#计算机科学#Foundational model for human-like, expressive TTS
Daily tracking of awesome audio papers, including music generation, zero-shot tts, asr, audio generation
Automated voice dubbing for YouTube videos using Docker, OpenVoice, and FastAPI. Translates and dubs videos with original voice timbre.
Dockerized Voicecraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild
Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis