Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, ...
微软VALL-E X 零样本语音合成模型的开源实现
An unofficial PyTorch implementation of the audio LM VALL-E
#大语言模型#PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
OpenMusic: SOTA Text-to-music (TTM) Generation
AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
An unofficial PyTorch implementation of VALL-E
Applying deep learning to translate animation and re-generate audio.