Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, ...
A webui for different audio related Neural Networks
OpenMusic: SOTA Text-to-music (TTM) Generation
#大语言模型#(Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on 3 languages
Text prompt steered synthetic audio generators
Code for Investigating Personalization Methods in Text to Music Generation
A comprehensive, click to install, fully open-source, Video + Audio Generation AIO Toolkit using advanced prompt engineering plus the power of CogVideox + AudioLDM2 + Python!
#大语言模型#Generative AI version of the GeoGuesser game.
Workshop for Multimodale media generator