Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, ...
#计算机科学#Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
#计算机科学#[CVPR 2025] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
Official PyTorch implementation of BigVGAN (ICLR 2023)
#计算机科学#Deep Convolutional Neural Networks for Musical Source Separation
A soundfont editor for quickly designing musical instruments.
openFrameworks addon for audio synthesis and generative music
Pytorch implementation of BigVSAN
Library for pure Rust advanced audio synthesis.
A python toolkit for automatic audio/MIDI rendering using REAPER
Pythonic audio processing and generation framework
#计算机科学#jazznet dataset of piano patterns for music audio machine learning research
Interactive audio in Jupyter
A creative coding library.
Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the YourTTS TTS model to clone and generate realistic audio waves
Text prompt steered synthetic audio generators
#计算机科学#Deep Performer: Score-to-audio music performance synthesis
Really-Real Time FM Tone Transfer Audio Pluigin
Wavetable creation and manupilation tool