Generative models for conditional audio generation
#计算机科学#🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
#计算机科学#Audio generation using diffusion models, in PyTorch.
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable mu...
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, ...
SampleRNN: An Unconditional End-to-End Neural Audio Generation Model
A family of diffusion models for text-to-audio generation.
Implementation of DiffWave and SaShiMi audio generation models
#计算机科学#Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch
#计算机科学#A timeline of the latest AI models for audio generation, starting in 2023!
Code for Talking Face Generation by Adversarially Disentangled Audio-Visual Representation (AAAI 2019)
翻译 - 对抗性纠缠的视听表示产生人脸的代码(AAAI 2019)
PyTorch implementation of SampleRNN: An Unconditional End-to-End Neural Audio Generation Model
Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021)
#计算机科学#Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
Code for One-shot Talking Face Generation from Single-speaker Audio-Visual Correlation Learning (AAAI 2022)
Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms
#多媒体#Audacity 是一款跨平台的音频编辑软件,用于录音和编辑音频