🚀🚀🚀A curated list of papers on controllable video generation.
Conditional Transformer Language Model for Controllable Generation
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable mu...
[CVPR 2025] Learning Flow Fields in Attention for Controllable Person Image Generation
Unified Controllable Visual Generation Model
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021)
Code for Text2Human (SIGGRAPH 2022). Paper: Text2Human: Text-Driven Controllable Human Image Generation
Mustango: Toward Controllable Text-to-Music Generation
Pytorch implementation for Controllable Text-to-Image Generation.
Transformer-based Conditional Variational Autoencoder for Controllable Story Generation
[CVPR 2023] LayoutDM: Discrete Diffusion Model for Controllable Layout Generation
Step1X-3D: Towards High-Fidelity and Controllable Generation of Textured 3D Assets
#Awesome#A collection of resources on controllable generation with text-to-image diffusion models.
[ICLR 2025] Codebase for "CtrLoRA: An Extensible and Efficient Framework for Controllable Image Generation"
[ICLR 2024] Official pytorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation"
Disentangled and Controllable Face Image Generation via 3D Imitative-Contrastive Learning (CVPR 2020 Oral)
Papers and resources on Controllable Generation using Diffusion Models, including ControlNet, DreamBooth, IP-Adapter.
Official Implementation of "Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models"
Official implementation of "DiffuseVAE: Efficient, Controllable and High-Fidelity Generation from Low-Dimensional Latents"
[ACM MM 2021 Best Paper Award] Video Background Music Generation with Controllable Music Transformer
[ICML 2023] Official PyTorch Implementation of "Hierarchical Neural Coding for Controllable CAD Model Generation".