DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing
Lumina-T2X is a unified framework for Text to Any Modality Generation
MotionDiffuse: Text-Driven Human Motion Generation with Diffusion Model