SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
Official repository for LTX-Video
LTX-Video Support for ComfyUI
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high per...
⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation (AAAI 2025 Oral)
OpenMusic: SOTA Text-to-music (TTM) Generation
MoH: Multi-Head Attention as Mixture-of-Head Attention
📖A curated list of Awesome Diffusion Inference Papers with codes: Sampling, Caching, Multi-GPUs, etc. 🎉🎉
NMCN(Niche Multi Channel Network),小眾多頻道網絡,是「同和新媒體矩陣」創始團隊於輿論資本全球化背景下率先提出的一種非營利性的去中心化自媒體聯盟形式,通過聯盟內創作單位的交流互推、共享資源等方式對抗資本侵蝕,在產出卓越作品的同時保障亞文化生存空間,為守護寶貴的非物質文化遺產盡綿薄之力。
CogVideoX-5B 4-bit quantization model
An open source community implementation of the model from the paper: "Movie Gen: A Cast of Media Foundation Models". Join our community to help implement this model!
Official implementation of "JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization"
This is the project for 'Any2Caption', Interpreting Any Condition to Caption for Controllable Video Generation
UK - Great.gov - Export Opportunities - Find and apply for overseas opportunities from businesses looking for products or services like yours.
Serve Text-to-Video Models in Production
从0到1手写基于mnist手写数字数据集的diffusion transformer模型复现