#计算机科学# A collection of resources and papers on Diffusion Models
HunyuanVideo: A Systematic Framework For Large Video Generation Model
#自然语言处理#OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
翻译 - OpenVINO™工具包存储库
#计算机科学#MMagic (Multimodal Advanced, Generative, and Intelligent Creation) 是一个供专业人工智能研究人员和机器学习工程师去处理、编辑和生成图像与视频的开源 AIGC 工具箱
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultr...
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
#计算机科学#StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
#计算机科学#SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
A curated list of recent diffusion models for video generation, editing, and various other applications.
#大语言模型#《Pytorch实用教程》(第二版)无论是零基础入门,还是CV、NLP、LLM项目应用,或是进阶工程化部署落地,在这里都有。相信在本书的帮助下,读者将能够轻松掌握 PyTorch 的使用,成为一名优秀的深度学习工程师。
Diffusion model papers, survey, and taxonomy
Official repository for LTX-Video
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
#新手入门#Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Lumina-T2X is a unified framework for Text to Any Modality Generation
#计算机科学#A general fine-tuning kit geared toward diffusion models.