GFPGAN 是腾讯开源的人脸修复算法,它利用预先训练好的面部 GAN(如 StyleGAN2)中封装的丰富和多样的先验因素进行盲脸 (blind face) 修复
PhotoMaker [CVPR 2024]
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
T2I-Adapter
Real-ESRGAN 的目标是开发出实用的图像修复算法
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
Open-Sora: 完全开源的高效复现类Sora视频生成方案
#计算机科学#Stable Diffusion web UI
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Bringing Old Photo Back to Life (CVPR 2020 oral)
翻译 - 重现旧照片(CVPR 2020口头)
#大语言模型#利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
PhotoMaker [CVPR 2024]
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
Zero-Shot Speech Editing and Text-to-Speech in the Wild
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
上传截图通过GPT生成HTML/Tailwind/JavaScript代码
Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also sup...
翻译 - 开源图像和视频恢复工具箱,特别适用于超分辨率,包括EDSR,RCAN,SRResNet,SRGAN,ESRGAN,EDVR等。还支持StyleGAN2,DFDNet。