This repository contains demos I made with the Transformers library by HuggingFace.
翻译 - 这个存储库包含我用 HuggingFace 的 Transformers 库制作的演示。
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simp...
#Awesome#An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
SwinIR: Image Restoration Using Swin Transformer (official repository)
翻译 - SwinIR:使用 Swin Transformer 的图像恢复
Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.
翻译 - [CVPR2020]超越MobileNetV3:“ GhostNet:廉价运营带来的更多功能”
#计算机科学#Scenic: A Jax Library for Computer Vision Research and Beyond
翻译 - Scenic:用于计算机视觉研究及其他领域的 Jax 库
Efficient vision foundation models for high-resolution generation and perception.
EVA Series: Visual Representation Fantasies from BAAI
An all-in-one toolkit for computer vision
This is a collection of our NAS and Vision Transformer work.
翻译 - [NeurIPS'20]作物的精华:为一击式神经结构搜索提炼优先路径
VRT: A Video Restoration Transformer (official repository)
翻译 - VRT:视频恢复变压器
#网络爬虫#Extract clean markdown from PDFs, URLs, Word docs, slides, videos, and more, ready for any LLM. ⚡
A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
[NeurIPS 2021] You Only Look at One Sequence
翻译 - 你只看一个序列 (https://arxiv.org/abs/2106.00666)
#计算机科学#Official PyTorch implementation of VoxFormer [CVPR 2023 Highlight]
This is an official implementation for "Contextual Transformer Networks for Visual Recognition".
翻译 - 这是“用于视觉识别的上下文转换器网络”的官方实现。
#计算机科学#[ICML 2023] Official PyTorch implementation of Global Context Vision Transformers