Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
翻译 - 在Pytorch中实现视觉变压器,这是仅使用一个变压器编码器即可在视觉分类中实现SOTA的简单方法
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
翻译 - 这是“Swin Transformer:Hierarchical Vision Transformer using Shifted Windows”的官方实现。
Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)
翻译 - 收集一些有远见的有关变压器的论文。具有计算机视觉 (CV) 功能的出色 Transformer
Recent Transformer-based CV and related works.
A paper list of some recent Transformer-based CV works.
#计算机科学#Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
翻译 - 在Pytorch中为CNN和视觉变压器实现了许多类激活图方法。包括Grad-CAM,Grad-CAM ++,Score-CAM,Ablation-CAM和XGrad-CAM
Explainability for Vision Transformers
Let's train vision transformers (ViT) for cifar 10!
Vision Transformer (ViT) in PyTorch
Self-supervised vIsion Transformer (SiT)
Vision Transformer Cookbook with Tensorflow
Keras implementation of ViT (Vision Transformer)
Code for the Convolutional Vision Transformer (ConViT)
CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped, CVPR 2022
Implementation of ViViT: A Video Vision Transformer
[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions
Hiera: A fast, powerful, and simple hierarchical vision transformer.
基于pytorch的Vision Transformer
Collect some papers about transformer for detection and segmentation. Awesome Detection Transformer for Computer Vision (CV)
[CVPR 2022] MPViT:Multi-Path Vision Transformer for Dense Prediction
Two simple and effective designs of vision transformer, which is on par with the Swin transformer
The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [TPAMI'23] "ViTPose++: Vision Transformer for Generic Body Pose Estimation"
LeViT a Vision Transformer in ConvNet's Clothing for Faster Inference
翻译 - LeViT是ConvNet服装中的视觉转换器,可加快推理速度