This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
翻译 - 这是“Swin Transformer:Hierarchical Vision Transformer using Shifted Windows”的官方实现。
Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
翻译 - 显示,参加和讲述|PyTorch图像字幕教程
#计算机科学# CVNets: A library for training computer vision networks
This is an official implementation for "Contextual Transformer Networks for Visual Recognition".
翻译 - 这是“用于视觉识别的上下文转换器网络”的官方实现。