The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT)...
翻译 - PyTorch图像模型,脚本,预训练权重-(SE)ResNet / ResNeXT,DPN,EfficientNet,MixNet,MobileNet-V3 / V2,MNASNet,单路径NAS,FBNet等
Multi-label classification based on timm.
#计算机科学#Florence-2 is a novel vision foundation model with a unified, prompt-based representation for a variety of computer vision and vision-language tasks.
Multi-label classification based on timm, and add SimCLR to timm.
#计算机科学#Solution for NeurIPS 2023 - MedFM Challenge
#计算机科学#This project focuses on evaluating Convolutional Neural Networks (CNN) and Vision Transformers (ViT) for image classification tasks, specifically distinguishing between Asian elephants and African ele...
Code for the base version of the the model vision transformer in pytorch.