vision-transformer-models

Here are 7 public repositories matching this topic...

The largest collection of PyTorch image encoders / backbones, including train, eval, inference, and export scripts plus pretrained weights: ResNet, ResNeXt, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNetV3 and V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more.

  • Updated Dec 24, 2024
  • Python

This project evaluates Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) on an image classification task: distinguishing Asian elephants from African elephants.

  • Updated Apr 8, 2024
  • Jupyter Notebook
