ViT

Category: Computer Vision

Framework: PyTorch

Dataset: Custom

Created: June 20, 2024

GitHub: View Implementation

Overview

From-scratch implementation of ViT

Implemented a Vision Transformer (ViT) architecture from scratch in PyTorch and trained it on a subset of the Food-101 dataset.
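
As a rough illustration of the first step in a ViT, the sketch below works out how an image is cut into patches and what sequence shape the Transformer encoder receives. The image size, patch size, and embedding dimension are the ViT-Base/16 defaults and are assumptions here; this project's actual hyperparameters may differ. The function name `vit_sequence_shape` is hypothetical.

```python
def vit_sequence_shape(image_size=224, patch_size=16, channels=3, embed_dim=768):
    """Return patch count, flattened patch length, and encoder input shape.

    Defaults are the ViT-Base/16 values (an assumption, not taken from this project).
    """
    if image_size % patch_size != 0:
        raise ValueError("image_size must be divisible by patch_size")
    patches_per_side = image_size // patch_size
    num_patches = patches_per_side ** 2
    # Each patch is flattened to patch_size * patch_size * channels values
    flat_patch_len = patch_size * patch_size * channels
    # One extra token for the learnable class embedding prepended to the patches
    seq_len = num_patches + 1
    return num_patches, flat_patch_len, (seq_len, embed_dim)


num_patches, flat_patch_len, encoder_input = vit_sequence_shape()
print(num_patches, flat_patch_len, encoder_input)  # 196 768 (197, 768)
```

With these assumed settings, each 224x224 RGB image becomes 196 patches, and each patch is linearly projected to an embedding before the encoder blocks.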

Dataset (Train): Subset of Food-101 (3 classes, 255 images total)
Dataset (Test): Subset of Food-101 (3 classes, 75 images total)

Training loss: 1.20
Test loss: 1.52
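
For context on these numbers, here is a quick calculation, assuming the reported losses are mean cross-entropy over the 3 classes (the source does not name the loss function). A model that assigns equal probability to every class scores ln(3). The helper name `uniform_cross_entropy` is hypothetical.

```python
import math


def uniform_cross_entropy(num_classes):
    """Cross-entropy (in nats) of a uniform prediction over num_classes."""
    return -math.log(1.0 / num_classes)


baseline = uniform_cross_entropy(3)
print(f"{baseline:.4f}")  # 1.0986
```

Comparing a reported loss against this baseline is a quick way to gauge how far training has progressed.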

📁 GitHub Repository: ViT

View the complete implementation, training scripts, and documentation on GitHub.