This is a simple PyTorch implementation of Vision Transformer (ViT) described in the paper "An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale"
-
Updated
Mar 6, 2023 - Python
This is a simple PyTorch implementation of Vision Transformer (ViT) described in the paper "An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale"
Simple, Small and Efficient Algorithms which takes less memory and run faster to help us in Daily life Embedded Systems Development
Add a description, image, and links to the simple-implementations topic page so that developers can more easily learn about it.
To associate your repository with the simple-implementations topic, visit your repo's landing page and select "manage topics."