A minimal implementation of a transformer encoder for text classification, a transformer decoder for text generation, and a ViT for image classification (a diffusion transformer for image generation is in this repo).
The .py file contains code for:
- Word, character, and BPE tokenizers, with vocabulary generation,
- Dataset construction for text generation and text classification,
- Text and image embeddings,
- The encoder, decoder, and ViT models, with modules shared across them as much as possible,
- Training and evaluation, shared across all three tasks.
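As a rough illustration of the tokenizer/vocabulary step above, a character-level tokenizer can be sketched as follows. This is a minimal sketch only; the class and method names (`CharTokenizer`, `encode`, `decode`) are illustrative assumptions, not the repo's actual API.

```python
# Hypothetical character-level tokenizer with vocabulary generation.
# Names and special tokens are illustrative, not the repo's actual API.
class CharTokenizer:
    def __init__(self, corpus, specials=("<pad>", "<unk>")):
        # Build the vocabulary from all characters seen in the corpus.
        chars = sorted(set("".join(corpus)))
        self.itos = list(specials) + chars
        self.stoi = {ch: i for i, ch in enumerate(self.itos)}
        self.unk = self.stoi["<unk>"]

    def encode(self, text):
        # Map each character to its id; unknown characters map to <unk>.
        return [self.stoi.get(ch, self.unk) for ch in text]

    def decode(self, ids):
        return "".join(self.itos[i] for i in ids)

tok = CharTokenizer(["hello world"])
ids = tok.encode("hello")
assert tok.decode(ids) == "hello"
```

Word and BPE tokenizers follow the same encode/decode interface, differing only in how the vocabulary is built.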
The .ipynb files minimally illustrate training and evaluating the models on toy datasets (including MNIST for ViT) with lightweight transformers. The code in the .py file, however, should also support training scaled-up models on large datasets.
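The core operation shared by the encoder, decoder, and ViT is scaled dot-product attention; only the masking differs (the decoder adds a causal mask). A minimal NumPy sketch, assuming single-head attention and a `(seq_len, d_k)` layout (the repo's actual module names and tensor shapes may differ):

```python
import numpy as np

def scaled_dot_product_attention(q, k, v, causal=False):
    # q, k, v: (seq_len, d_k). Illustrative sketch, not the repo's code.
    d_k = q.shape[-1]
    scores = q @ k.T / np.sqrt(d_k)            # (seq_len, seq_len)
    if causal:
        # Decoder-style mask: position i may not attend to j > i.
        mask = np.triu(np.ones_like(scores, dtype=bool), k=1)
        scores = np.where(mask, -np.inf, scores)
    # Numerically stable softmax over the last axis.
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ v

q = k = v = np.eye(4)
out = scaled_dot_product_attention(q, k, v, causal=True)
# With a causal mask, the first position attends only to itself.
assert np.allclose(out[0], v[0])
```

Sharing this single attention function across the encoder (no mask), decoder (causal mask), and ViT (no mask, patch tokens) is what keeps the three models' module code largely common.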
References: