From 4048e22b5ae07749910fe8a9985a81dacd60423d Mon Sep 17 00:00:00 2001 From: Abhas Kumar Sinha <31654395+abhaskumarsinha@users.noreply.github.com> Date: Wed, 2 Aug 2023 00:04:21 +0530 Subject: [PATCH] Update README.md --- README.md | 5 ++++- 1 file changed, 4 insertions(+), 1 deletion(-) diff --git a/README.md b/README.md index b409d43..f802990 100644 --- a/README.md +++ b/README.md @@ -1,5 +1,5 @@ # MinimalGPT: The 'Tiniest and Simplest GPT Model' - + MinimalGPT Logo @@ -8,6 +8,9 @@ [[`GPT-1 Paper`](https://s3-us-west-2.amazonaws.com/openai-assets/research-covers/language-unsupervised/language_understanding_paper.pdf)] [[`1002 short stories from project guttenberg`](https://www.kaggle.com/datasets/shubchat/1002-short-stories-from-project-guttenberg)] [[`logo.com`](https://wwww.logo.com/)] [[`Transformer - Paper`](https://arxiv.org/abs/1706.03762)] [[`Huggingface Transformers`](https://huggingface.co/docs/transformers/index)] [[`TensorFlow`](https://www.tensorflow.org/)] [[`BPE Tokenizer: subword-nmt`](https://github.com/rsennrich/subword-nmt)]

MinimalGPT is a concise, adaptable, and streamlined code framework that encompasses the essential components necessary for the construction, training, inference, and fine-tuning of the GPT model. This framework is implemented exclusively using Keras and TensorFlow, ensuring compatibility and coherence within the broader deep learning ecosystem.

+

+

NEW: CPU/GPU/TPU Support and support for loading big file datasets!

+

Code Specifications