A small transformer implementation.
It can be configured from a config file. The project is designed as a small benchmark of the SwiGLU activation against ReLU in the transformer's feed-forward layers, and of grouped-query attention (GQA) against standard multi-head attention (MHA). GQA shares each key/value head across a group of query heads, whereas MHA gives every query head its own key/value pair.
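For context, SwiGLU replaces the feed-forward block's ReLU(xW1)W2 with a gated form: SiLU(xW1) multiplied elementwise by xW3, projected back down by W2. A minimal sketch, assuming a PyTorch implementation; the class and parameter names here are illustrative, not the repo's actual API:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SwiGLU(nn.Module):
    """SwiGLU feed-forward block: W2(SiLU(x W1) * (x W3)).

    A toy sketch for illustration; the module in this repo may differ.
    """
    def __init__(self, d_model: int, d_hidden: int):
        super().__init__()
        self.w1 = nn.Linear(d_model, d_hidden, bias=False)  # gate projection
        self.w3 = nn.Linear(d_model, d_hidden, bias=False)  # value projection
        self.w2 = nn.Linear(d_hidden, d_model, bias=False)  # output projection

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # SiLU (a.k.a. Swish) gates the value projection elementwise.
        return self.w2(F.silu(self.w1(x)) * self.w3(x))
```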
In config.json, "SwiGLU" selects SwiGLU, "ReLU" selects ReLU, "GQA" selects GQA, and "MHA" selects MHA. The values must be exactly those strings; anything else leaves the behaviour undefined.
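For example, a config.json selecting SwiGLU and GQA might look like this. Only the four string values above are defined by the project; the key names "activation" and "attention" are assumptions for illustration:

```json
{
  "activation": "SwiGLU",
  "attention": "GQA"
}
```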
The generator file provides a prompt-response loop for generating text from the transformer.
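A sketch of what that loop might look like; the entry points (`load_model`, `generate`) and checkpoint path are hypothetical placeholders, not the repo's actual API:

```python
# Hypothetical usage; the actual functions in the generator file may differ.
from generator import load_model, generate

model = load_model("checkpoint.pt")
while True:
    prompt = input("> ")
    if not prompt:  # empty prompt ends the session
        break
    print(generate(model, prompt, max_new_tokens=128))
```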
The tokeniser file can be used to train a byte-pair-encoding (BPE) tokeniser.
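To illustrate the idea, here is a minimal sketch of learning BPE merges from raw text: tokens start as single characters and the most frequent adjacent pair is merged at each step. The repo's tokeniser file may structure this differently:

```python
from collections import Counter

def train_bpe(text: str, num_merges: int):
    """Learn up to `num_merges` BPE merge rules from raw text."""
    tokens = list(text)  # start from single characters
    merges = []
    for _ in range(num_merges):
        pairs = Counter(zip(tokens, tokens[1:]))  # count adjacent pairs
        if not pairs:
            break
        best = max(pairs, key=pairs.get)  # most frequent pair
        merges.append(best)
        # Apply the merge in one left-to-right pass.
        merged, i = [], 0
        while i < len(tokens):
            if i + 1 < len(tokens) and (tokens[i], tokens[i + 1]) == best:
                merged.append(tokens[i] + tokens[i + 1])
                i += 2
            else:
                merged.append(tokens[i])
                i += 1
        tokens = merged
    return merges

print(train_bpe("low lower lowest", num_merges=5))
```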