Experimental playground for benchmarking language model (LM) architectures, layers, and tricks on smaller datasets. Designed for flexible experimentation and exploration.
machine-learning deep-learning transformers pytorch gpt attention-mechanisms gpt-2 position-embedding large-language-models llm llm-training hymba
Updated Dec 3, 2024 - Python