mosaicml / composer (Star 5.3k)
Supercharge Your Model Training
Topics: machine-learning, deep-learning, neural-network, pytorch, neural-networks, ml-training, ml-systems, ml-efficiency
Updated Jan 29, 2025. Language: Python
MyDarapy / SmolLM-experiments-with-grouped-query-attention (Star 1)
(Unofficial) Building Hugging Face SmolLM, a blazingly fast and small language model, with a PyTorch implementation of grouped query attention (GQA)
Topics: transformer, attention, smol, huggingface, ml-efficiency, llm, grouped-query-attention, smol-lm, huggingface-smol-lm
Updated Jan 11, 2025. Language: Python