Skip to content
Change the repository type filter

All

    Repositories list

    • gpt-neox

      Public
      An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
      Python
      Apache License 2.0
      1k7.1k6223Updated Feb 1, 2025Feb 1, 2025
    • Jupyter Notebook
      Apache License 2.0
      1614071Updated Jan 31, 2025Jan 31, 2025
    • A framework for few-shot evaluation of language models.
      Python
      MIT License
      2k7.6k35198Updated Jan 31, 2025Jan 31, 2025
    • clearnets

      Public
      Python
      MIT License
      0400Updated Jan 31, 2025Jan 31, 2025
    • Closed-form polynomial approximations to neural networks
      Python
      MIT License
      0200Updated Jan 31, 2025Jan 31, 2025
    • Precisely estimating the volume of basins in neural net parameter space corresponding to interpretable behaviors
      Jupyter Notebook
      Apache License 2.0
      0100Updated Jan 31, 2025Jan 31, 2025
    • Experiments in transformer knowledge and reasoning
      Jupyter Notebook
      MIT License
      131000Updated Jan 30, 2025Jan 30, 2025
    • DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
      Python
      Apache License 2.0
      4.2k16402Updated Jan 29, 2025Jan 29, 2025
    • A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
      Python
      Apache License 2.0
      352000Updated Jan 29, 2025Jan 29, 2025
    • Acompanying code for our research on SAE feature overlap when trained on different seeds.
      Jupyter Notebook
      Apache License 2.0
      1200Updated Jan 28, 2025Jan 28, 2025
    • mdl

      Public
      Minimum Description Length probing for neural network representations
      Python
      MIT License
      21802Updated Jan 28, 2025Jan 28, 2025
    • elk

      Public
      Keeping language models honest by directly eliciting knowledge encoded in their activations.
      Python
      MIT License
      331941510Updated Jan 27, 2025Jan 27, 2025
    • MIDI tokenizers and pre-processing utils.
      Python
      Apache License 2.0
      1100Updated Jan 27, 2025Jan 27, 2025
    • Erasing concepts from neural representations with provable guarantees
      Python
      MIT License
      1522122Updated Jan 27, 2025Jan 27, 2025
    • cookbook

      Public
      Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
      Python
      Apache License 2.0
      3875981Updated Jan 23, 2025Jan 23, 2025
    • sae

      Public
      Sparse autoencoders
      Python
      MIT License
      5442052Updated Jan 21, 2025Jan 21, 2025
    • aria

      Public
      Python
      Apache License 2.0
      114200Updated Dec 24, 2024Dec 24, 2024
    • Jupyter Notebook
      MIT License
      0400Updated Dec 14, 2024Dec 14, 2024
    • website

      Public
      New website for EleutherAI based on Hugo static site generator
      HTML
      6402Updated Dec 12, 2024Dec 12, 2024
    • Jupyter Notebook
      Apache License 2.0
      21800Updated Dec 11, 2024Dec 11, 2024
    • pythia

      Public
      The hub for EleutherAI's work on interpretability and learning dynamics
      Jupyter Notebook
      Apache License 2.0
      1752.4k253Updated Dec 5, 2024Dec 5, 2024
    • aria-amt

      Public
      Efficient and robust implementation of seq-to-seq automatic piano transcription.
      Python
      Apache License 2.0
      83200Updated Dec 2, 2024Dec 2, 2024
    • The simplest, fastest repository for training/finetuning medium-sized GPTs.
      Python
      MIT License
      6.3k9100Updated Nov 19, 2024Nov 19, 2024
    • Jupyter Notebook
      54514Updated Nov 17, 2024Nov 17, 2024
    • monkfish

      Public
      Python
      MIT License
      1400Updated Nov 1, 2024Nov 1, 2024
    • Understanding how features learned by neural networks evolve throughout training
      Python
      MIT License
      13200Updated Oct 24, 2024Oct 24, 2024
    • The code used in "Balancing Label Quantity and Quality for Scalable Elicitation"
      Jupyter Notebook
      MIT License
      4300Updated Oct 22, 2024Oct 22, 2024
    • Efficiently computing & storing token n-grams from large corpora
      Rust
      MIT License
      31700Updated Oct 6, 2024Oct 6, 2024
    • Adds GaLore style projection wrappers to optax optimizers
      Python
      MIT License
      0400Updated Oct 3, 2024Oct 3, 2024
    • Equinox implementation of llama3 and llama3.1
      Python
      MIT License
      0710Updated Oct 3, 2024Oct 3, 2024