    Repositories list

    • Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton.
      Python
      Other
      43800 Updated Feb 11, 2025
    • sglang

      Public
      This is a fork of SGLang for hip-attention integration. Please refer to hip-attention for details.
      Python
      Apache License 2.0
      878300 Updated Feb 6, 2025
    • A framework that encodes and learns from diverse architectures, enabling rapid adaptation, new model generation, and performance gains via latent space exploration.
      Apache License 2.0
      0000 Updated Jan 21, 2025
    • triton

      Public
      Development repository for the Triton language and compiler
      C++
      MIT License
      1.8k000 Updated Oct 11, 2024
    • vllm

      Public
      Up to 4x faster decoding than vLLM using HiP Attention: https://github.com/DeepAuto-AI/hip-attention
      Python
      Apache License 2.0
      5.6k100 Updated Oct 11, 2024
    • Forked vLLM framework for the DeepAuto Chat Platform. Supports HiP Attention.
      Python
      Apache License 2.0
      0100 Updated Jul 8, 2024