Repositories list
6 repositories
- Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton.
- auto-evolution (Public)
- vllm (Public). Up to 4x faster decoding than vLLM using HiP Attention: https://github.com/DeepAuto-AI/hip-attention
- vllm-legacy (Public)