Change the repository type filter
All
Repositories list
8 repositories
flash-linear-attention
Public🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Tritonfla-zoo
PublicFlash-Linear-Attention models beyond languageflame
Public🔥 A minimal training framework for scaling FLA modelsfla-rl
PublicThunderKittens
Publicnative-sparse-attention
Publicflash-hybrid-attention
Public- Triton implement of bi-directional (non-causal) linear attention