feat: Naive Support for Hopper FP8 Prefill Kernel with Per-Head Quant… #587
Artifacts
Produced during runtime
Name | Size | |
---|---|---|
github-pages
|
555 KB |
|