forked from vllm-project/vllm
Pull requests: ROCm/vllm
Pull requests list
[Quant] [Feature] Per-Token-Activation Per-Channel-Weight FP8 Quantization
#412 opened Feb 7, 2025 by tjtanaa
K8test baseline -> Testing a single MI300 8x GPU node for CI performance // no need to merge
#409 opened Feb 6, 2025 by Alexei-V-Ivanov-AMD
Add TritonScaledMMLinearKernel to fix broken support for int8 models
#377 opened Jan 21, 2025 by rasmith
[Cleanup] Remove obsolete patches and references and test CI
#354 opened Jan 9, 2025 by hongxiayang