Actions: neuralmagic/vllm

PR Reminder Comment Bot

54 workflow runs

Add hf_transfer to testing image
PR Reminder Comment Bot #29: Pull request #29 opened by mgoin
November 6, 2024 22:03 12s
[V1] Various updates
PR Reminder Comment Bot #28: Pull request #28 opened by njhill
November 6, 2024 20:47 12s
Emulated dynamic MX quantization
PR Reminder Comment Bot #27: Pull request #27 opened by mgoin
November 6, 2024 18:21 11s
patch
PR Reminder Comment Bot #26: Pull request #26 opened by dsikka
November 4, 2024 23:31 16s
Overlap io
PR Reminder Comment Bot #25: Pull request #25 opened by robertgshaw2-redhat
November 4, 2024 18:51 14s
Stop String Plumbing
PR Reminder Comment Bot #24: Pull request #24 opened by varun-sundar-rabindranath
November 4, 2024 04:02 10s
Stop strings
PR Reminder Comment Bot #23: Pull request #23 opened by robertgshaw2-redhat
November 1, 2024 00:39 11s
split core process into separate class
PR Reminder Comment Bot #22: Pull request #22 opened by njhill
October 31, 2024 16:10 14s
Hqq support
PR Reminder Comment Bot #21: Pull request #21 opened by ElizaWszola
October 14, 2024 12:18 16s
Add silu-mul to rms-norm fusion
PR Reminder Comment Bot #20: Pull request #20 opened by ProExpertProg
October 11, 2024 02:00 15s
Another branch me
PR Reminder Comment Bot #19: Pull request #19 opened by dsikka
September 30, 2024 21:54 11s
Awq moe another
PR Reminder Comment Bot #18: Pull request #18 opened by dsikka
September 30, 2024 19:45 16s
Awq debug moe
PR Reminder Comment Bot #17: Pull request #17 opened by dsikka
September 30, 2024 19:24 14s
Awq moe debug
PR Reminder Comment Bot #16: Pull request #16 opened by dsikka
September 30, 2024 18:20 10s
Awq moe
PR Reminder Comment Bot #15: Pull request #15 opened by dsikka
September 30, 2024 18:15 10s
Temp moe branch
PR Reminder Comment Bot #14: Pull request #14 opened by dsikka
September 30, 2024 16:59 10s
add awq moe
PR Reminder Comment Bot #13: Pull request #13 opened by dsikka
September 26, 2024 18:05 12s
Update cpu_extension.cmake
PR Reminder Comment Bot #12: Pull request #12 opened by ProExpertProg
September 23, 2024 01:28 14s
Dynamic group blocks in Marlin MoE
PR Reminder Comment Bot #11: Pull request #11 opened by ElizaWszola
September 20, 2024 12:05 15s
Add zero point support to Marlin MoE kernel
PR Reminder Comment Bot #10: Pull request #10 opened by ElizaWszola
September 17, 2024 13:58 16s
Guided gen support
PR Reminder Comment Bot #9: Pull request #9 opened by DhruvaBansal00
September 10, 2024 01:25 11s
GPTQ Fused MoE class
PR Reminder Comment Bot #8: Pull request #8 opened by ElizaWszola
September 3, 2024 15:07 15s
test
PR Reminder Comment Bot #7: Pull request #7 opened by robertgshaw2-redhat
August 28, 2024 20:09 33s
Run metrics async
PR Reminder Comment Bot #6: Pull request #6 opened by robertgshaw2-redhat
August 10, 2024 21:38 11s
[WIP, Kernel] (2/N) Machete - Integrate into GPTQMarlinLinearMethod and CompressedTensorsWNA16
PR Reminder Comment Bot #5: Pull request #5 opened by LucasWilkinson
August 7, 2024 23:10 13s