Skip to content

Actions: neuralmagic/vllm

pre-commit

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
33 workflow runs
33 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[Ignore] Temporary PR for comparing changes
pre-commit #33: Pull request #55 synchronize by afeldman-nm
January 31, 2025 20:53 4m 31s afeldman-nm/v1_logprobs_test
January 31, 2025 20:53 4m 31s
[Ignore] Temporary PR for comparing changes
pre-commit #32: Pull request #55 synchronize by afeldman-nm
January 31, 2025 19:46 4m 29s afeldman-nm/v1_logprobs_test
January 31, 2025 19:46 4m 29s
[Ignore] Temporary PR for comparing changes
pre-commit #31: Pull request #55 opened by afeldman-nm
January 31, 2025 19:46 4m 35s afeldman-nm/v1_logprobs_test
January 31, 2025 19:46 4m 35s
Add favicon to docs (#12611)
pre-commit #30: Commit e3f7ff6 pushed by tlrmchlsmth
January 31, 2025 18:00 4m 47s main
January 31, 2025 18:00 4m 47s
[Bugfix] Gracefully handle huggingface hub http error (#12571)
pre-commit #29: Commit 7a8987d pushed by rahul-tuli
January 31, 2025 14:20 4m 42s main
January 31, 2025 14:20 4m 42s
[Kernel] Update cutlass_scaled_mm to support 2d group (blockwise) s…
pre-commit #28: Commit 9798b2f pushed by tlrmchlsmth
January 31, 2025 03:25 4m 24s main
January 31, 2025 03:25 4m 24s
[Kernel] Triton Configs for Fp8 Block Quantization (#11589)
pre-commit #27: Commit 9b0c4ba pushed by tlrmchlsmth
January 30, 2025 19:58 4m 37s main
January 30, 2025 19:58 4m 37s
[Misc] fix typo: add missing space in lora adapter error message (#12…
pre-commit #26: Commit 41bf561 pushed by tlrmchlsmth
January 30, 2025 16:09 4m 36s main
January 30, 2025 16:09 4m 36s
[V1][BugFix] Free encoder cache for aborted requests (#12545)
pre-commit #25: Commit e0cc5f2 pushed by SageMoore
January 29, 2025 23:26 5m 27s main
January 29, 2025 23:26 5m 27s
[WIP] Working Grouped gemm with group ID
pre-commit #24: Pull request #48 synchronize by ElizaWszola
January 29, 2025 13:55 4m 43s grouped-gemm-with-group-id
January 29, 2025 13:55 4m 43s
[WIP] Working Grouped gemm with group ID
pre-commit #23: Pull request #48 synchronize by ElizaWszola
January 29, 2025 07:10 4m 28s grouped-gemm-with-group-id
January 29, 2025 07:10 4m 28s
[WIP] Working Grouped gemm with group ID
pre-commit #22: Pull request #48 synchronize by ElizaWszola
January 28, 2025 04:46 4m 31s grouped-gemm-with-group-id
January 28, 2025 04:46 4m 31s
[Bugfix] Fix gpt2 GGUF inference (#12467)
pre-commit #21: Commit ce69f7f pushed by tlrmchlsmth
January 27, 2025 15:14 4m 36s main
January 27, 2025 15:14 4m 36s
January 27, 2025 09:38 4m 31s
[Bugfix/CI] Fix broken kernels/test_mha.py (#12450)
pre-commit #19: Commit 72f4880 pushed by tlrmchlsmth
January 26, 2025 18:59 4m 26s main
January 26, 2025 18:59 4m 26s
[Bugfix] Disable w16a16 2of4 sparse CompressedTensors24 (#12417)
pre-commit #18: Commit aa2cd2c pushed by tlrmchlsmth
January 26, 2025 17:09 4m 21s main
January 26, 2025 17:09 4m 21s
Bump jinja2 from 3.1.4 to 3.1.5
pre-commit #17: Pull request #49 synchronize by dependabot bot
January 25, 2025 20:52 4m 31s dependabot/pip/jinja2-3.1.5
January 25, 2025 20:52 4m 31s
[TPU][CI] Update torchxla version in requirement-tpu.txt (#12422)
pre-commit #16: Commit 324960a pushed by tlrmchlsmth
January 25, 2025 20:48 4m 18s main
January 25, 2025 20:48 4m 18s
[ROCm][MoE] MI300 tuned configs Mixtral-8x(7B,22B) | fp16, fp8 (#12408)
pre-commit #15: Commit bf21481 pushed by tlrmchlsmth
January 25, 2025 04:40 4m 32s main
January 25, 2025 04:40 4m 32s
[WIP] Working Grouped gemm with group ID
pre-commit #14: Pull request #48 synchronize by ElizaWszola
January 24, 2025 22:41 4m 30s grouped-gemm-with-group-id
January 24, 2025 22:41 4m 30s
[Bugfix][Kernel] FA3 Fix - RuntimeError: This flash attention build o…
pre-commit #13: Commit 3132a93 pushed by tlrmchlsmth
January 24, 2025 21:08 4m 22s main
January 24, 2025 21:08 4m 22s
[Bugfix][Kernel] Fix CUDA 11.8 being broken by FA3 build (#12375)
pre-commit #12: Commit ab5bbf5 pushed by tlrmchlsmth
January 24, 2025 16:11 4m 23s main
January 24, 2025 16:11 4m 23s
[Misc] Enable proxy support in benchmark script (#12356)
pre-commit #11: Commit 3bb8e2c pushed by tlrmchlsmth
January 24, 2025 15:03 4m 29s main
January 24, 2025 15:03 4m 29s
[Docs] Document Phi-4 support (#12362)
pre-commit #10: Commit 2cbeeda pushed by SageMoore
January 23, 2025 20:17 4m 21s main
January 23, 2025 20:17 4m 21s
[WIP] Working Grouped gemm with group ID
pre-commit #9: Pull request #48 synchronize by ElizaWszola
January 23, 2025 18:28 4m 37s grouped-gemm-with-group-id
January 23, 2025 18:28 4m 37s