Skip to content

Actions: neuralmagic/vllm

yapf

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
288 workflow runs
288 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Rebased ds
yapf #214: Pull request #38 opened by dsikka
December 7, 2024 03:49 1m 4s rebased-ds
December 7, 2024 03:49 1m 4s
[ci] fix broken tests (#10956)
yapf #213: Commit dcdc3fa pushed by tlrmchlsmth
December 6, 2024 19:45 1m 54s main
December 6, 2024 19:45 1m 54s
[torch.compile] Dynamic fp8 + rms_norm fusion
yapf #212: Pull request #31 synchronize by ProExpertProg
December 5, 2024 01:26 1m 56s luka/rms-norm-fusion-refactor
December 5, 2024 01:26 1m 56s
[benchmark] Make H100 benchmark optional (#10908)
yapf #211: Commit 7883c2b pushed by tlrmchlsmth
December 5, 2024 01:09 1m 39s main
December 5, 2024 01:09 1m 39s
[torch.compile] Dynamic fp8 + rms_norm fusion
yapf #210: Pull request #31 synchronize by ProExpertProg
December 5, 2024 00:20 1m 41s luka/rms-norm-fusion-refactor
December 5, 2024 00:20 1m 41s
[torch.compile] Dynamic fp8 + rms_norm fusion
yapf #209: Pull request #31 synchronize by ProExpertProg
December 4, 2024 22:48 1m 49s luka/rms-norm-fusion-refactor
December 4, 2024 22:48 1m 49s
[CI/Build] improve python-only dev setup (#9621)
yapf #208: Commit e4c34c2 pushed by ProExpertProg
December 4, 2024 22:46 2m 4s main
December 4, 2024 22:46 2m 4s
[torch.compile] Dynamic fp8 + rms_norm fusion
yapf #207: Pull request #31 synchronize by ProExpertProg
December 4, 2024 22:45 1m 47s luka/rms-norm-fusion-refactor
December 4, 2024 22:45 1m 47s
December 3, 2024 14:50 2m 9s
[misc] use out argument for flash attention (#10822)
yapf #205: Commit a4c4daf pushed by tlrmchlsmth
December 2, 2024 14:49 2m 3s main
December 2, 2024 14:49 2m 3s
[Bugfix] Ignore lm_head when loading embedding models (#10719)
yapf #204: Commit 9b4b150 pushed by tlrmchlsmth
November 27, 2024 21:13 1m 49s main
November 27, 2024 21:13 1m 49s
[Model] Enable optional prefix when loading embedding models (#10639)
yapf #203: Commit cf73f0c pushed by tlrmchlsmth
November 25, 2024 18:15 1m 58s main
November 25, 2024 18:15 1m 58s
November 23, 2024 01:12 1m 53s
[bugfix] fix full graph tests (#10581)
yapf #201: Commit db100c5 pushed by tlrmchlsmth
November 22, 2024 19:21 2m 47s main
November 22, 2024 19:21 2m 47s
[Minor] Revert change in offline inference example (#10545)
yapf #200: Commit 46fe9b4 pushed by ProExpertProg
November 21, 2024 23:35 1m 54s main
November 21, 2024 23:35 1m 54s
November 21, 2024 14:14 1m 50s
[ci][bugfix] fix kernel tests (#10431)
yapf #198: Commit 2298e69 pushed by robertgshaw2-redhat
November 18, 2024 23:32 1m 45s main
November 18, 2024 23:32 1m 45s
[Model] Remove redundant softmax when using PoolingType.STEP (#10415)
yapf #197: Commit 01aae1c pushed by ElizaWszola
November 18, 2024 12:23 1m 53s main
November 18, 2024 12:23 1m 53s
November 18, 2024 06:24 1m 44s
Semi structured v2
yapf #195: Pull request #32 synchronize by ilmarkov
November 15, 2024 16:53 1m 51s semi_structured_v2
November 15, 2024 16:53 1m 51s
November 15, 2024 14:30 1m 46s
[Bugfix] bitsandbytes models fail to run pipeline parallel (#10200)
yapf #193: Commit ac49b59 pushed by tlrmchlsmth
November 13, 2024 19:27 1m 47s main
November 13, 2024 19:27 1m 47s
Semi structured v2
yapf #192: Pull request #32 opened by ilmarkov
November 13, 2024 11:55 1m 46s semi_structured_v2
November 13, 2024 11:55 1m 46s
[V1] Enable Inductor when using piecewise CUDA graphs (#10268)
yapf #191: Commit 1f55e05 pushed by SageMoore
November 12, 2024 21:44 2m 11s main
November 12, 2024 21:44 2m 11s
[1/N] torch.compile user interface design (#10237)
yapf #190: Commit eea55cc pushed by varun-sundar-rabindranath
November 12, 2024 03:09 1m 53s main
November 12, 2024 03:09 1m 53s