Skip to content

Issues: vllm-project/vllm

[Roadmap] vLLM Roadmap Q2 2025
#15735 opened Mar 29, 2025 by simon-mo
Open 1
[V1] Feedback Thread
#12568 opened Jan 30, 2025 by simon-mo
Open 81
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

[Bug]: TP with external_launcher is not working with vLLM version 0.8.0 and above bug Something isn't working
#15895 opened Apr 1, 2025 by toslali-ibm
1 task done
[Feature]: Fused MoE config for Nvidia RTX 3090 feature request New feature or request
#15880 opened Apr 1, 2025 by davidsyoung
1 task done
[Bug]: CPU offload not working for vllm serve bug Something isn't working
#15877 opened Apr 1, 2025 by hamaadtahiir
1 task done
[Bug]: building docker from Dockerfile bug Something isn't working
#15872 opened Apr 1, 2025 by surak
1 task done
[Bug]: CPU offload not working for DeepSeek-V2-Lite-Chat bug Something isn't working
#15871 opened Apr 1, 2025 by ymcki
1 task done
[Bug]: FP8 accuracy decreases with long inputs bug Something isn't working
#15865 opened Apr 1, 2025 by fan-niu
1 task done
[Bug]: qwen2.5-omni model failed to start bug Something isn't working
#15864 opened Apr 1, 2025 by hackerHiJu
1 task done
Does VLLM support structured pruning? usage How to use vllm
#15854 opened Apr 1, 2025 by wangwenmingaa
1 task done
[Bug]: [V1] Testla T4 cannot work for V1 bug Something isn't working
#15853 opened Apr 1, 2025 by maobaolong
1 task done
[New Model]: nomic-ai/nomic-embed-text-v2-moe and nvidia/NV-Embed-v2 new model Requests to new models
#15849 opened Apr 1, 2025 by RohitRathore1
1 task done
[Bug]: served-model-name not being returned in model field of response bug Something isn't working
#15845 opened Apr 1, 2025 by nbertagnolli
1 task done
[Feature]: A Hacked Classifier Free Guidance Metho feature request New feature or request
#15839 opened Mar 31, 2025 by MSLDCherryPick
1 task done
[Bug]: [TPU] V1 seems to silently crash after a while bug Something isn't working
#15833 opened Mar 31, 2025 by kiratp
1 task done
[Bug]: Docker build takes more than 5000 seconds bug Something isn't working
#15827 opened Mar 31, 2025 by HadiSDev
[Bug]: Gemma 3 27B IT Model Doesn't Read Image (Responds To Text Only) bug Something isn't working
#15825 opened Mar 31, 2025 by dawnik17
1 task done
[Bug]: Failed to run an GPTQModel quantized model with vLLM bug Something isn't working
#15817 opened Mar 31, 2025 by Maglanyulan
1 task done
ProTip! no:milestone will show everything without a milestone.