WIP upstream merge (DO NOT MERGE) #70
…vllm-project#5710)
Co-authored-by: Roger Wang <ywang@roblox.com>
…penai/run_batch.py (vllm-project#5756)
Co-authored-by: Varun Sundar Rabindranath <varun@neuralmagic.com>
Signed-off-by: kevin <kevin@anyscale.com>
…lel size than target model (vllm-project#5414)
…ements, test fixes (vllm-project#5422)
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
…rly with ring buffer. (vllm-project#5905)
Signed-off-by: Xiaowei Jiang <xwjiang2010@gmail.com>
Co-authored-by: Roger Wang <ywang@roblox.com>
Co-authored-by: ywang96 <ywang@roblox.com>
…_tokens` is set too high (vllm-project#5894)
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
…vllm-project#5927)
Signed-off-by: Xiaowei Jiang <xwjiang2010@gmail.com>
[APPROVALNOTIFIER] This PR is NOT APPROVED
This pull-request has been approved by: prashantgupta24
The full list of commands accepted by this bot can be found here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing
/test all
Still included in built docker images
Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com>
Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com>
* Enabling some basic tests for ROCm 6.2
  Use strict xfail for ROCm 6.2 test repairs
* Use lenient xfail instead
---------
Co-authored-by: Alexei V. Ivanov <alexei.ivanov@amd.com>
Created solely to try squash-merging the commits in a different way.