Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

(DO NOT MERGE) IBM release WIP #76

Closed
wants to merge 106 commits into from
Closed

Conversation

prashantgupta24
Copy link

@prashantgupta24 prashantgupta24 commented Jul 1, 2024

All vllm integration tests passing on this image!

mgoin and others added 30 commits July 1, 2024 11:54
Co-authored-by: Varun Sundar Rabindranath <varun@neuralmagic.com>
Signed-off-by: kevin <kevin@anyscale.com>
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
robertgshaw2-redhat and others added 18 commits July 1, 2024 11:54
Co-authored-by: Robert Shaw <rshaw@neuralmagic>
[ci][distributed] fix some cuda init that makes it necessary to use spawn (vllm-project#5991)
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
…xpected modules. (vllm-project#5909)

Co-authored-by: sang <sangcho@anyscale.com>
Co-authored-by: rshaw@neuralmagic.com <rshaw@neuralmagic>
…Weight Loading) (vllm-project#5940)

Co-authored-by: Robert Shaw <rshaw@neuralmagic>
@openshift-ci openshift-ci bot requested review from dtrifiro and rpancham July 1, 2024 19:08
Copy link

openshift-ci bot commented Jul 1, 2024

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: prashantgupta24
Once this PR has been reviewed and has the lgtm label, please assign terrytangyuan for approval. For more information see the Kubernetes Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@prashantgupta24
Copy link
Author

/test all

@prashantgupta24 prashantgupta24 changed the title July 1 upstream 4645 IBM release WIP Jul 1, 2024
@prashantgupta24 prashantgupta24 changed the title IBM release WIP (DO NOT MERGE) IBM release working Jul 1, 2024
@prashantgupta24 prashantgupta24 changed the title (DO NOT MERGE) IBM release working (DO NOT MERGE) IBM release WIP Jul 1, 2024
Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com>
@prashantgupta24 prashantgupta24 deleted the july-1-upstream-4645 branch July 3, 2024 14:04
prarit pushed a commit to prarit/vllm that referenced this pull request Oct 18, 2024
* fix gradlib fp8 output

* add condition check for existing tune result

* fix linter

* fix import order

* fix lint
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.