LocalVLLMModel and deployment handler #102
base: main
Conversation
…soft/eureka-ml-insights into mharrison/local-vllm-model
@microsoft-github-policy-service agree company="Microsoft"
init_args["model_config"] = model_config | ||
# Logic above is that certain deployment parameters like ports and num_servers | ||
# can be variable and so we allow them to be overridden by command line args. | ||
except: |
Please specify what exception you are catching here (AttributeError?)
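For instance, a minimal sketch of the change being asked for, keeping the names from the surrounding diff and assuming the lookup inside the try block raises AttributeError when args.model_config is a plain string rather than the name of a predefined ModelConfig:

```python
try:
    # (the resolution of model_config from args.model_config happens above;
    # if args.model_config is a plain string, that lookup raises AttributeError)
    init_args["model_config"] = model_config
    # Deployment parameters like ports and num_servers can still be
    # overridden by command line args.
except AttributeError:
    # Treat args.model_config as a raw model identifier that vLLM can resolve.
    init_args["model_config"] = ModelConfig(
        LocalVLLMModel,
        {"model_name": args.model_config},
    )
```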
init_args["model_config"] = ModelConfig( | ||
LocalVLLMModel, | ||
{ | ||
"model_name": args.model_config, |
Should model_name be passed as a separate command-line argument instead of reusing the existing model_config argument? It looks like this should be a string, not a ModelConfig object.
My thought was: someone passing --local-vllm has the option to provide --model_config as either a ModelConfig or a string that uniquely identifies the model to vLLM, so we can try to catch either. This allows a bit more flexibility, so they don't have to change scripts based on which type they're passing when they call eureka (and having both model_config and model_name as command line args might confuse someone if it's not clear to only pass the latter in conjunction with vllm). I don't really have a strong opinion though; happy to do what you think is best.
(The obvious negative of the way I've done it is that the name "model_config" is no longer accurate.)
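For comparison, the reviewer's alternative might look roughly like this sketch; the --model_name flag and the argparse wiring are hypothetical and not part of this PR, while ModelConfig, LocalVLLMModel, and init_args are taken from the diff context:

```python
import argparse

parser = argparse.ArgumentParser()
# Existing flag: --model_config names a predefined ModelConfig object.
parser.add_argument("--model_config", type=str, default=None)
# Hypothetical new flag: a raw model identifier handed straight to vLLM,
# only meaningful together with --local-vllm.
parser.add_argument("--model_name", type=str, default=None)
parser.add_argument("--local-vllm", action="store_true")
args = parser.parse_args()

if args.local_vllm and args.model_name:
    init_args["model_config"] = ModelConfig(LocalVLLMModel, {"model_name": args.model_name})
```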
@@ -2,9 +2,13 @@
import json
import logging
import random
import re
This import seems to be unused. isort should remove unused imports if you run make format-inplace
"n_output_tokens": raw_output["usage"]["completion_tokens"] | ||
} | ||
|
||
def generate(self, text_prompt, query_images=None, system_message=None): |
Should we have support for previous_messages similar to other models to enable chat?
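As a rough illustration, chat support along the lines of the other models might thread prior turns into the request like this; the method name, signature, and payload shape here are assumptions, not this PR's code:

```python
def create_request(self, text_prompt, query_images=None, system_message=None, previous_messages=None):
    # Build an OpenAI-style chat payload, splicing in any prior turns so the
    # server sees the full conversation history.
    messages = []
    if system_message:
        messages.append({"role": "system", "content": system_message})
    if previous_messages:
        messages.extend(previous_messages)
    messages.append({"role": "user", "content": text_prompt})
    return {"messages": messages}
```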
@dataclass
class LocalVLLMModel(Model, OpenAICommonRequestResponseMixIn):
Can we extend from EndpointModel instead of Model to inherit the retry, exception handling, chat history maintenance, etc.?
I'm not confident about thread safety here. Don't we have the same potential issues with all calls to EndpointModel's generate() sharing the instance variables model_output, is_valid, etc.? But if this is solved in EndpointModel and OpenAICommonRequestResponseMixIn, I do think LocalVLLMModel can basically inherit its methods from these two (with just the minor logic to get a random client).
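If the shared-state question is settled inside EndpointModel, the inheritance discussed above could end up looking roughly like this sketch; the import path, the deployment_handler attribute, and the get_client hook are assumptions:

```python
import random
from dataclasses import dataclass

from eureka_ml_insights.models import EndpointModel, OpenAICommonRequestResponseMixIn  # import path assumed


@dataclass
class LocalVLLMModel(OpenAICommonRequestResponseMixIn, EndpointModel):
    # Retry, exception handling, and chat-history bookkeeping come from
    # EndpointModel; OpenAI-style request/response formatting comes from the mixin.
    model_name: str = None

    def get_client(self):
        # The only local logic: pick one of the deployed vLLM servers at random
        # so requests are spread across them (deployment_handler is hypothetical).
        return random.choice(self.deployment_handler.clients)
```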
This PR introduces the LocalVLLMModel and the LocalVLLMDeploymentHandler. The former is a usual Model (i.e., it implements generate); the latter is a new class that handles aspects of deployment (the deployment itself, health checks on your deployment, and shutdown of servers).

Use LocalVLLMModel either by defining a ModelConfig or by passing info on the command line so that vLLM can recognize your deployment, or even deploy for you. If you have already deployed, pass the "ports" parameter; otherwise the LocalVLLMDeploymentHandler will spin up "num_servers" (default = 1) servers for you, wait for the deployment to finish, and continue with the eval pipeline.
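For illustration, the two modes the description mentions might be configured like this; the config variable names and the example model identifier are made up, while model_name, ports, and num_servers come from the PR:

```python
# Reuse servers you have already deployed by telling the handler their ports.
LOCAL_VLLM_EXISTING_SERVERS = ModelConfig(
    LocalVLLMModel,
    {"model_name": "microsoft/phi-4", "ports": ["8000", "8001"]},
)

# Or let LocalVLLMDeploymentHandler spin up the servers itself.
LOCAL_VLLM_AUTODEPLOY = ModelConfig(
    LocalVLLMModel,
    {"model_name": "microsoft/phi-4", "num_servers": 2},
)
```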