Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Enhancements to LLM Instance Gateway: Scheduling Logic, and Documentation Updates (#78)

* squashed modify filter for LoRA affinity modify filter for LoRA affinity
* update llm service and llm server pool yaml, readme
* remove unused method from metrics.go
* add flowchart image
* update size flowchart image
* remove image name
* update queueingThresholdLoRA to 50
* roll back manifest changes
* roll back manifest changes
* update filter and scheduler based on comments
* rename filters
* update filter names and comments
* fix readme
* fix comment
* modify flowchart
* add comment to lowLoRACostPredicate reasoning when it can be useful
1 parent 83f701b, commit 5372efb
Showing 5 changed files with 70 additions and 12 deletions.
@@ -136,4 +136,4 @@ spec:
   emptyDir:
     medium: Memory
 - name: adapters
-  emptyDir: {}
+  emptyDir: {}