
[BugFix][Core] Fix BlockManagerV2 when Encoder Input is None #9103

Merged (60 commits) on Oct 7, 2024

Conversation

@sroy745 (Contributor) commented Oct 6, 2024

This PR addresses an issue encountered with certain encoder-decoder models, e.g. Llama-3.2-11B-Vision-Instruct, where `encoder_input` is set to `None`. This situation arises in the following test cases:

tests/models/encoder_decoder/vision_language/test_mllama.py::test_models[5-128-bfloat16-sizes0-meta-llama/Llama-3.2-11B-Vision-Instruct]
tests/models/encoder_decoder/vision_language/test_mllama.py::test_models[5-128-bfloat16-sizes4-meta-llama/Llama-3.2-11B-Vision-Instruct]

Currently, these tests pass with BlockManagerV1 because, when `encoder_input` is `None` during the allocation of the cross block table, an empty BlockTable is created without any blocks. However, this behavior does not hold for BlockManagerV2, where adding a BlockTable and attempting to allocate blocks for an empty or `None` `encoder_input` leads to assertion failures.

We make the following changes in this PR:

  1. For cases where `encoder_input` is `None`, we create an empty BlockTable, similar to the behavior in BlockManagerV1, but without adding any blocks to it.
  2. We have removed two assertions in BlockManagerV2 that pertain to free blocks and the calculation of physical block ids. These assertions failed when the BlockTable is empty, which is a valid state in this use case, so removing them does not affect the BlockManager's functionality.
  3. With these changes, we are re-enabling BlockManagerV2 for encoder-decoder models.
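Changes (1) and (2) above can be illustrated with a minimal sketch. This is not vLLM's actual code: `BlockTable`, `allocate`, and `allocate_cross_block_table` are simplified stand-ins assumed here only to show the pattern of always registering a cross block table but allocating blocks only when encoder input is actually present.

```python
class BlockTable:
    """Simplified stand-in for a block table: a list of allocated block ids."""
    def __init__(self):
        self.blocks = []

    def allocate(self, num_tokens, block_size=16):
        # Allocate one block per block_size tokens (ids here are illustrative).
        num_blocks = -(-num_tokens // block_size)  # ceiling division
        self.blocks = list(range(num_blocks))


def allocate_cross_block_table(encoder_input):
    """Mirror of change (1): always create the BlockTable, but only
    allocate blocks when there is non-empty encoder input, so an empty
    table is a valid state rather than an assertion failure."""
    table = BlockTable()
    if encoder_input:  # False for both None and an empty sequence
        table.allocate(len(encoder_input))
    return table


# Normal encoder input: blocks are allocated as before.
assert allocate_cross_block_table(list(range(32))).blocks == [0, 1]
# encoder_input is None (the Llama-3.2-11B-Vision-Instruct case):
# the cross block table stays empty, matching BlockManagerV1's behavior.
assert allocate_cross_block_table(None).blocks == []
```

Under this sketch, downstream code that computes physical block ids must tolerate an empty block list, which is why the corresponding assertions were dropped in change (2).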

FIX #9099
FIX #9084

sroy745 added 30 commits May 28, 2024 20:39

github-actions bot commented Oct 6, 2024

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which covers a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build in the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can do one of these:

  • Add ready label to the PR
  • Enable auto-merge.

🚀

@sroy745 sroy745 marked this pull request as draft October 6, 2024 04:22
@sroy745 sroy745 changed the title [WIP][BugFix][Core] Fix BlockManagerV2 when Encoder Input is None [BugFix][Core] Fix BlockManagerV2 when Encoder Input is None Oct 6, 2024
@sroy745 (Contributor, Author) commented Oct 6, 2024

@comaniac / @heheda12345 PTAL when you get a chance

@comaniac (Collaborator) left a comment:

LGTM

vllm/core/block_manager_v2.py (review thread, outdated and resolved)
@comaniac comaniac marked this pull request as ready for review October 6, 2024 19:12
@comaniac comaniac added the ready ONLY add when PR is ready to merge/full CI is needed label Oct 6, 2024
Co-authored-by: Cody Yu <hao.yu.cody@gmail.com>
@comaniac comaniac enabled auto-merge (squash) October 6, 2024 20:02
@heheda12345 (Collaborator) commented:

LGTM too.

auto-merge was automatically disabled October 6, 2024 23:56

Head branch was pushed to by a user without write access

@comaniac comaniac enabled auto-merge (squash) October 7, 2024 00:03
@comaniac comaniac merged commit c8f26bb into vllm-project:main Oct 7, 2024
60 checks passed
Labels
ready ONLY add when PR is ready to merge/full CI is needed

Successfully merging this pull request may close these issues.

[Bug]: Llama-3.2-11B-Vision-Instruct which is an encoder-decoder model fails with BlockManager V2
3 participants