[VLM] Qwen 2.5 VL #1113

kylesayrs · 2025-01-29T02:48:38Z

Purpose

Support quantizing Qwen 2.5 VL model

Changes

Add example examples/multimodal_vision/qwen_2_5_vl_example.py
Implement TraceableQwen2_5_VLForConditionalGeneration
Small style changes to TraceableQwen2_VLForConditionalGeneration

Testing

Ran added example to completion

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

github-actions · 2025-01-29T02:48:48Z

👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review.

Note: This is required to complete the testing suite, please only add the label once the PR is code complete and local testing has been performed.

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

brian-dellabetta

I must admit I don't understand a good amount of the qwen2.5-specific tracing code, and shared this with @kylesayrs , but as this is code that will only run specific to the Qwen 2/2.5 VL models, I am not being too meticulous in the review.

Thanks Kyle! 🚀

brian-dellabetta · 2025-02-05T17:30:50Z

examples/multimodal_vision/qwen_2_5_vl_example.py

+from llmcompressor.transformers.tracing import (
+    TraceableQwen2_5_VLForConditionalGeneration,
+)
+


a quick ELI5 would be nice to have at the top

Suggested change

# The following example shows how the Qwen2.5 Vision-Language model can be W4A16 quantized

I think the readme in this folder provides enough context for a user to know how to use this example/what it's doing

kylesayrs · 2025-02-06T18:26:47Z

Need to wait until 2.5 lands in transformers

kylesayrs added 2 commits January 29, 2025 02:45

add qwen_2_5_vl

526d3c2

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

remove non-vision example

da2a2fe

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

kylesayrs self-assigned this Jan 29, 2025

kylesayrs added the ready When a PR is ready for review label Jan 29, 2025

kylesayrs added 2 commits January 29, 2025 02:50

fix typo

a4dbb05

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

Merge remote-tracking branch 'origin' into kylesayrs/qwen2_5_vl

59c19fe

brian-dellabetta approved these changes Feb 5, 2025

View reviewed changes

kylesayrs marked this pull request as draft February 6, 2025 18:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[VLM] Qwen 2.5 VL #1113

[VLM] Qwen 2.5 VL #1113

kylesayrs commented Jan 29, 2025

github-actions bot commented Jan 29, 2025

brian-dellabetta left a comment

brian-dellabetta Feb 5, 2025

kylesayrs Feb 5, 2025

kylesayrs commented Feb 6, 2025



	# The following example shows how the Qwen2.5 Vision-Language model can be W4A16 quantized

[VLM] Qwen 2.5 VL #1113

Are you sure you want to change the base?

[VLM] Qwen 2.5 VL #1113

Conversation

kylesayrs commented Jan 29, 2025

Purpose

Changes

Testing

github-actions bot commented Jan 29, 2025

brian-dellabetta left a comment

Choose a reason for hiding this comment

brian-dellabetta Feb 5, 2025

Choose a reason for hiding this comment

kylesayrs Feb 5, 2025

Choose a reason for hiding this comment

kylesayrs commented Feb 6, 2025