Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

optimum export openvino ,use gptq&scale_estimation error #1124

Open
qihui-liu opened this issue Jan 21, 2025 · 1 comment
Open

optimum export openvino ,use gptq&scale_estimation error #1124

qihui-liu opened this issue Jan 21, 2025 · 1 comment

Comments

@qihui-liu
Copy link

Exception has occurred: ValueError
could not broadcast input array from shape (84934656,) into shape (9216,)
File "C:\Users\admin\Desktop\convert_model.py", line 15, in
model = OVModelForCausalLM.from_pretrained(model_id, export=True,quantization_config=quantization_config)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
ValueError: could not broadcast input array from shape (84934656,) into shape (9216,)

source code:

from optimum.intel import OVModelForCausalLM,OVWeightQuantizationConfig
from transformers import AutoTokenizer

model_id = "./Phi-3.5-mini-instruct"
quantization_config = OVWeightQuantizationConfig(
bits=4,
sym=True,
quant_method="awq",
scale_estimation=True,
group_size=-1,
gptq=True,
dataset="wikitext2"
)

model = OVModelForCausalLM.from_pretrained(model_id, export=True,quantization_config=quantization_config)
tokenizer = AutoTokenizer.from_pretrained(model_id)
save_directory = "./Phi3.5-ov-awq-gptq"
model.save_pretrained(save_directory)
tokenizer.save_pretrained(save_directory)

What shall I do?

@qihui-liu
Copy link
Author

optimum-cli export openvino --model .\Phi-3.5-mini-instruct\ --task text-generation-with-past --weight-format int4 --group-size -1 --sym --awq --dataset wikitext2 --gptq --scale-estimation phi-3.5-mini-int4-awq-gptq-scale

It has the same effect.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant