[GPU] Update PagedAttention creation logic (#28802)
### Details:
- Currently, the GenAI library sets fully dynamic shapes for the Key/Value cache buffers, which causes the GPU Plugin to fail during the compile_model() call. Therefore, update the PagedAttention creation logic to use the head_size and heads_num parameters from the rt_info, if available.
- GenAI related PR: openvinotoolkit/openvino.genai#1666
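
The fallback described above can be illustrated with a minimal Python sketch. It is not the GPU Plugin code: the function name `resolve_kv_dims`, the plain-dict stand-in for rt_info, the key names `heads_num` and `head_size`, and the assumed cache layout `[num_blocks, heads_num, head_size, block_size]` are all hypothetical and chosen only to show the idea of preferring rt_info values over shapes that may be fully dynamic.

```python
from typing import Mapping, Optional, Sequence, Tuple

# In this sketch, None stands in for a dynamic (unknown) dimension.
Dim = Optional[int]


def resolve_kv_dims(rt_info: Mapping[str, int],
                    cache_shape: Sequence[Dim]) -> Tuple[int, int]:
    """Return (heads_num, head_size) for a Key or Value cache buffer.

    Hypothetical helper: prefers values stored in rt_info and falls back
    to the cache shape only when its relevant dimensions are static.
    """
    heads_num = rt_info.get("heads_num")
    head_size = rt_info.get("head_size")
    if heads_num is not None and head_size is not None:
        # rt_info carries both parameters, so the shape is not needed.
        return int(heads_num), int(head_size)

    # Assumed layout: [num_blocks, heads_num, head_size, block_size].
    if (len(cache_shape) == 4
            and cache_shape[1] is not None
            and cache_shape[2] is not None):
        return cache_shape[1], cache_shape[2]

    raise ValueError(
        "heads_num/head_size are missing from rt_info and the cache shape "
        "is dynamic"
    )


if __name__ == "__main__":
    # Fully dynamic shape, but rt_info provides the parameters.
    print(resolve_kv_dims({"heads_num": 8, "head_size": 64}, [None] * 4))
```

With a fully dynamic shape and no rt_info entries, the sketch raises an error, which mirrors the situation in which compilation cannot proceed.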