[GPU] Fix access dimension error in dynamic shape #32626

hyunback · 2025-10-30T09:59:01Z

Description of the issue(symptom, root-cause, how it was resolved)

The regression appeared after the recent memory reset bugfix PR. The symptom is a get_dims() failure that occurs when the convolution's feature dimension is accessed under dynamic shape.

Solution:
The fix reuses the logic of the existing get_conv_channel_count() utility to determine the channel dimension of the convolution's input/output safely under dynamic shape, which prevents the dimension access failure.
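As a rough illustration of what a "safe" channel lookup can look like, here is a minimal sketch. It is not the code from primitive_inst.cpp or from get_conv_channel_count(): the types `Dim` and `PartialShape`, the function `channel_count_or_dynamic`, and the default channel axis of 1 are all hypothetical stand-ins. The only detail taken from this PR is the use of -1 to mean "channel count is dynamic".

```cpp
#include <cstddef>
#include <cstdint>
#include <optional>
#include <vector>

// Hypothetical stand-in for a partially known shape:
// std::nullopt marks a dimension that is still dynamic.
using Dim = std::optional<int64_t>;
using PartialShape = std::vector<Dim>;

// Sentinel meaning "channel count is not known yet" (assumed to match the -1 check in the diff).
constexpr int64_t kDynamicChannels = -1;

// Returns the channel count at channel_axis, or kDynamicChannels when the
// shape has too few dimensions or the channel dimension is dynamic.
// Unlike a direct dimension access, this never fails on a dynamic shape.
int64_t channel_count_or_dynamic(const PartialShape& shape, std::size_t channel_axis = 1) {
    if (channel_axis >= shape.size())
        return kDynamicChannels;
    const Dim& d = shape[channel_axis];
    return d.has_value() ? *d : kDynamicChannels;
}
```

The point of the sketch is that the caller receives a sentinel and decides what to do with it, instead of the lookup itself failing when a dimension is not yet known.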

Reproduction step and snapshot (if applicable. Do not attach for customer model)

E2E

python tools/llm_bench/benchmark.py -m ov-share-05.sclab.intel.com/cv_bench_cache/latest_models_llm/qwen2-vl-7b-instruct/pytorch/ov/OV_FP16-4BIT_DEFAULT -d GPU.1 -mc 1 -ic 256 -n 3 -pf frameworks.ai.openvino.llm.prompts/32_1024/qwen2-vl-7b-instruct.jsonl

Benchmark_app

benchmark_app -d GPU.1 --hint none -nireq 1 -niter 1 -m ov-share-05.sclab.intel.com/cv_bench_cache/latest_models_llm/qwen2-vl-7b-instruct/pytorch/ov/OV_FP16-4BIT_DEFAULT/openvino_vision_embeddings_model.xml -data_shape hidden_states[1,1176]

Checklist

- [x] Is it a proper fix? Yes
- [x] Did you include a test case for this fix, if necessary?
- [x] Did you review existing tests that can be extended to cover this scenario? Which test did you review? mem_reset_test.cpp

Tickets:

CVS-175613

Signed-off-by: hyunback <[email protected]>

Lyamin-Roman · 2025-10-30T14:30:15Z

This looks like a fix for the same problem as #32601.

isanghao · 2025-10-31T06:38:02Z

src/plugins/intel_gpu/src/graph/primitive_inst.cpp

+            // If the channel count is dynamic, we cannot verify feature alignment,
+            // so we conservatively skip the reset and return false for this condition.
+            if (in_channel_count == -1)
+                return false;


I think the conservative behavior here is to reset the memory, since skipping the reset could cause an accuracy issue. Do we have a case where this makes a difference in performance? If not, let's return true.

Agreed, returning true is correct. Applied.
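The behavior agreed on in this thread can be sketched as follows. This is a simplified, standalone illustration rather than the actual code in primitive_inst.cpp: the function name `needs_reset_for_feature_padding` and the constant `kDynamicChannelCount` are hypothetical, and the real check sits inside a larger condition about single-blocked layouts.

```cpp
#include <cstdint>

// Sentinel for a channel count that is not known yet (dynamic shape).
constexpr int64_t kDynamicChannelCount = -1;

// Decides whether the output buffer should be zero-filled before reuse.
// feature_block_size is the blocking factor of the single-blocked layout
// (for example 16) and is assumed to be greater than zero.
bool needs_reset_for_feature_padding(int64_t in_channel_count, int64_t feature_block_size) {
    // Dynamic channel count: alignment cannot be verified, so reset to stay
    // on the safe side for accuracy (the "return true" agreed in review).
    if (in_channel_count == kDynamicChannelCount)
        return true;
    // A feature size that is not a multiple of the block size leaves padding
    // in the last block, which must be guaranteed to be zero-filled.
    return in_channel_count % feature_block_size != 0;
}
```

With a block size of 16, a channel count of 32 needs no reset, while 24 or an unknown count does. Returning true for the unknown case trades a possibly unnecessary reset for guaranteed zero-filling.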

isanghao · 2025-10-31T06:39:02Z

src/plugins/intel_gpu/src/graph/primitive_inst.cpp

            // if layout is single blocked and feature size is not aligned with the blocking size, need to reset output so that we can guarantee zero-filling
            // NOTE: We may improve this logic to avoid reset if we are sure that it is not "corrupted" by other layers.
-            if (output_layout.feature() % feature_block_size != 0) {
+            if (in_channel_count % feature_block_size != 0) {


Could you add a test case in mem_reset_test.cpp?

Signed-off-by: hyunback <[email protected]>

isanghao

LGTM

hyunback requested review from a team as code owners October 30, 2025 09:59

hyunback added the "category: GPU" (OpenVINO GPU plugin), "WIP" (work in progress), and "under_perf_check" labels Oct 30, 2025

hyunback changed the title ~~[GPU] Fix Qwen2VL7BInstructInt4 failure.~~ [GPU] Fix access dimension error in dynamic shape Oct 30, 2025

hyunback removed the WIP work in progress label Oct 31, 2025

isanghao reviewed Oct 31, 2025

Apply unit-test

a9a03ed

Signed-off-by: hyunback <[email protected]>

hyunback removed the under_perf_check label Nov 1, 2025

isanghao approved these changes Nov 3, 2025

geunhwan added this to the 2025.4 milestone Nov 3, 2025

geunhwan added the Code Freeze label Nov 3, 2025

isanghao added this pull request to the merge queue Nov 3, 2025

Merged via the queue into openvinotoolkit:master with commit b4bac1c Nov 3, 2025
187 checks passed
