Fix OpenVINO GPU Reshape Error & Disable model compilation for CPU EP #491

weiyuanyue · 2025-11-05T06:44:43Z

Fix OpenVINO GPU Reshape Error

A prior PR introduced explicit free dimension overrides (batch, channels, height, width) to help NVTensorRTRTX (TensorRT EP) initialize models with dynamic dimensions. In our current pipeline the VAE latent input size is effectively static (e.g. 64×64 for 512×512 output), so height/width do not need overriding. Leaving them in place is harmless for most EPs, but OpenVINO GPU applies additional graph optimizations that, with these overrides present, transform a transpose + reshape sequence incorrectly and result in an impossible reshape (element count mismatch).
- Root Cause Details
  - Overrides applied: AddFreeDimensionOverrideByName("height", H/8) and ("width", W/8).
    Actual model latent spatial dims are already constant; the overrides re‑assert values the optimizer no longer treats as purely static.
  - OpenVINO GPU optimization path (layout normalization + constant folding) duplicates or misinterprets flattened spatial tokens, producing an inferred input shape [1,4096,4096,512] for a node that then attempts to reshape to [4096,512]. Element counts differ (8,589,934,592 vs 2,097,152) ⇒ failure.
  - Removing only the height/width overrides makes the issue disappear across repeated runs; batch/channels overrides are safe.
  - This does not reproduce with CPU EP or other providers because their optimization passes are less aggressive with respect to these particular shape transformations.
- Current Mitigation
  We have commented out the height/width overrides; initialization succeeds on OpenVINO GPU. No other EP shows negative impact from their removal.
Disable model compilation for CPU EP

The CPU Execution Provider in ONNX Runtime does not implement EPContext model compilation, so invoking compilation on the CPU EP will fail. We removed the [Compile model] checkbox when the user selects the CPU EP.

Milly Wei (from Dev Box) and others added 13 commits October 29, 2025 18:58

clean code

c557581

update

385b5b4

Merge remote-tracking branch 'origin/main' into milly/fixep

50d8242

update

4ae718a

update

779c426

update

c9bce06

update

779c006

update

4d55afc

update

db92b81

update

b42b245

update

76f1cd2

update

1675400

Merge branch 'main' into milly/fixep

13a581e

weiyuanyue marked this pull request as ready for review November 5, 2025 06:48

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix OpenVINO GPU Reshape Error & Disable model compilation for CPU EP #491

Fix OpenVINO GPU Reshape Error & Disable model compilation for CPU EP #491

weiyuanyue commented Nov 5, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Fix OpenVINO GPU Reshape Error & Disable model compilation for CPU EP #491

Are you sure you want to change the base?

Fix OpenVINO GPU Reshape Error & Disable model compilation for CPU EP #491

Conversation

weiyuanyue commented Nov 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

weiyuanyue commented Nov 5, 2025 •

edited

Loading