Conversation

@mklimenk (Contributor)

Description of the issue

Integrated GPUs starting from Xe2 can benefit from reusing the host-side buffer for the weights. This avoids allocating a device-side buffer in the same physical memory, giving a significant memory footprint reduction with no runtime penalty. Previously this was enabled only for LNL (#31600), but for AI weights that don't benefit from compression there is no need to limit the functionality to that platform.
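
As a rough, hypothetical sketch of the idea (not the actual OpenVINO GPU plugin code; the buffer type and function names below are made up for illustration): on an integrated GPU the device-side copy would live in the same physical DRAM as the host buffer, so skipping it saves roughly the size of the weights without changing kernel access speed.

```cpp
// Hypothetical sketch only; not the actual OpenVINO GPU plugin implementation.
#include <memory>

struct memory_buffer { /* opaque handle to a USM allocation */ };

// On integrated Xe2+ GPUs, hand the host-side weights buffer straight to the
// kernels; otherwise fall back to the usual usm_device allocation plus copy.
std::shared_ptr<memory_buffer> prepare_weights(std::shared_ptr<memory_buffer> host_weights,
                                               bool host_buffer_reuse_allowed) {
    if (host_buffer_reuse_allowed) {
        // No device-side allocation and no copy: the footprint stays at one copy
        // of the weights, which is where the ~model-sized saving comes from.
        return host_weights;
    }
    // Discrete GPUs (and older iGPUs): allocate usm_device memory and copy into it.
    auto device_weights = std::make_shared<memory_buffer>();
    // ... copy host_weights into device_weights here ...
    return device_weights;
}
```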

Reproduction step and snapshot

  • $ benchmark_app -d GPU -hint latency -nireq 1 -t 30 -b 1 -m -ip f32 -op f32
    Check the "Compile model ram used" metric. For an fp16 Stable Diffusion model of roughly 600 MB, the footprint drops by ~600 MB on multiple platforms; more details in the ticket.

Checklist

  • Is it a proper fix? (not a workaround)
  • Did you include a test case for this fix, if necessary?
  • Did you review existing tests that could be extended to cover this scenario? Which tests did you review?

Tickets:

@mklimenk requested review from a team as code owners on November 18, 2025 15:27
@github-actions bot added the "category: GPU" (OpenVINO GPU plugin) label on Nov 18, 2025
@sys-openvino-ci added the "ExternalIntelPR" (External contributor from Intel) label on Nov 18, 2025
@mklimenk changed the title from "Allow host buffer access for Xe2+ iGPUs" to "[GPU] Allow host buffer access for Xe2+ iGPUs" on Nov 18, 2025
@p-durandin (Contributor)

build_jenkins

@Lyamin-Roman (Contributor) left a comment

Have there been any performance tests? We need to verify that this really doesn't cause any performance drops.

@mklimenk (Contributor, Author)

@Lyamin-Roman, yes, I've checked performance on a set of models, as well as some synthetic tests, such as a model consisting of a single GEMM with different dimensions. Every test demonstrated the same performance (±1%).

@isanghao (Contributor)

Did you check with the driver team about this change? This is actually different from what we heard from the driver team previously.

The memory footprint reduction is unexpected; I suspect there is an underlying memory footprint issue that is being hidden by this change.

@p-durandin added this pull request to the merge queue on Nov 19, 2025
```diff
 if (alloc_type == allocation_type::usm_host || alloc_type == allocation_type::usm_shared) {
-    // usm_device memory does not provide performance benefits on the LNL platform
-    if (get_engine().get_device_info().arch == gpu_arch::xe2 &&
+    // usm_device memory does not provide performance benefits on the integrated Xe2+ platforms
```
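
As a self-contained sketch, the relaxed eligibility check amounts to something like the following (the enums and the helper are stand-ins for the plugin's internal types, and the exact condition in the merged code may differ):

```cpp
// Stand-in types for illustration; the real definitions live in the GPU plugin's runtime.
enum class gpu_arch { xe, xe2, xe3 };                    // ordered so newer archs compare greater
enum class device_type { integrated_gpu, discrete_gpu };

struct device_info {
    gpu_arch arch;
    device_type dev_type;
};

// Old behaviour: only LNL (an Xe2 iGPU) kept the weights in the host-side buffer.
// New behaviour: any integrated Xe2-or-newer part qualifies.
bool host_buffer_eligible(const device_info& info) {
    return info.dev_type == device_type::integrated_gpu && info.arch >= gpu_arch::xe2;
}
```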

On PTL integrated parts we have end-to-end (e2e) compression available for device USM allocations. This means that if the data is nicely compressible, you may see compression benefits from using device USM.

@mklimenk (Contributor, Author)

Yes, but at the same time the trained weights aren't typically compressible.

Yes, then this would help to reduce the memory footprint and reduce the number of copies.

Merged via the queue into openvinotoolkit:master with commit a930596 Nov 19, 2025
212 of 216 checks passed