Add self-detecting on-the-fly bfloat16->float16 conversion pass #741
base: ovep-develop
Conversation
I am more aligned with this change.
Force-pushed from a02a919 to c594c4d.
@@ -453,6 +465,16 @@ BackendManager::GetModelProtoFromFusedNode(const onnxruntime::Node& fused_node,
    DumpOpenVINOEPModel(onnx_model_path_name, model_proto.get(), fused_node);
    ORT_ENFORCE(status.IsOK(), status.ErrorMessage());
    return model_proto;
  } else if (HasBf16(subgraph)) {
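For context, a minimal sketch of what a detection helper like `HasBf16` could look like, assuming it scans a raw ONNX `GraphProto` for bfloat16 initializers and value infos; the PR's actual helper may instead walk onnxruntime's `GraphViewer` and cover additional tensor locations:

```cpp
#include <onnx/onnx_pb.h>

// Hypothetical sketch: returns true if any initializer or value info in the
// graph uses the BFLOAT16 element type. Not the PR's actual implementation.
bool HasBf16(const onnx::GraphProto& graph) {
  for (const auto& initializer : graph.initializer()) {
    if (initializer.data_type() == onnx::TensorProto_DataType_BFLOAT16) {
      return true;
    }
  }
  for (const auto& value_info : graph.value_info()) {
    const auto& type = value_info.type();
    if (type.has_tensor_type() &&
        type.tensor_type().elem_type() == onnx::TensorProto_DataType_BFLOAT16) {
      return true;
    }
  }
  return false;
}
```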
Is a check needed for enable_qdq_optimizer? Should you check for GPU here? Please let me know if you support EP context graphs.
Not necessarily; this is a universal pass, which works for all the IPs.
UPD to an edited comment: this is the else condition for all the qdq_scales-related graph modifications. Overall, qdq_scales and bfloat16 are mutually exclusive, so the current logic is the following: if the qdq_scaling pass is requested, we take that path, with two different variants for NPU and GPU. Else, if the model has bfloat16 initializers, we convert them to fp16 in this pass. Otherwise, we transfer the model directly to OpenVINO.
Regarding EP context graphs: no, they're not supported, since they're basically an encapsulated OVIR and we can only redirect it to OV, nothing more. So if a customer requests support for bfloat16 EP context models, we'd need to solve it on the OV side.
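For illustration, the control flow described above could be sketched as follows; every name except HasBf16 is a hypothetical placeholder, not the PR's actual helper:

```cpp
#include <memory>

// Illustrative stubs standing in for the real EP machinery.
struct Subgraph {};
struct ModelProto {};
bool EnableQdqOptimizer();                                       // qdq_scales pass requested?
bool HasBf16(const Subgraph& g);                                 // any bfloat16 tensors?
std::unique_ptr<ModelProto> RunQdqScalesPass(const Subgraph&);   // NPU/GPU variants inside
std::unique_ptr<ModelProto> ConvertBf16ToFp16(const Subgraph&);  // the new pass from this PR
std::unique_ptr<ModelProto> PassThrough(const Subgraph&);

std::unique_ptr<ModelProto> PrepareModel(const Subgraph& subgraph) {
  if (EnableQdqOptimizer()) {
    // qdq_scales and bfloat16 are treated as mutually exclusive.
    return RunQdqScalesPass(subgraph);
  } else if (HasBf16(subgraph)) {
    return ConvertBf16ToFp16(subgraph);
  }
  return PassThrough(subgraph);  // hand the model to OpenVINO unchanged
}
```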
Changes look good. Please look at the review comments and see if you have subscribed to the coding style.
Please update the branch.
@mklimenk |
A follow-up to #740 with changed logic: instead of relying on an external configuration key, perform the bfloat16->float16 conversion whenever the model contains at least one bfloat16 tensor.
https://jira.devtools.intel.com/browse/CVS-170592
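For reference, a self-contained sketch of the per-element numeric technique such a pass relies on: widen bfloat16 to float32 (bfloat16 is the upper 16 bits of an IEEE-754 float32), then narrow to float16 with round-to-nearest-even. This illustrates the conversion, not the PR's code, and it simplifies subnormal handling by flushing to zero:

```cpp
#include <cstdint>
#include <cstdio>
#include <cstring>

// Widen bfloat16 bits to float32: a plain 16-bit left shift.
float Bf16ToFloat(uint16_t bf16_bits) {
  uint32_t f32_bits = static_cast<uint32_t>(bf16_bits) << 16;
  float result;
  std::memcpy(&result, &f32_bits, sizeof(result));
  return result;
}

// Narrow float32 to float16 bits with round-to-nearest-even.
// Simplified: finite overflow clamps to infinity, underflow flushes to zero.
uint16_t FloatToFp16(float value) {
  uint32_t bits;
  std::memcpy(&bits, &value, sizeof(bits));
  uint16_t sign = static_cast<uint16_t>((bits >> 16) & 0x8000);
  uint32_t exp_field = (bits >> 23) & 0xFF;
  uint32_t mantissa = bits & 0x7FFFFF;
  if (exp_field == 0xFF) {  // inf or NaN in the input
    return sign | 0x7C00 | (mantissa ? 0x0200 : 0);
  }
  int32_t exponent = static_cast<int32_t>(exp_field) - 127 + 15;
  if (exponent >= 31) return sign | 0x7C00;  // finite overflow -> infinity
  if (exponent <= 0) return sign;            // underflow: flush to zero (simplified)
  // Round to nearest-even when dropping the low 13 mantissa bits.
  uint32_t rounded = mantissa + 0x0FFF + ((mantissa >> 13) & 1);
  if (rounded & 0x800000) {  // rounding carried into the exponent
    rounded = 0;
    if (++exponent >= 31) return sign | 0x7C00;
  }
  return sign | static_cast<uint16_t>(exponent << 10) |
         static_cast<uint16_t>(rounded >> 13);
}

int main() {
  // 0x3FC0 in bfloat16 is 1.5; it converts to float16 0x3E00 (also 1.5).
  uint16_t fp16 = FloatToFp16(Bf16ToFloat(0x3FC0));
  std::printf("bf16 0x3FC0 -> fp16 0x%04X\n", fp16);
  return 0;
}
```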