Conversation


@attafosu attafosu marked this pull request as ready for review September 12, 2025 18:45
@xuechendi (Collaborator):

Do we see performance improvement?


key_rot = key[..., :self.rotary_dim]    # slice that receives rotary embedding
key_pass = key[..., self.rotary_dim:]   # remaining dims pass through unchanged
key_rot = apply_rotary_pos_emb(key_rot, cos, sin, None, 0, rope_mode)
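For context, the partial-rotary pattern in the quoted lines can be sketched as follows. This is a minimal NumPy stand-in using the rotate-half formulation, not vLLM's actual `apply_rotary_pos_emb` kernel; the names `apply_partial_rope` and `rotate_half` are hypothetical helpers for illustration.

```python
# Minimal sketch (assumed rotate-half RoPE, NOT vLLM's real kernel):
# only the first rotary_dim channels of each head are rotated; the
# rest pass through unchanged.
import numpy as np

def rotate_half(x):
    # Split the last dim in half and rotate pairs: (x1, x2) -> (-x2, x1)
    x1, x2 = np.split(x, 2, axis=-1)
    return np.concatenate([-x2, x1], axis=-1)

def apply_partial_rope(key, cos, sin, rotary_dim):
    # key: [..., head_dim]; cos/sin: [..., rotary_dim]
    key_rot = key[..., :rotary_dim]
    key_pass = key[..., rotary_dim:]
    key_rot = key_rot * cos + rotate_half(key_rot) * sin
    return np.concatenate([key_rot, key_pass], axis=-1)
```

A sanity check: at position 0 (cos = 1, sin = 0) the rotation is the identity, so the key comes back unchanged.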
@xuechendi (Collaborator):
I did a comparison with the existing forward_native; the major difference seems to be apply_rotary_pos_emb vs apply_rotary_emb_torch. Could you check whether we actually get a perf gain with the oot impl, or whether we can use native?

@attafosu (Contributor, Author), Sep 12, 2025:

Yeah, I did some quick tests and there's some perf gain over the default:
forward_native: 11.53 tok/sec
forward_oot: 12.32 tok/sec
This was on a smaller-sized image, and I expect the gain to be more pronounced on even bigger inputs (text or image).
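The tok/sec comparison above can be reproduced with a simple wall-clock measurement. The sketch below is a hypothetical harness, not the benchmark actually used in this PR; `generate_fn` is a stand-in for a decode loop running either forward_native or forward_oot.

```python
# Hypothetical tok/sec micro-benchmark sketch. generate_fn is an assumed
# callable that produces num_tokens tokens with one of the two forward
# implementations being compared.
import time

def tokens_per_second(generate_fn, num_tokens):
    # Time one full generation run and report tokens per wall-clock second.
    start = time.perf_counter()
    generate_fn(num_tokens)
    elapsed = time.perf_counter() - start
    return num_tokens / elapsed
```

Running this once per implementation on the same prompt yields directly comparable tok/sec figures like those quoted above.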

@xuechendi (Collaborator):

please fix pre-commit

@xuechendi (Collaborator):

/run-gaudi-tests

@xuechendi xuechendi enabled auto-merge (squash) September 12, 2025 21:36
@xuechendi xuechendi disabled auto-merge September 16, 2025 23:24
@xuechendi xuechendi merged commit b94548a into vllm-project:main Sep 16, 2025
8 checks passed
PatrykWo pushed a commit to PatrykWo/vllm-gaudi-vllm-docker that referenced this pull request Sep 17, 2025
- HPU Mrope implementation had a bug which was exposed by
vllm-project/vllm#24444
- Initial workaround was to use the default implementation:
vllm-project#162
- This PR fixes the bug in the HPU mrope

---------

Signed-off-by: attafosu <[email protected]>
Co-authored-by: Chendi.Xue <[email protected]>