
Conversation

@kinjalpatel27
Contributor

Co-authored-by: Asma Kuriparambil Thekkumpate <[email protected]>

What does this PR do?

Type of change: New Feature

Overview:
This PR adds support for quantizing TE ops in Megatron, specifically TERowParallelLinear, TEColumnParallelLinear, and TELayerNormColumnParallelLinear.

Usage

It can be used by enabling the TE layer spec in Megatron.
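As an illustration of what quantizing these linear ops means, here is a conceptual pure-Python sketch of per-tensor symmetric INT8 fake quantization (this is not the ModelOpt or TE API; the function and variable names are hypothetical):

```python
# Conceptual sketch only: per-tensor symmetric INT8 fake quantization,
# the kind of transform a quantized linear op applies to its weights
# during calibration/QAT. Not the actual ModelOpt/TE implementation.

def fake_quantize_int8(values):
    """Quantize a list of floats to INT8 and dequantize back."""
    amax = max(abs(v) for v in values)       # per-tensor absolute max
    scale = amax / 127.0 if amax else 1.0    # map [-amax, amax] -> [-127, 127]
    q = [max(-127, min(127, round(v / scale))) for v in values]
    return [qi * scale for qi in q]          # dequantized ("fake quant") values

weights = [0.5, -1.27, 0.03, 1.0]
fq = fake_quantize_int8(weights)

# Each fake-quantized value stays within one quantization step of the original.
step = max(abs(w) for w in weights) / 127.0
assert all(abs(a - b) <= step for a, b in zip(weights, fq))
```

The quantized module runs this transform on the fly so the surrounding parallelism logic (row/column sharding) is unchanged.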

Testing

Added unit tests for testing functionality
test_homogeneous_sharded_state_dict_te_spec
test_convert_mcore_te_gpt_model
test_quantize_forward_backward

Before your PR is "Ready for review"

  • Make sure you read and follow Contributor guidelines and your commits are signed.
  • Is this change backward compatible?: Yes
  • Did you write any new necessary tests?: Yes
  • Did you add or update any necessary documentation?: Yes
  • Did you update Changelog?: Yes

Additional Information

@kinjalpatel27 kinjalpatel27 requested review from a team as code owners December 2, 2025 20:35
@codecov

codecov bot commented Dec 2, 2025

Codecov Report

❌ Patch coverage is 83.78378% with 6 lines in your changes missing coverage. Please review.
✅ Project coverage is 74.50%. Comparing base (53a2dde) to head (ddecbc3).
⚠️ Report is 7 commits behind head on main.

Files with missing lines Patch % Lines
modelopt/torch/utils/logging.py 80.00% 6 Missing ⚠️
Additional details and impacted files
@@           Coverage Diff           @@
##             main     #632   +/-   ##
=======================================
  Coverage   74.50%   74.50%           
=======================================
  Files         183      183           
  Lines       18400    18444   +44     
=======================================
+ Hits        13709    13742   +33     
- Misses       4691     4702   +11     


Contributor

@mxinO mxinO left a comment


Looks great to me! Do we still need a special ModelOpt layer spec after this PR?

Co-authored-by: Asma Kuriparambil Thekkumpate <[email protected]>
Co-authored-by: Kinjal Patel <[email protected]>
Signed-off-by: Kinjal Patel <[email protected]>
@kinjalpatel27 kinjalpatel27 requested a review from mxinO December 8, 2025 17:09
@kinjalpatel27 kinjalpatel27 force-pushed the kinjal/te_megatron_support branch from d7c4802 to 021718d Compare December 8, 2025 17:28
Contributor

@meenchen meenchen left a comment


Do you know how much improvement we can get by enabling TE?

Signed-off-by: Kinjal Patel <[email protected]>
Contributor

@meenchen meenchen left a comment


LGTM. I am wondering if you have some data points on enabling TE in terms of perf and accuracy. Also, do we need to update the ModelOpt example in Megatron to enable this by default?

@kinjalpatel27
Contributor Author

kinjalpatel27 commented Dec 8, 2025

Thanks @meenchen

I am wondering if you have some data points of enabling TE in terms of perf and accuracy.

@realAsma, do you have data points in terms of perf and accuracy for TE?

Also, do we need to update the modelopt example in megatron to enable this by default?

We will enable TE by default in Megatron and remove the local spec to keep things consistent between megatron-core and the Megatron + ModelOpt example.

Signed-off-by: Kinjal Patel <[email protected]>
@realAsma
Contributor

I am wondering if you have some data points of enabling TE in terms of perf and accuracy.
@meenchen

  1. TELinear support will help us get rid of the ModelOpt model provider and simplify and clean up MCore flows, so we would be able to quantize any MCore model without having to explicitly convert to the local spec.
  2. TELayerNormColumnParallelLinear reduces memory consumption by orders of magnitude because of sequence parallelism. With the fused TELayerNorm we don't checkpoint the intermediate activations between the layer norm and the linear layer.
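To make point 2 concrete, here is a back-of-envelope sketch (with hypothetical model shapes, not measured numbers) of the layer-norm activations that the fused op avoids checkpointing:

```python
# Back-of-envelope sketch, hypothetical shapes: the activation a fused
# LayerNorm+Linear avoids storing is the layer-norm output, one
# [seq_len, batch, hidden] tensor per transformer layer.

def layernorm_activation_bytes(seq_len, batch, hidden, layers, bytes_per_el=2):
    """Memory for the layer-norm outputs that fusion avoids storing (bf16/fp16)."""
    return seq_len * batch * hidden * layers * bytes_per_el

# Example: 4096-token sequences, batch 8, hidden 4096, 32 layers, bf16.
saved = layernorm_activation_bytes(4096, 8, 4096, 32)
print(f"~{saved / 2**30:.1f} GiB of layer-norm activations not checkpointed")
# prints: ~8.0 GiB of layer-norm activations not checkpointed
```

Sequence parallelism further divides this per-GPU footprint by the tensor-parallel size, which is where the large savings come from.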

Signed-off-by: Kinjal Patel <[email protected]>
@kinjalpatel27 kinjalpatel27 merged commit f731379 into main Dec 11, 2025
48 of 50 checks passed
@kinjalpatel27 kinjalpatel27 deleted the kinjal/te_megatron_support branch December 11, 2025 20:01
b7r6 pushed a commit to weyl-ai/Model-Optimizer that referenced this pull request Dec 18, 2025