New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

[#168 fix] add context manager to fake `ScalingTensor`/`ScalingParameter`'s `class` as `torch.Tensor` #169

Open

152334H wants to merge 2 commits into Azure:main from 152334H:main

152334H commented Mar 6, 2024

Description
See #168. This is the most non-invasive fix I could come up with. Thanks to @aliencaocao for idea.

Minor Revision

adds msamp.common.tensor.tensor.pretend_scaling_is_torch, which can be used to fix GradScaler().step().

This is a non-breaking change as it does not deviate from prior behaviour without explicitly calling with pretend_scaling_is_torch().

152334H added 2 commits

March 6, 2024 04:49


          add context manager to fake ScalingTensor -> torch.Tensor

fcae35d


          mnist ddp/single gpu examples fixed

e1a1a06

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet