@justincdavis
Summary

This PR adds the CV-CUDA backend kernel for the Normalize transform.

How to use

import cvcuda
import torchvision.transforms.v2.functional as F

cvc_tensor = cvcuda.Tensor((1, 224, 224, 3), cvcuda.Type.F32, cvcuda.TensorLayout.NHWC)
# Dispatches to F.normalize_cvcuda
normalized_tensor = F.normalize(cvc_tensor, [0.485, 0.456, 0.406], [0.229, 0.224, 0.225])

Run unit tests

pytest test/test_transforms_v2.py::TestNormalizeCVCUDA
...
60 passed in 0.59s

@pytorch-bot

pytorch-bot bot commented Nov 19, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/vision/9279

Note: Links to docs will display an error until the docs builds have been completed.

❗ 2 Active SEVs

There are 2 currently active SEVs. If your PR is affected, please view them below:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@meta-cla

meta-cla bot commented Nov 19, 2025

Hi @justincdavis!

Thank you for your pull request and welcome to our community.

Action Required

In order to merge any pull request (code, docs, etc.), we require contributors to sign our Contributor License Agreement, and we don't seem to have one on file for you.

Process

In order for us to review and merge your suggested changes, please sign at https://code.facebook.com/cla. If you are contributing on behalf of someone else (e.g. your employer), the individual CLA may not be sufficient and your employer may need to sign the corporate CLA.

Once the CLA is signed, our tooling will perform checks and validations. Afterwards, the pull request will be tagged with CLA signed. The tagging process may take up to 1 hour after signing. Please give it that time before contacting us about it.

If you have received this in error or have any questions, please contact us at [email protected]. Thanks!

@AntoineSimoulin (Member) left a comment

Hey @justincdavis, thanks for submitting the PR, this is looking good :) I left some minor comments. I think we mainly need to make sure the tests pass when cvcuda is not installed!

(F.normalize_video, tv_tensors.Video),
pytest.param(
F._misc._normalize_cvcuda,
_import_cvcuda().Tensor,
Member

@justincdavis it seems that _import_cvcuda().Tensor is still raising an error if cvcuda is not installed. Maybe we can just use cvcuda.Tensor here and see if this works better?

Author

Thank you for pointing this out! I replaced the actual cvcuda.Tensor type with the string "cvcuda.Tensor", then inside the function we resolve the cvcuda.Tensor type if we have the corresponding string. LMK if this looks like a reasonable solution!
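The string-based resolution described above could be sketched roughly as follows. Note this is a hypothetical illustration, not torchvision's actual implementation: the helper name `resolve_input_type` is made up, and the real code may structure the lookup differently.

```python
import importlib
import importlib.util


def resolve_input_type(type_or_name):
    """Resolve a "module.Attribute" string to the real type on demand.

    Keeping the dispatch key as the string "cvcuda.Tensor" means the
    module that declares it imports cleanly even when cvcuda is absent.
    """
    if isinstance(type_or_name, str):
        module_name, _, attr = type_or_name.rpartition(".")
        if importlib.util.find_spec(module_name) is None:
            return None  # module (e.g. cvcuda) not installed; caller can skip
        return getattr(importlib.import_module(module_name), attr)
    return type_or_name  # already a real type; pass through unchanged
```

With this pattern a test parametrization can list "cvcuda.Tensor" as a plain string, and entries whose module is missing resolve to None and can be skipped instead of raising at collection time.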

@justincdavis
Author

justincdavis commented Nov 24, 2025

Following up from my comment in the _normalize_cvcuda function itself: CV-CUDA requires that the mean and scale tensors be on-device when we call cvcuda.normalize, so two host->device memcpys occur on every normalize call with the CV-CUDA backend. We could reduce this cost with a helper function that builds the tuple[cvcuda.Tensor, cvcuda.Tensor] from the mean/std parameters and caches the result. Based on what I see in the codebase, this would be a new feature in torchvision for a functional transform.

# CV-CUDA requires float32 tensors for the mean/std parameters.
# At small batch sizes, building them is costly relative to the normalize op itself.
# If CV-CUDA is known to be the backend, this could be optimized:
#   For the Normalize class: create the tensors at class initialization time.
#   For the functional API: cache the tensors in a helper via functools.lru_cache
#   (would it even be worth it?).
# Since CV-CUDA is 1) not the default backend and 2) only strictly faster at
# large batch sizes, we ignore this for now.

@AntoineSimoulin
Member

Hey @justincdavis, this is looking good to me. I don't think the failing test is related to this PR; it looks like a false positive alert to me. Can you sign our Contributor License Agreement (cf. the meta-cla bot comment in the discussion)?
