
Add all fbgemm kernel Tensors into Int4WeightOnlyConfig and Float8DynamicActivationInt4WeightConfig #2474


Open
wants to merge 1 commit into base: jerryzh168/stack/9 from jerryzh168/stack/10

Conversation

@jerryzh168 jerryzh168 (Contributor) commented Jul 2, 2025

Stacked PRs:


Add all fbgemm kernel Tensors into Int4WeightOnlyConfig and Float8DynamicActivationInt4WeightConfig

Summary:
we will:

  • deprecate FbgemmConfig (later), since it covers only a single kernel.
  • categorize tensors by derived dtype + packing format, e.g. int4 preshuffled, float8 plain.
  • add PackingFormat with preshuffled, plain, and _legacy (for the legacy implementation); see the usage sketch below.
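
A minimal usage sketch of the intended API (hypothetical: the packing_format argument name and string value are assumptions based on the PackingFormat described above, not the landed interface):

  import torch
  from torchao.quantization import quantize_, Int4WeightOnlyConfig

  # toy model; the fbgemm int4 kernels expect bf16 weights on CUDA
  model = torch.nn.Sequential(torch.nn.Linear(128, 256)).to(torch.bfloat16).to("cuda")

  # select the preshuffled int4 kernel through the packing format on the
  # existing config instead of the (to-be-deprecated) FbgemmConfig;
  # packing_format="preshuffled" is an assumption, not confirmed API
  quantize_(model, Int4WeightOnlyConfig(group_size=128, packing_format="preshuffled"))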

Test Plan:
python test/quantization/quantize_/workflows/int4/test_int4_tensor.py
python test/quantization/quantize_/workflows/int4/test_int4_preshuffled_tensor.py
python test/quantization/quantize_/workflows/float8/test_float8_tensor.py

Reviewers:

Subscribers:

Tasks:

Tags:


pytorch-bot bot commented Jul 2, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2474

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 New Failures, 1 Cancelled Job, 1 Unrelated Failure

As of commit 21d492e with merge base 11f1a76:

NEW FAILURES - The following jobs have failed:

CANCELLED JOB - The following job was cancelled. Please retry:

FLAKY - The following job failed but was likely due to flakiness present on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@jerryzh168 jerryzh168 force-pushed the jerryzh168/stack/10 branch from a3d0835 to 4b0c7c7 on July 2, 2025 01:58
jerryzh168 added a commit that referenced this pull request Jul 2, 2025
…micActivationInt4WeightConfig

Summary:
att, we will deprecate FbgemmConfig since it's a single kernel.
we'd like to categorize things to derived dtype + packed format

Test Plan:
python test/quantization/quantize_/test_int4_groupwise_preshuffle.py

Reviewers:

Subscribers:

Tasks:

Tags:

stack-info: PR: #2474, branch: jerryzh168/stack/10
@facebook-github-bot facebook-github-bot added the CLA Signed label Jul 2, 2025
@jerryzh168 jerryzh168 added the topic: new feature label Jul 2, 2025
@jerryzh168 jerryzh168 changed the base branch from jerryzh168/stack/9 to main July 2, 2025 20:35
jerryzh168 added a commit that referenced this pull request Jul 2, 2025
@jerryzh168 jerryzh168 force-pushed the jerryzh168/stack/10 branch from 4b0c7c7 to f5977ce on July 2, 2025 20:36
@jerryzh168 jerryzh168 changed the base branch from main to jerryzh168/stack/9 July 2, 2025 20:36
@jerryzh168 jerryzh168 changed the base branch from jerryzh168/stack/9 to main July 2, 2025 21:42
jerryzh168 added a commit that referenced this pull request Jul 2, 2025
@jerryzh168 jerryzh168 force-pushed the jerryzh168/stack/10 branch 2 times, most recently from 04ce2c5 to afd8703 on July 2, 2025 21:42
jerryzh168 added a commit that referenced this pull request Jul 2, 2025
@jerryzh168 jerryzh168 changed the base branch from main to jerryzh168/stack/9 July 2, 2025 21:42
@jerryzh168 jerryzh168 changed the base branch from jerryzh168/stack/9 to main July 2, 2025 23:44
jerryzh168 added a commit that referenced this pull request Jul 2, 2025
@jerryzh168 jerryzh168 force-pushed the jerryzh168/stack/10 branch from afd8703 to ff4682e on July 2, 2025 23:44
@jerryzh168 jerryzh168 changed the base branch from main to jerryzh168/stack/9 July 2, 2025 23:44
@jerryzh168 jerryzh168 changed the base branch from jerryzh168/stack/9 to main July 3, 2025 00:09
jerryzh168 added a commit that referenced this pull request Jul 3, 2025
@jerryzh168 jerryzh168 force-pushed the jerryzh168/stack/10 branch from ff4682e to 58f8a2a on July 3, 2025 00:09
@jerryzh168 jerryzh168 changed the base branch from main to jerryzh168/stack/9 July 3, 2025 00:09
@jerryzh168 jerryzh168 changed the base branch from jerryzh168/stack/9 to main July 3, 2025 02:18
jerryzh168 added a commit that referenced this pull request Jul 3, 2025
@jerryzh168 jerryzh168 force-pushed the jerryzh168/stack/10 branch from e7b03a9 to b719048 on July 15, 2025 21:11
@jerryzh168 jerryzh168 changed the base branch from main to jerryzh168/stack/9 July 15, 2025 21:11
@jerryzh168 jerryzh168 changed the base branch from jerryzh168/stack/9 to main July 15, 2025 23:12
jerryzh168 added a commit that referenced this pull request Jul 15, 2025
@jerryzh168 jerryzh168 force-pushed the jerryzh168/stack/10 branch from b719048 to 626c82d on July 15, 2025 23:12
@jerryzh168 jerryzh168 changed the base branch from main to jerryzh168/stack/9 July 15, 2025 23:12
@jerryzh168 jerryzh168 changed the base branch from jerryzh168/stack/9 to main July 16, 2025 00:18
jerryzh168 added a commit that referenced this pull request Jul 16, 2025
@jerryzh168 jerryzh168 force-pushed the jerryzh168/stack/10 branch from 626c82d to aba5d26 on July 16, 2025 00:18
@jerryzh168 jerryzh168 changed the base branch from main to jerryzh168/stack/9 July 16, 2025 00:18
@jerryzh168 jerryzh168 changed the base branch from jerryzh168/stack/9 to main July 17, 2025 17:57
jerryzh168 added a commit that referenced this pull request Jul 17, 2025
@jerryzh168 jerryzh168 force-pushed the jerryzh168/stack/10 branch from aba5d26 to 62b35d9 on July 17, 2025 17:57
@jerryzh168 jerryzh168 changed the base branch from main to jerryzh168/stack/9 July 17, 2025 17:58
@jerryzh168 jerryzh168 changed the base branch from jerryzh168/stack/9 to main July 18, 2025 01:42
jerryzh168 added a commit that referenced this pull request Jul 18, 2025
@jerryzh168 jerryzh168 force-pushed the jerryzh168/stack/10 branch from 62b35d9 to 8d6a4b9 on July 18, 2025 01:42
@jerryzh168 jerryzh168 changed the base branch from main to jerryzh168/stack/9 July 18, 2025 01:42
@jerryzh168 jerryzh168 changed the base branch from jerryzh168/stack/9 to main July 18, 2025 01:48
jerryzh168 added a commit that referenced this pull request Jul 18, 2025
@jerryzh168 jerryzh168 force-pushed the jerryzh168/stack/10 branch from 8d6a4b9 to 9fbc186 on July 18, 2025 01:48
@jerryzh168 jerryzh168 changed the base branch from main to jerryzh168/stack/9 July 18, 2025 01:48
@jerryzh168 jerryzh168 changed the base branch from jerryzh168/stack/9 to main July 18, 2025 01:51
jerryzh168 added a commit that referenced this pull request Jul 18, 2025
@jerryzh168 jerryzh168 force-pushed the jerryzh168/stack/10 branch from 9fbc186 to 0b49235 on July 18, 2025 01:51
@jerryzh168 jerryzh168 changed the base branch from main to jerryzh168/stack/9 July 18, 2025 01:51
@jerryzh168 jerryzh168 changed the base branch from jerryzh168/stack/9 to main July 18, 2025 02:29
@jerryzh168 jerryzh168 force-pushed the jerryzh168/stack/10 branch from 0b49235 to 21d492e on July 18, 2025 02:29
@jerryzh168 jerryzh168 changed the base branch from main to jerryzh168/stack/9 July 18, 2025 02:29
Labels
CLA Signed · topic: new feature
Projects
None yet

3 participants