[NVFP4] Expand dynamic types, clean-up conditions #325

Merged
merged 10 commits into from
May 28, 2025

Conversation

@dsikka (Collaborator) commented May 27, 2025

Summary:

  • Expand `dynamic` to include a `local` enum value - with this change, the following conditions are now accepted:
1. If `dynamic` is `True` --> all parameters are generated on the fly
2. If `dynamic` is `False` --> all parameters are statically generated and saved to disk
3. If `dynamic` is `"local"` --> only the local quantization parameters are generated on the fly
  • Expand nvfp4a16 to use `tensor_group` --> this strategy is now associated with the initialization of `global_scale`s for weights and activations, rather than being gated on `is_fp4`
  • Clean up and re-order the initialization conditions
  • Expand testing
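The three `dynamic` conditions above could be sketched roughly as follows. Note that `DynamicType` and `should_compute_on_the_fly` are illustrative names, not the library's actual API; the point is the explicit equality check against the enum value, which avoids the string `"local"` being treated as merely truthy:

```python
from enum import Enum
from typing import Union


class DynamicType(str, Enum):
    # hypothetical enum mirroring the "local" value described in the PR
    LOCAL = "local"


def should_compute_on_the_fly(
    dynamic: Union[bool, DynamicType], is_local_param: bool
) -> bool:
    """Decide whether a quantization parameter is generated at runtime.

    dynamic=True  -> all parameters generated on the fly
    dynamic=False -> all parameters statically generated and saved to disk
    dynamic=local -> only local quantization parameters generated on the fly
    """
    if dynamic is True:
        return True
    if dynamic is False:
        return False
    # explicit check rather than truthiness: "local" would otherwise
    # evaluate to True in a bare `if dynamic:` test
    if dynamic == DynamicType.LOCAL:
        return is_local_param
    raise ValueError(f"Unsupported dynamic value: {dynamic!r}")
```

Because `DynamicType` subclasses `str`, the raw string `"local"` compares equal to `DynamicType.LOCAL`, so both spellings are accepted.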

Testing

  • All existing tests pass + new test cases
  • LLM Compressor test cases pass
  • NVFP4/NVFP4A16 recipes work as expected

Base automatically changed from activation_support to main May 28, 2025 02:04
@brian-dellabetta (Contributor) left a comment

Few comments

@kylesayrs (Contributor) left a comment

As long as we feel confident that there's no expected behavior resulting from "local" being evaluated to True, then this looks good

kylesayrs previously approved these changes May 28, 2025
@dsikka (Collaborator, Author) commented May 28, 2025

As long as we feel confident that there's no expected behavior resulting from "local" being evaluated to True, then this looks good

Yeah, I agree. Used the explicit check in quant_config for better readability.

@dsikka dsikka dismissed stale reviews from kylesayrs and brian-dellabetta via 6cb319b May 28, 2025 18:36
@dsikka dsikka enabled auto-merge (squash) May 28, 2025 18:36
@brian-dellabetta (Contributor) left a comment

nice nice nice

@dsikka dsikka merged commit 3f5705d into main May 28, 2025
1 check passed
@dsikka dsikka deleted the update_dynamic_conditions branch May 28, 2025 21:36
dsikka added a commit to vllm-project/llm-compressor that referenced this pull request May 28, 2025
SUMMARY:
- Requires neuralmagic/compressed-tensors#325
- Uses the new `tensor_group` strategy for nvfp4a16 quantization 
- Removes global_scale as an observer class attribute and instead passes it in as a function-call argument, similar to g_idx
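The pattern in the last bullet - threading `global_scale` through each call rather than storing it on the observer - could look roughly like the sketch below. `MinMaxObserver` and `calculate_qparams` are illustrative stand-ins, not the actual llm-compressor observer API:

```python
class MinMaxObserver:
    """Illustrative sketch of an observer that receives global_scale
    per call (like g_idx) instead of holding it as instance state."""

    def calculate_qparams(self, values, global_scale=None, g_idx=None):
        # min/max over the observed values
        lo, hi = min(values), max(values)
        if global_scale is not None:
            # fold the externally supplied global scale into the range
            lo, hi = lo / global_scale, hi / global_scale
        # map the range onto an unsigned 8-bit grid (toy example)
        scale = (hi - lo) / 255.0 or 1.0
        zero_point = round(-lo / scale)
        return scale, zero_point
```

Keeping `global_scale` out of the observer's state means the same observer instance can serve parameters with different global scales, which matches the stateless direction described above.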
3 participants