google / aqt Public

Pull requests: google/aqt

67 Open 658 Closed

Pull requests list

Avoid jax.promote_dtype(float8_e4m3fn, x) failure when using aqt_einsum with fp8_e4m3fn dtype

#759 opened Mar 4, 2025 by copybara-service bot

Update references to JAX's GitHub repo

#752 opened Dec 5, 2024 by copybara-service bot

dummy

#751 opened Nov 21, 2024 by copybara-service bot

Add rounding bias that controls the rounding threshold

#750 opened Nov 20, 2024 by copybara-service bot

Enable fake quant in the bwd with local_aqt

#748 opened Nov 13, 2024 by copybara-service bot

Update the accumulation dtype if FP8 precision is used for AQT.

#745 opened Nov 5, 2024 by copybara-service bot

[sharding_in_types][Take 2] Add out_type argument to einsum and dot_general to allow specifying the output type. Right now, it only accepts a NamedSharding, but in the future we can allow a polymorphic type of: jax.ShapeDtypeStruct | Sharding | Layout.

#743 opened Oct 22, 2024 by copybara-service bot

cfg as gradient experiment - unsuccessful

#742 opened Oct 18, 2024 by anfals

Add prototype delayed scaling with overwrite with gradients

#741 opened Oct 17, 2024 by anfals • Draft

Fix dequant logic with zero scaling factors

#730 opened Sep 26, 2024 by copybara-service bot

Allows setting array of bounds in set_calibration_config.

#729 opened Sep 24, 2024 by copybara-service bot

QTensor refactoring: holds AqtTileMap instead of TilingState.

#692 opened Aug 9, 2024 by copybara-service bot

Add support for asymmetric fake-quantization to AQTv2.

#675 opened Jul 23, 2024 by copybara-service bot

internal

#672 opened Jul 12, 2024 by copybara-service bot

Set config.set_bits params' default values to None.

#659 opened Jun 25, 2024 by copybara-service bot

Internal prototype

#655 opened Jun 21, 2024 by copybara-service bot

Change the way tiling axes are split without configuration

#644 opened Jun 7, 2024 by copybara-service bot

[EXPERIMENTAL] Remove leading 1s from the scale.

#643 opened Jun 6, 2024 by copybara-service bot

Support pl.BlockSpec that holds block_shape with None.

#630 opened May 24, 2024 by copybara-service bot

Add from_qtensor, which transforms a QTensor into a PallasQTensor.

#629 opened May 24, 2024 by copybara-service bot

Move PallasQTensor materialization code into a method. Call materialization before dequant. This simplifies the user interface.

#628 opened May 24, 2024 by copybara-service bot

Cast scale to dequant_type when restoring scale to original shape in Pallas.

#627 opened May 24, 2024 by copybara-service bot

[Pallas Support] Add dot_general for pallas.

#624 opened May 22, 2024 by copybara-service bot

Put numerics and numerics-related logic into QTensor

#623 opened May 22, 2024 by copybara-service bot

[Pallas support] Add quant_blockwisely function, which quantizes tensor according to the BlockSpec of the inputs.

#596 opened Apr 24, 2024 by copybara-service bot
