Skip to content

[Core] Default to using per_token quantization for fp8 when cutlass is supported. #12993

[Core] Default to using per_token quantization for fp8 when cutlass is supported.

[Core] Default to using per_token quantization for fp8 when cutlass is supported. #12993

Triggered via pull request September 20, 2024 04:56
Status Success
Total duration 17s
Artifacts

clang-format.yml

on: pull_request
Matrix: clang-format
Fit to window
Zoom out
Zoom in

Annotations

2 warnings
clang-format (3.11)
The following actions uses node12 which is deprecated and will be forced to run on node16: actions/checkout@v2, actions/setup-python@v2. For more info: https://github.blog/changelog/2023-06-13-github-actions-all-actions-will-run-on-node16-instead-of-node12-by-default/
clang-format (3.11)
The following actions use a deprecated Node.js version and will be forced to run on node20: actions/checkout@v2, actions/setup-python@v2. For more info: https://github.blog/changelog/2024-03-07-github-actions-all-actions-will-run-on-node20-instead-of-node16-by-default/