attributes/codegen: update aarch64 features

mrkajetanp · davidtwco · commit df1159b978cc · 2025-04-16T12:49:19.000Z
Update `attributes/codegen.md` to match the aarch64 CPU features known to rustc[1].

[1]: https://github.com/rust-lang/rust/blob/afa859f8121bf2985362a2c8414dc71a825ccf2d/compiler/rustc_target/src/target_features.rs#L179-L373
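For readers unfamiliar with how these feature names are consumed, here is a minimal sketch of using a stable entry from the table. It assumes the `sha3` feature, which the table lists as implicitly enabling `sha2` (and, through it, `neon`). The function names `sum_bytes`, `sum_bytes_sha3`, and `sum_bytes_portable` are hypothetical, and the byte fold is only a stand-in for real SHA3 intrinsics; it is not taken from the reference text.

```rust
/// Portable fallback (hypothetical): a simple rolling sum over the bytes.
fn sum_bytes_portable(data: &[u8]) -> u64 {
    data.iter()
        .fold(0u64, |acc, &b| acc.wrapping_mul(31).wrapping_add(b as u64))
}

/// Compiled with `sha3` enabled for this function only. Per the table,
/// enabling `sha3` also enables `sha2`, which in turn enables `neon`.
/// A real implementation would call `core::arch::aarch64` intrinsics here;
/// this sketch just reuses the portable body.
#[cfg(target_arch = "aarch64")]
#[target_feature(enable = "sha3")]
unsafe fn sum_bytes_sha3(data: &[u8]) -> u64 {
    sum_bytes_portable(data)
}

/// Dispatches at run time: the feature-gated function is only called after
/// the CPU has been checked for the feature.
fn sum_bytes(data: &[u8]) -> u64 {
    #[cfg(target_arch = "aarch64")]
    {
        if std::arch::is_aarch64_feature_detected!("sha3") {
            // SAFETY: `sha3` support was just confirmed on this CPU.
            return unsafe { sum_bytes_sha3(data) };
        }
    }
    sum_bytes_portable(data)
}
```

Features marked Unstable in the table would additionally need a nightly compiler and the `aarch64_unstable_target_feature` feature gate before they could appear in `enable = "..."`.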
diff --git a/src/attributes/codegen.md b/src/attributes/codegen.md
@@ -226,52 +226,84 @@ Reference Manual], or elsewhere on [developer.arm.com].
 > The following pairs of features should both be marked as enabled or disabled together if used:
 > - `paca` and `pacg`, which LLVM currently implements as one feature.
 
-Feature        | Implicitly Enables | Feature Name
----------------|--------------------|-------------------
-`aes`          | `neon`         | FEAT_AES & FEAT_PMULL --- Advanced <abbr title="Single Instruction Multiple Data">SIMD</abbr> AES & PMULL instructions
-`bf16`         |                | FEAT_BF16 --- BFloat16 instructions
-`bti`          |                | FEAT_BTI --- Branch Target Identification
-`crc`          |                | FEAT_CRC --- CRC32 checksum instructions
-`dit`          |                | FEAT_DIT --- Data Independent Timing instructions
-`dotprod`      |                | FEAT_DotProd --- Advanced SIMD Int8 dot product instructions
-`dpb`          |                | FEAT_DPB --- Data cache clean to point of persistence
-`dpb2`         |                | FEAT_DPB2 --- Data cache clean to point of deep persistence
-`f32mm`        | `sve`          | FEAT_F32MM --- SVE single-precision FP matrix multiply instruction
-`f64mm`        | `sve`          | FEAT_F64MM --- SVE double-precision FP matrix multiply instruction
-`fcma`         | `neon`         | FEAT_FCMA --- Floating point complex number support
-`fhm`          | `fp16`         | FEAT_FHM --- Half-precision FP FMLAL instructions
-`flagm`        |                | FEAT_FlagM --- Conditional flag manipulation
-`fp16`         | `neon`         | FEAT_FP16 --- Half-precision FP data processing
-`frintts`      |                | FEAT_FRINTTS --- Floating-point to int helper instructions
-`i8mm`         |                | FEAT_I8MM --- Int8 Matrix Multiplication
-`jsconv`       | `neon`         | FEAT_JSCVT --- JavaScript conversion instruction
-`lse`          |                | FEAT_LSE --- Large System Extension
-`lor`          |                | FEAT_LOR --- Limited Ordering Regions extension
-`mte`          |                | FEAT_MTE & FEAT_MTE2 --- Memory Tagging Extension
-`neon`         |                | FEAT_FP & FEAT_AdvSIMD --- Floating Point and Advanced SIMD extension
-`pan`          |                | FEAT_PAN --- Privileged Access-Never extension
-`paca`         |                | FEAT_PAuth --- Pointer Authentication (address authentication)
-`pacg`         |                | FEAT_PAuth --- Pointer Authentication (generic authentication)
-`pmuv3`        |                | FEAT_PMUv3 --- Performance Monitors extension (v3)
-`rand`         |                | FEAT_RNG --- Random Number Generator
-`ras`          |                | FEAT_RAS & FEAT_RASv1p1 --- Reliability, Availability and Serviceability extension
-`rcpc`         |                | FEAT_LRCPC --- Release consistent Processor Consistent
-`rcpc2`        | `rcpc`         | FEAT_LRCPC2 --- RcPc with immediate offsets
-`rdm`          |                | FEAT_RDM --- Rounding Double Multiply accumulate
-`sb`           |                | FEAT_SB --- Speculation Barrier
-`sha2`         | `neon`         | FEAT_SHA1 & FEAT_SHA256 --- Advanced SIMD SHA instructions
-`sha3`         | `sha2`         | FEAT_SHA512 & FEAT_SHA3 --- Advanced SIMD SHA instructions
-`sm4`          | `neon`         | FEAT_SM3 & FEAT_SM4 --- Advanced SIMD SM3/4 instructions
-`spe`          |                | FEAT_SPE --- Statistical Profiling Extension
-`ssbs`         |                | FEAT_SSBS & FEAT_SSBS2 --- Speculative Store Bypass Safe
-`sve`          | `fp16`         | FEAT_SVE --- Scalable Vector Extension
-`sve2`         | `sve`          | FEAT_SVE2 --- Scalable Vector Extension 2
-`sve2-aes`     | `sve2`, `aes`  | FEAT_SVE_AES --- SVE AES instructions
-`sve2-sm4`     | `sve2`, `sm4`  | FEAT_SVE_SM4 --- SVE SM4 instructions
-`sve2-sha3`    | `sve2`, `sha3` | FEAT_SVE_SHA3 --- SVE SHA3 instructions
-`sve2-bitperm` | `sve2`         | FEAT_SVE_BitPerm --- SVE Bit Permute
-`tme`          |                | FEAT_TME --- Transactional Memory Extension
-`vh`           |                | FEAT_VHE --- Virtualization Host Extensions
+Feature        | Stability                                    | Implicitly Enables           | Feature Name
+-------        | ---------                                    | -----------------            | ------------
+`aes`          | Stable                                       | `neon`                       | FEAT_AES & FEAT_PMULL --- Advanced <abbr title="Single Instruction Multiple Data">SIMD</abbr> AES & PMULL instructions
+`bf16`         | Stable                                       |                              | FEAT_BF16 --- BFloat16 instructions
+`bti`          | Stable                                       |                              | FEAT_BTI --- Branch Target Identification
+`crc`          | Stable                                       |                              | FEAT_CRC --- CRC32 checksum instructions
+`cssc`         | Unstable (`aarch64_unstable_target_feature`) |                              | FEAT_CSSC --- Common Short Sequence Compression (CSSC) instructions
+`dit`          | Stable                                       |                              | FEAT_DIT --- Data Independent Timing instructions
+`dotprod`      | Stable                                       | `neon`                       | FEAT_DotProd --- Advanced SIMD Int8 dot product instructions
+`dpb`          | Stable                                       |                              | FEAT_DPB --- Data cache clean to point of persistence
+`dpb2`         | Stable                                       | `dpb`                        | FEAT_DPB2 --- Data cache clean to point of deep persistence
+`ecv`          | Unstable (`aarch64_unstable_target_feature`) |                              | FEAT_ECV --- Enhanced counter virtualization extension
+`f32mm`        | Stable                                       | `sve`                        | FEAT_F32MM --- SVE single-precision FP matrix multiply instruction
+`f64mm`        | Stable                                       | `sve`                        | FEAT_F64MM --- SVE double-precision FP matrix multiply instruction
+`faminmax`     | Unstable (`aarch64_unstable_target_feature`) |                              | FEAT_FAMINMAX --- Enable FAMIN and FAMAX instructions
+`fcma`         | Stable                                       | `neon`                       | FEAT_FCMA --- Floating point complex number support
+`fhm`          | Stable                                       | `fp16`                       | FEAT_FHM --- Half-precision FP FMLAL instructions
+`flagm`        | Stable                                       |                              | FEAT_FlagM --- Conditional flag manipulation
+`flagm2`       | Unstable (`aarch64_unstable_target_feature`) |                              | FEAT_FlagM2 --- Enhancements to flag manipulation instructions
+`fp16`         | Stable                                       | `neon`                       | FEAT_FP16 --- Half-precision FP data processing
+`fp8`          | Unstable (`aarch64_unstable_target_feature`) | `faminmax`, `lut`, `bf16`    | FEAT_FP8 --- FP8 (F8CVT Instructions)
+`fp8dot2`      | Unstable (`aarch64_unstable_target_feature`) | `fp8dot4`                    | FEAT_FP8DOT2 --- FP8 2-way dot product instructions
+`fp8dot4`      | Unstable (`aarch64_unstable_target_feature`) | `fp8fma`                     | FEAT_FP8DOT4 --- FP8 4-way dot product instructions
+`fp8fma`       | Unstable (`aarch64_unstable_target_feature`) | `fp8`                        | FEAT_FP8FMA --- FP8 multiply-add instructions
+`frintts`      | Stable                                       |                              | FEAT_FRINTTS --- Floating-point to int helper instructions
+`hbc`          | Unstable (`aarch64_unstable_target_feature`) |                              | FEAT_HBC --- Hinted conditional branches
+`i8mm`         | Stable                                       |                              | FEAT_I8MM --- Int8 Matrix Multiplication
+`jsconv`       | Stable                                       | `neon`                       | FEAT_JSCVT --- JavaScript conversion instruction
+`lor`          | Stable                                       |                              | FEAT_LOR --- Limited Ordering Regions extension
+`lse`          | Stable                                       |                              | FEAT_LSE --- Large System Extensions
+`lse128`       | Unstable (`aarch64_unstable_target_feature`) | `lse`                        | FEAT_LSE128 --- 128-bit Atomics
+`lse2`         | Unstable (`aarch64_unstable_target_feature`) |                              | FEAT_LSE2 --- Large System Extensions version 2
+`lut`          | Unstable (`aarch64_unstable_target_feature`) |                              | FEAT_LUT --- Lookup Table instructions
+`mops`         | Unstable (`aarch64_unstable_target_feature`) |                              | FEAT_MOPS --- memcpy and memset acceleration instructions
+`mte`          | Stable                                       |                              | FEAT_MTE & FEAT_MTE2 --- Memory Tagging Extension
+`neon`         | Stable                                       |                              | FEAT_AdvSIMD & FEAT_FP --- Floating Point and Advanced SIMD extension
+`paca`         | Stable                                       |                              | FEAT_PAuth --- Pointer Authentication (address authentication)
+`pacg`         | Stable                                       |                              | FEAT_PAuth --- Pointer Authentication (generic authentication)
+`pan`          | Stable                                       |                              | FEAT_PAN --- Privileged Access-Never extension
+`pauth-lr`     | Unstable (`aarch64_unstable_target_feature`) |                              | FEAT_PAuth_LR --- Pointer authentication instructions that allow signing of LR using SP and PC as diversifiers
+`pmuv3`        | Stable                                       |                              | FEAT_PMUv3 --- Performance Monitors extension (v3)
+`rand`         | Stable                                       |                              | FEAT_RNG --- Random Number Generator
+`ras`          | Stable                                       |                              | FEAT_RAS & FEAT_RASv1p1 --- Reliability, Availability and Serviceability extension
+`rcpc`         | Stable                                       |                              | FEAT_LRCPC --- Release consistent Processor Consistent
+`rcpc2`        | Stable                                       | `rcpc`                       | FEAT_LRCPC2 --- RcPc with immediate offsets
+`rcpc3`        | Unstable (`aarch64_unstable_target_feature`) | `rcpc2`                      | FEAT_LRCPC3 --- RcPc instructions version 3
+`rdm`          | Stable                                       | `neon`                       | FEAT_RDM --- Rounding Double Multiply accumulate
+`sb`           | Stable                                       |                              | FEAT_SB --- Speculation Barrier
+`sha2`         | Stable                                       | `neon`                       | FEAT_SHA1 & FEAT_SHA256 --- Advanced SIMD SHA instructions
+`sha3`         | Stable                                       | `sha2`                       | FEAT_SHA512 & FEAT_SHA3 --- Advanced SIMD SHA instructions
+`sm4`          | Stable                                       | `neon`                       | FEAT_SM3 & FEAT_SM4 --- Advanced SIMD SM3/4 instructions
+`sme`          | Unstable (`aarch64_unstable_target_feature`) | `bf16`                       | FEAT_SME --- Scalable Matrix Extension
+`sme-b16b16`   | Unstable (`aarch64_unstable_target_feature`) | `bf16`, `sme2`, `sve-b16b16` | FEAT_SME_B16B16 --- Non-widening BFloat16 to BFloat16 SME ZA-targeting arithmetic
+`sme-f16f16`   | Unstable (`aarch64_unstable_target_feature`) | `sme2`                       | FEAT_SME_F16F16 --- Non-widening half-precision FP16 to FP16 arithmetic for SME2
+`sme-f64f64`   | Unstable (`aarch64_unstable_target_feature`) | `sme`                        | FEAT_SME_F64F64 --- Double-precision floating-point outer product instructions
+`sme-f8f16`    | Unstable (`aarch64_unstable_target_feature`) | `sme-f8f32`                  | FEAT_SME_F8F16 --- SME F8F16 instructions
+`sme-f8f32`    | Unstable (`aarch64_unstable_target_feature`) | `sme2`, `fp8`                | FEAT_SME_F8F32 --- SME F8F32 instructions
+`sme-fa64`     | Unstable (`aarch64_unstable_target_feature`) | `sme`, `sve2`                | FEAT_SME_FA64 --- Full A64 instruction set support in Streaming SVE mode
+`sme-i16i64`   | Unstable (`aarch64_unstable_target_feature`) | `sme`                        | FEAT_SME_I16I64 --- 16-bit to 64-bit integer widening outer product instructions
+`sme-lutv2`    | Unstable (`aarch64_unstable_target_feature`) |                              | FEAT_SME_LUTv2 --- LUTI4 instruction
+`sme2`         | Unstable (`aarch64_unstable_target_feature`) | `sme`                        | FEAT_SME2 --- SME Version 2
+`sme2p1`       | Unstable (`aarch64_unstable_target_feature`) | `sme2`                       | FEAT_SME2p1 --- SME Version 2.1
+`spe`          | Stable                                       |                              | FEAT_SPE --- Statistical Profiling Extension
+`ssbs`         | Stable                                       |                              | FEAT_SSBS & FEAT_SSBS2 --- Speculative Store Bypass Safe
+`ssve-fp8dot2` | Unstable (`aarch64_unstable_target_feature`) | `ssve-fp8dot4`               | FEAT_SSVE_FP8FDOT2 --- SVE FP8 2-way dot product to half-precision instructions in Streaming SVE mode
+`ssve-fp8dot4` | Unstable (`aarch64_unstable_target_feature`) | `ssve-fp8fma`                | FEAT_SSVE_FP8FDOT4 --- SVE2 FP8 4-way dot product to single-precision instructions in Streaming SVE mode
+`ssve-fp8fma`  | Unstable (`aarch64_unstable_target_feature`) | `sme2`, `fp8`                | FEAT_SSVE_FP8FMA --- SVE2 FP8 multiply-accumulate to half-precision and single-precision instructions in Streaming SVE mode
+`sve`          | Stable                                       | `neon`                       | FEAT_SVE --- Scalable Vector Extension
+`sve-b16b16`   | Unstable (`aarch64_unstable_target_feature`) | `bf16`                       | FEAT_SVE_B16B16 --- Non-widening BFloat16 to BFloat16 arithmetic for SVE2 and SME2
+`sve2`         | Stable                                       | `sve`                        | FEAT_SVE2 --- Scalable Vector Extension 2
+`sve2-aes`     | Stable                                       | `sve2`, `aes`                | FEAT_SVE_AES & FEAT_SVE_PMULL128 --- SVE AES instructions
+`sve2-bitperm` | Stable                                       | `sve2`                       | FEAT_SVE2_BitPerm --- SVE Bit Permute
+`sve2-sha3`    | Stable                                       | `sve2`, `sha3`               | FEAT_SVE2_SHA3 --- SVE SHA3 instructions
+`sve2-sm4`     | Stable                                       | `sve2`, `sm4`                | FEAT_SVE2_SM4 --- SVE SM4 instructions
+`sve2p1`       | Unstable (`aarch64_unstable_target_feature`) | `sve2`                       | FEAT_SVE2p1 --- Scalable Vector Extension 2.1
+`tme`          | Stable                                       |                              | FEAT_TME --- Transactional Memory Extension
+`vh`           | Stable                                       |                              | FEAT_VHE --- Virtualization Host Extensions
+`wfxt`         | Unstable (`aarch64_unstable_target_feature`) |                              | FEAT_WFxT --- WFET and WFIT instructions
 
 r[attributes.codegen.target_feature.riscv]
 #### `riscv32` or `riscv64`