[SYCL][HIP] Support of AMD matrix core instructions (#11485)
* Support one-block AMD matrix core instructions for the `__gfx90a__`
architecture.
* Support the `__builtin_amdgcn_mfma_i32_32x32x8i8`,
`__builtin_amdgcn_mfma_i32_16x16x16i8`,
`__builtin_amdgcn_mfma_f64_16x16x4f64`,
`__builtin_amdgcn_mfma_f32_32x32x8bf16_1k`,
`__builtin_amdgcn_mfma_f32_16x16x16bf16_1k`,
`__builtin_amdgcn_mfma_f32_32x32x8f16` and
`__builtin_amdgcn_mfma_f32_16x16x16f16` instructions.
* Add HIP matrix core support into joint_matrix documentation.
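
For reference, the builtin names above encode the accumulator type, the M x N x K tile shape and the input type. The sketch below only tabulates that naming; `MfmaShape`, `kMfmaShapes`, `find_shape` and `acc_elems_per_lane` are hypothetical helpers for illustration, not part of this patch or of the SYCL/HIP APIs. It assumes C++17 and the 64-lane wavefront of `gfx90a`.

```cpp
#include <array>
#include <string_view>

// Hypothetical descriptor for one MFMA builtin (illustration only).
struct MfmaShape {
  std::string_view builtin;   // builtin name as listed in this commit
  std::string_view acc_type;  // accumulator/result element type
  std::string_view in_type;   // A/B input element type
  int m, n, k;                // tile shape read off the builtin name
};

// The seven builtins listed above, decoded from their names.
constexpr std::array<MfmaShape, 7> kMfmaShapes{{
    {"__builtin_amdgcn_mfma_i32_32x32x8i8", "i32", "i8", 32, 32, 8},
    {"__builtin_amdgcn_mfma_i32_16x16x16i8", "i32", "i8", 16, 16, 16},
    {"__builtin_amdgcn_mfma_f64_16x16x4f64", "f64", "f64", 16, 16, 4},
    {"__builtin_amdgcn_mfma_f32_32x32x8bf16_1k", "f32", "bf16", 32, 32, 8},
    {"__builtin_amdgcn_mfma_f32_16x16x16bf16_1k", "f32", "bf16", 16, 16, 16},
    {"__builtin_amdgcn_mfma_f32_32x32x8f16", "f32", "f16", 32, 32, 8},
    {"__builtin_amdgcn_mfma_f32_16x16x16f16", "f32", "f16", 16, 16, 16},
}};

// Assumption: one block, with the M x N accumulator tile spread evenly
// over a 64-lane wavefront.
constexpr int kWavefrontSize = 64;

// Accumulator elements held by each lane for a one-block instruction.
constexpr int acc_elems_per_lane(const MfmaShape& s) {
  return s.m * s.n / kWavefrontSize;
}

// Look up a builtin by name; returns nullptr if it is not in the table.
constexpr const MfmaShape* find_shape(std::string_view name) {
  for (const auto& s : kMfmaShapes) {
    if (s.builtin == name) return &s;
  }
  return nullptr;
}
```

Under these assumptions a 32x32 tile leaves 16 accumulator elements per lane and a 16x16 tile leaves 4.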
Should be merged after
- #11215
---------
Co-authored-by: Bing1 Yu <[email protected]>
Co-authored-by: mmoadeli <[email protected]>