[PTX] Enable migration of mma (m16n8k16) #2746
Conversation
/// \param [in] item The sycl::nd_item index space class
template <typename MulType, typename ABType, typename CDType, typename ItemT>
__attribute__((optnone)) void
mma(CDType *d0, CDType *d1, CDType *d2, CDType *d3, ABType a0, ABType a1,
According to the PTX spec (https://docs.nvidia.com/cuda/parallel-thread-execution/#matrix-multiply-accumulate-operation-using-mma-instruction), there are 11 shapes defined for mma:
m8n8k4
m8n8k16
m8n8k32
m8n8k128
m16n8k4
m16n8k8
m16n8k16
m16n8k32
m16n8k64
m16n8k128
m16n8k256
Please make sure your helper function is able to support these shapes in addition to "m16n8k16".
As this is the initial PR, I've added support for the shape used in the top applications.
I will add support for the remaining shapes soon.
This PR adds support for migrating the mma PTX ASM API.