(Naga) Cooperative Matrix Support #8251

kvark · 2025-09-20T07:25:55Z

Connections
Blocked by gfx-rs/rspirv#265
Since rspirv fails validation of the product, even though it's correct.

Description
Adding shader support for KHR_cooperative_matrix. Considering a rather simple scope that is portable between Vulkan and Metal.

Testing
Adds tests.

Squash or Rebase?
Rebase.

Checklist

Run cargo fmt.
Run taplo format.
Run cargo clippy --tests. If applicable, add:
- --target wasm32-unknown-unknown
Run cargo xtask test to run tests.
If this contains user-facing changes, add a CHANGELOG.md entry.

API choices

SPIRV and Metal have a fine intersection of the cooperative matrix functionality, with some caveats:

GLSL calls it "coopmat" while Metal has "simdgroup_typeNxY". I decided to go with "coop_mat" since WGSL fairly consistently separates sub-words with an underscore, e.g. "texture_cube".
SPIRV requires a "use" to be associated with each matrix type. It's one of A/B/Acc. Metal doesn't. The API decision here is to expose it as a "role" being one of the generic parameters of coop_mat.
SPIRV has OpCooperativeMatrixLoadKHR and OpCooperativeMatrixMulAddKHR as expressions and OpCooperativeMatrixStoreKHR as a statement. Metal has all of them 3 as statements. I followed SPIR-V notion here, as does Google's proposal.
- the "T" suffix is for transposed load/store. No strong opinion here.
Metal also has just the multiplication (as opposed to multiply-add). I opted to not expose this, since we can always follow-up if needed.

Things left for follow-up:

update the API based on whatever the W3C working group converges on
maybe add the multiply without add
implement initialization from a scalar (honestly not sure how useful this is?)
support for coop matrix with scalar binary ops is very limited
could use more validation and better errors

cwfitzgerald · 2025-09-26T17:40:17Z

Haven't actually looked in the PR yet, but you should take a look at the presentation about cooperative matrices from the F2F: https://docs.google.com/presentation/d/1wiy3-ar58ah1W9Qc5trd0gG7fwCo93IJ9YCtQoR6W6c/edit?slide=id.g30fc39156ff_0_0#slide=id.g30fc39156ff_0_0 and the dawn design doc https://dawn.googlesource.com/dawn/+/refs/heads/main/docs/dawn/features/subgroup_matrix.md just to make sure things are synced up with upstream.

kvark · 2025-09-27T17:29:24Z

@cwfitzgerald this is very useful, thanks for linking! Funny to see the timing of that presentation roughly matching when I started working on it, independently. I looked at the slides as well as the design doc, and here is my first feedback. Apologies if it's not thought through enough!

Because the type is abstract it can only be stored in the Function and Private storage classes. Special load and store instructions are used to translate to/from backing memory.

There are very similar types - textures and sampler - which also are very abstract from the shader writer point of view. Was it considered to just use the "Handle" storage class?

subgroup_matrix_left

There is a choice for each of them: scope, role (left/right/acc), type, etc, to be either a generic argument or a part of the name. In this PR, for example, the role is encoded as a generic A/B/C. I think that makes sense because it allows to express operations like matrix store cleanly as generic instead of overloaded for all kinds of the matrix.

Similarly, the "subgroup" part. If we had it as a generic scope, it could also use it in other parts of the language/API (e.g. barriers).

subgroupMatrixLoad(.. col_major : bool, ..) -> T

A boolean argument is generally a bad API pattern, since the call site has no clue about what it means from just looking at the invocation. Since this is supposed to be a constant anyway, maybe this is a good application for including this into the function name itself? This PR is currently exposing it as coopMatrixLoad/coopMatrixLoadT (the "T" suffix - for transposed).

Overall, looks reasonable. Curious if Apple had concerns about some parts as well.
cc @jimblandy if you want to expose this feedback to the group.

kvark · 2025-09-28T00:52:22Z

@cwfitzgerald @jimblandy do you have a strong preference on how to proceed with the changes? I'm at the point where things basically work, and the test is validating correctly. We could:

land as is and then change the names (and a bit of semantics) once the WGSL figures out the standard API for this. I'm fairly confident that most of the IR and inner logic isn't going to be affected.
rewrite this to match Google's proposal text, if the working group is leaning towards that style of API (see my remarks in the comment above).
don't land anything until WGSL is figured out by the group

I'm fine either way. I just want to use this for a project and will be on a branch if I'm not able to merge. My preference would be (1).

kvark · 2025-09-28T08:16:17Z

Ok, I've got coopLoad aligned to the same API as the WGSL proposal. It's a bit strange since it's only the second function we support that even has generic arguments. But the code changes to support this are pretty small, fortunately.
CI should be green ✅ now . Looking forward to get some feedback and/or proceed 🚀 .

jimblandy · 2025-10-01T16:05:35Z

I think it's our standard practice to land experimental things, so I think it's okay for us to review and land this as-is. However, the WebGPU committee will almost certainly approve some version of Alan's proposal, eventually, so if we put something different in wgpu, it will just need to be changed.

So, I'd like to really encourage you to adapt what you've got to Alan's proposal as much as feasible, but we shouldn't block merging on 100% compliance.

kvark force-pushed the cooperative branch 5 times, most recently from 881da16 to 430d104 Compare September 26, 2025 03:30

kvark marked this pull request as ready for review September 26, 2025 03:30

kvark force-pushed the cooperative branch from ab2de4b to 056a5c1 Compare September 28, 2025 00:47

kvark force-pushed the cooperative branch 3 times, most recently from 2bf7828 to 782a0fc Compare September 28, 2025 04:06

kvark added 9 commits September 27, 2025 22:01

Switch rspirv to the latest git version

16eaf64

Add Cooperative* type to IR

d035ecb

coop: first bits of Vulkan support for the type

a05fdf9

coop: wgsl parsing, IR role

23f0cf8

coop: handle simple ops, end-to-end with SPIRV

8b4a22b

coop: mulAdd instruction

d1c9b56

coop: Implement Load/Store statement

e604132

coop: fixes and changelog

7f36536

coop: make stride non-optional

321ddf4

kvark force-pushed the cooperative branch from 782a0fc to 7da965e Compare September 28, 2025 05:01

kvark added 2 commits September 27, 2025 22:38

coop: rewire IR using native variables load/store

87360f2

coop: make cooperativeLoad to be an expression

07be9e9

kvark force-pushed the cooperative branch from 7da965e to 07be9e9 Compare September 28, 2025 07:37

coop: support generic argument on coopLoad

4fa9f00

kvark requested a review from jimblandy October 1, 2025 04:20

cwfitzgerald self-assigned this Oct 1, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

(Naga) Cooperative Matrix Support #8251

(Naga) Cooperative Matrix Support #8251

kvark commented Sep 20, 2025 •

edited

Loading

Uh oh!

cwfitzgerald commented Sep 26, 2025

Uh oh!

kvark commented Sep 27, 2025

Uh oh!

kvark commented Sep 28, 2025

Uh oh!

kvark commented Sep 28, 2025

Uh oh!

jimblandy commented Oct 1, 2025

Uh oh!

Uh oh!

(Naga) Cooperative Matrix Support #8251

Are you sure you want to change the base?

(Naga) Cooperative Matrix Support #8251

Conversation

kvark commented Sep 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

API choices

Uh oh!

cwfitzgerald commented Sep 26, 2025

Uh oh!

kvark commented Sep 27, 2025

Uh oh!

kvark commented Sep 28, 2025

Uh oh!

kvark commented Sep 28, 2025

Uh oh!

jimblandy commented Oct 1, 2025

Uh oh!

Uh oh!

kvark commented Sep 20, 2025 •

edited

Loading