metal: cache x in registers in rms_single_row to avoid redundant global read #3754

Merged
zcbenz merged 2 commits into ml-explore:main from will-march:perf/rms-norm-cache-x
Jun 26, 2026

Commits

Commits on Jun 23, 2026

Commits on Jun 24, 2026