@@ -160,37 +160,31 @@ Module Contents
160160 Increment the maximum number of registers to use.
161161
162162
163-
164163.. py :function :: dec_max_nreg(reg_count)
165164
166165 Decrement the maximum number of registers to use.
167166
168167
169-
170168.. py :function :: annotate_producer_reg_dealloc(reg_count = 24 )
171169
172170 Annotate the producer reg dealloc.
173171
174172
175-
176173.. py :function :: annotate_consumer_reg_alloc(reg_count = 240 )
177174
178175 Annotate the consumer reg alloc.
179176
180177
181-
182178.. py :function :: no_set_max_nreg()
183179
184180 Disable the maximum register limit setting.
185181
186182
187-
188183.. py :function :: disable_warp_group_reg_alloc()
189184
190185 Disable the warp group reg alloc.
191186
192187
193-
194188.. py :function :: mbarrier_wait_parity(mbarrier, parity)
195189
196190 Wait for memory barrier parity condition.
@@ -469,19 +463,16 @@ Module Contents
469463 Synchronize all threads in a block.
470464
471465
472-
473466.. py :function :: sync_global()
474467
475468 Synchronize all threads in the entire grid.
476469
477470
478-
479471.. py :function :: sync_grid()
480472
481473 Synchronize all threads in a grid.
482474
483475
484-
485476.. py :function :: initialize_wgmma_descriptor(descriptor, start_address, layout_type_ = 0 , leading_byte_offset = 0 , stride_byte_offset = 0 )
486477
487478 Initialize a WGMMA/UTCMMA shared-memory descriptor.
@@ -510,13 +501,11 @@ Module Contents
510501 Break out of the innermost loop.
511502
512503
513-
514504.. py :function :: cp_async_barrier_noinc(barrier_id)
515505
516506 Perform a ptx async copy barrier using cp.async.mbarrier.arrive.noinc.
517507
518508
519-
520509.. py :function :: tcgen05_mma_arrive(mbar_ptr)
521510
522511 Signal UMMA (TCGEN05) barrier arrival for a shared-memory mbarrier pointer.
0 commit comments