Issue with block_size = 64 #5

PiotrNawrot · 2025-03-10T17:26:10Z

Thanks for the amazing research and releasing the code!
I get the following error on an H100 when I change the block size to 64. Any ideas?

SharedEncodingAttr builder when the MMAEncodingAttr is Hopper has not been implemented yet
UNREACHABLE executed at /project/python/build/cmake.linux-x86_64-cpython-3.10/include/triton/Dialect/TritonGPU/IR/TritonGPUAttrDefs.cpp.inc:347!
Aborted

XunhaoLai · 2025-03-13T03:22:39Z

@PiotrNawrot Thank you for your feedback! FlexPrefill was developed and tested on A100 GPUs and has not yet been tested on Hopper GPUs. I’ll work on adding support for this in a future update. Apologies for the inconvenience!
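
Until Hopper support is added, a caller could detect the reported combination up front instead of hitting the Triton abort. The following is a minimal sketch, not part of FlexPrefill: the helper names `is_hopper`, `current_capability`, and `check_block_size` are hypothetical. It assumes only that the failure is tied to `block_size = 64` on a Hopper GPU, as described in this thread. It relies on the fact that A100 reports CUDA compute capability 8.0 and H100 reports 9.0.

```python
from typing import Optional, Tuple

HOPPER_MAJOR = 9  # H100 reports compute capability 9.0; A100 reports 8.0


def is_hopper(capability: Tuple[int, int]) -> bool:
    """Return True if the (major, minor) compute capability is Hopper."""
    return capability[0] == HOPPER_MAJOR


def current_capability() -> Optional[Tuple[int, int]]:
    """Query the active CUDA device, or return None if unavailable."""
    try:
        import torch  # optional dependency for this sketch
    except ImportError:
        return None
    if not torch.cuda.is_available():
        return None
    return torch.cuda.get_device_capability()


def check_block_size(
    block_size: int, capability: Optional[Tuple[int, int]]
) -> Optional[str]:
    """Return a warning for the combination reported in this issue.

    Assumption: only block_size == 64 on Hopper is flagged, because that
    is the only failing configuration described in the thread.
    """
    if capability is not None and is_hopper(capability) and block_size == 64:
        return (
            "block_size=64 on a Hopper GPU is reported to abort inside "
            "Triton; FlexPrefill was developed and tested on A100."
        )
    return None


if __name__ == "__main__":
    warning = check_block_size(64, current_capability())
    if warning:
        print(warning)
```

This only surfaces the problem earlier with a readable message; it does not work around the missing Triton shared-encoding path itself.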
