vulkan: Use fp16 for the flash attention P*V multiplication #12783

Merged
0cc4m merged 1 commit into ggml-org:master from jeffbolznv:flash_attn_prec
Apr 9, 2025

Commits

Commits on Apr 6, 2025