vulkan: Use fp16 for the flash attention P*V multiplication#12783
Merged
0cc4m merged 1 commit intoggml-org:masterfrom Apr 9, 2025
Merged
vulkan: Use fp16 for the flash attention P*V multiplication#127830cc4m merged 1 commit intoggml-org:masterfrom
0cc4m merged 1 commit intoggml-org:masterfrom