FA3 #3623

zhaochaoxing · 2025-06-09T02:09:01Z

lmdeploy/pytorch/backends/cuda/attention.py

grimoire · 2025-06-10T06:20:32Z

lmdeploy chat --backend pytorch deepseek-ai/DeepSeek-V2-Lite-Chat

failed on second round chat

zhaochaoxing · 2025-06-11T01:58:08Z

lmdeploy chat --backend pytorch deepseek-ai/DeepSeek-V2-Lite-Chat

failed on second round chat

The bug has been resolved.

grimoire

LGTM

RunningLeon

LGTM

lvhan028 · 2025-06-13T12:19:17Z

I'd like to hold this PR for a while.
We'd better conduct the evaluation test of some hot models in the env that install FA3

grimoire · 2025-06-20T02:46:22Z

lmdeploy/pytorch/kernels/cuda/flatten_kv_cache.py

@@ -202,7 +202,8 @@ def flatten_kv_cache(k_caches: Tensor,
                     k_scales_zeros: Tensor = None,
                     v_scales_zeros: Tensor = None,
                     quant_policy: Literal[0, 4, 8] = 0,
-                     kv_layout: str = 'bshd'):
+                     kv_layout: str = 'bshd',
+                     flatten_kv_layout: str = 'bhsd'):


Since output is 3d continuous batching tensors, I think 'hsd' is better.

add fa3

c155d58

lvhan028 requested review from grimoire and RunningLeon June 10, 2025 03:20

lvhan028 added the enhancement New feature or request label Jun 10, 2025

grimoire reviewed Jun 10, 2025

View reviewed changes

lmdeploy/pytorch/backends/cuda/attention.py Outdated Show resolved Hide resolved

grimoire reviewed Jun 10, 2025

View reviewed changes

lmdeploy/pytorch/backends/cuda/attention.py Outdated Show resolved Hide resolved

fix cu_seqlens_k on chat mode

1679937

grimoire approved these changes Jun 11, 2025

View reviewed changes

RunningLeon approved these changes Jun 13, 2025

View reviewed changes

lvhan028 self-requested a review June 13, 2025 12:16

zhaochaoxing added 2 commits June 16, 2025 09:36

add fa3 for qwen3

d9debc2

fix fa3

0fb24c8

grimoire reviewed Jun 20, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

FA3 #3623

FA3 #3623

Uh oh!

zhaochaoxing commented Jun 9, 2025

Uh oh!

Uh oh!

Uh oh!

grimoire commented Jun 10, 2025

Uh oh!

zhaochaoxing commented Jun 11, 2025

Uh oh!

grimoire left a comment

Uh oh!

RunningLeon left a comment

Uh oh!

lvhan028 commented Jun 13, 2025

Uh oh!

grimoire Jun 20, 2025

Uh oh!

Uh oh!

FA3 #3623

Are you sure you want to change the base?

FA3 #3623

Uh oh!

Conversation

zhaochaoxing commented Jun 9, 2025

Uh oh!

Uh oh!

Uh oh!

grimoire commented Jun 10, 2025

Uh oh!

zhaochaoxing commented Jun 11, 2025

Uh oh!

grimoire left a comment

Choose a reason for hiding this comment

Uh oh!

RunningLeon left a comment

Choose a reason for hiding this comment

Uh oh!

lvhan028 commented Jun 13, 2025

Uh oh!

grimoire Jun 20, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!