Skip to content

Commit 1778fef

Browse files
committed
Lower KV fraction for Blackwell
Signed-off-by: Barry Kang <[email protected]>
1 parent 57f5ba5 commit 1778fef

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

tests/integration/defs/test_e2e.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2208,7 +2208,7 @@ def test_relaxed_acceptance_quickstart_advanced_deepseek_r1_8gpus(
22082208
"--moe_ep_size=8",
22092209
"--tp_size=8",
22102210
"--use_cuda_graph",
2211-
f"--kv_cache_fraction={_MEM_FRACTION_80 if is_blackwell else _MEM_FRACTION_95}",
2211+
f"--kv_cache_fraction={_MEM_FRACTION_50 if is_blackwell else _MEM_FRACTION_95}",
22122212
"--max_batch_size=1",
22132213
"--max_seq_len=3000",
22142214
"--disable_kv_cache_reuse",

0 commit comments

Comments
 (0)