
Conversation


@rootfs commented Sep 15, 2025

What type of PR is this?

When testing different reasoning models, the reasoning bench needs to adjust max_tokens to avoid partial responses, which lead to wrong answers on many datasets, including MMLU and GPQA.
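
The actual diff isn't shown in this thread, but a minimal sketch of the idea (the endpoint URL, model name, default budget of 4096, and the `query_model` helper below are hypothetical, not the bench's real code) would raise the max_tokens budget and flag truncated completions:

```python
# Hypothetical sketch, not the PR's actual change: give reasoning models a larger
# max_tokens budget and detect completions that were cut off mid-reasoning.
from openai import OpenAI

# Assumed OpenAI-compatible router endpoint; adjust for your deployment.
client = OpenAI(base_url="http://localhost:8801/v1", api_key="EMPTY")

def query_model(question: str, model: str, max_tokens: int = 4096) -> str:
    """Ask a single benchmark question with a generous token budget so the
    chain of thought is not truncated before the final answer appears."""
    resp = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": question}],
        max_tokens=max_tokens,
    )
    choice = resp.choices[0]
    # finish_reason == "length" means the output hit max_tokens; on MMLU/GPQA
    # the answer letter is then usually missing and the item is scored wrong.
    if choice.finish_reason == "length":
        print(f"warning: response truncated at max_tokens={max_tokens}")
    return choice.message.content or ""
```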

What this PR does / why we need it:

Which issue(s) this PR fixes:

Fixes #

Release Notes: Yes/No


netlify bot commented Sep 15, 2025

Deploy Preview for vllm-semantic-router ready!

| Name | Link |
| --- | --- |
| 🔨 Latest commit | 2bc24ff |
| 🔍 Latest deploy log | https://app.netlify.com/projects/vllm-semantic-router/deploys/68c81e57a4e263000851174b |
| 😎 Deploy Preview | https://deploy-preview-137--vllm-semantic-router.netlify.app |



👥 vLLM Semantic Team Notification

The following members have been identified for the changed files in this PR and have been automatically assigned:

📁 bench

Owners: @yuezhu1, @Xunzhuo
Files changed:

  • bench/vllm_semantic_router_bench/router_reason_bench_multi_dataset.py


🎉 Thanks for your contributions!

This comment was automatically generated based on the OWNER files in the repository.

@rootfs marked this pull request as draft September 15, 2025 23:22