@nngokhale
Contributor

- Increase reserved memory.
- Round down max num seqs.
- Disable prefix caching by default; add an option to enable it.
- Add support for prefill seqs.
- Remove delayed sampling, weight sharing.
- Fix the concurrent_req benchmark option.

@github-actions

🚧 CI Blocked

The main CI workflow was not started for the following reason:

Your branch is behind the base branch. Please merge or rebase to get the latest changes.

@nngokhale nngokhale force-pushed the plugin-cd-0.10.2_next_wa branch from 7a51081 to 5b61d0f Compare October 25, 2025 05:43
@nngokhale nngokhale force-pushed the plugin-cd-0.10.2_next_wa branch from 5b61d0f to 1b35dc2 Compare October 25, 2025 05:58
@github-actions

✅ CI Passed

All checks passed successfully against the following vllm commit:
01efc7ef781391e744ed08c3292817a773d654e6

@PatrykWo
Collaborator

PatrykWo commented Nov 3, 2025

Tested build and sample runs, looks OK.

@PatrykWo PatrykWo self-requested a review November 3, 2025 14:36
@mgawarkiewicz-intel mgawarkiewicz-intel enabled auto-merge (squash) November 3, 2025 14:37
@mgawarkiewicz-intel mgawarkiewicz-intel merged commit 7982652 into vllm-project:v0.10.2_next Nov 3, 2025
7 checks passed
@github-actions

github-actions bot commented Nov 3, 2025

✅ CI Passed

All checks passed successfully against the following vllm commit:
01efc7ef781391e744ed08c3292817a773d654e6

3 participants