Skip to content

Pull requests: kubernetes-sigs/inference-perf

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[WIP] Fix incorrect shared prefix prompt length cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. size/M Denotes a PR that changes 30-99 lines, ignoring generated files.
#299 opened Dec 5, 2025 by Bslabe123 Loading…
feat: add percentiles configuration for request lifecycle metrics reporting cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. size/L Denotes a PR that changes 100-499 lines, ignoring generated files.
#295 opened Nov 29, 2025 by hhk7734 Loading…
Add end-to-end testing using llm-d-inference-sim cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. size/XL Denotes a PR that changes 500-999 lines, ignoring generated files.
#294 opened Nov 26, 2025 by diamondburned Loading…
feat: Add Chat Completion API support to SharedPrefixDataGenerator cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. size/M Denotes a PR that changes 30-99 lines, ignoring generated files.
#287 opened Nov 19, 2025 by bongwoobak Loading…
Support setting custom y-axis limits optionally cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. lgtm "Looks good to me", indicates that a PR is ready to be merged. size/S Denotes a PR that changes 10-29 lines, ignoring generated files.
#268 opened Nov 3, 2025 by Shuwen-Fang Loading…
feat: Improve client perf and error handling cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. lgtm "Looks good to me", indicates that a PR is ready to be merged. needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. size/M Denotes a PR that changes 30-99 lines, ignoring generated files.
#247 opened Oct 7, 2025 by LukeAVanDrie Loading…
refactor: Make base client concrete and usable cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. size/S Denotes a PR that changes 10-29 lines, ignoring generated files.
#246 opened Oct 7, 2025 by LukeAVanDrie Loading…
ProTip! Mix and match filters to narrow down what you’re looking for.