feat: add BLS decoupled response iterator's cancel() method for the request cancellation #398


Merged · 7 commits · Apr 8, 2025

Conversation

@richardhuo-nv (Contributor) commented Mar 24, 2025

What does the PR do?

This PR adds a cancel() method to the BLS decoupled response iterator, so that the stub process can cancel the Triton server inference request corresponding to that iterator once it has received enough responses from it.

Because each stub InferenceRequest object can create multiple BLS Triton server inference requests, putting cancel() on the response iterator makes it possible to cancel an individual request, rather than cancelling all requests generated from the same stub InferenceRequest object.

More details can be found in the changes to README.md.
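As an illustration of the intended semantics, here is a minimal, hypothetical sketch (not the Triton implementation): a mock iterator that models how cancel() stops further responses from a single decoupled request. In a real Triton Python model, the iterator would come from `pb_utils.InferenceRequest.exec(decoupled=True)`; the class and data below are invented for demonstration only.

```python
# Hypothetical mock of a BLS decoupled response iterator. In Triton, the real
# iterator is returned by pb_utils.InferenceRequest.exec(decoupled=True);
# cancel() cancels only the server-side request backing this iterator, not
# other requests created from the same stub InferenceRequest object.
class MockDecoupledResponseIterator:
    def __init__(self, responses):
        self._responses = iter(responses)
        self._cancelled = False

    def cancel(self):
        # Mark the underlying request as cancelled; no further responses
        # are delivered after this point.
        self._cancelled = True

    def __iter__(self):
        return self

    def __next__(self):
        if self._cancelled:
            raise StopIteration
        return next(self._responses)


# Consume responses until we have enough, then cancel the request.
consumed = []
it = MockDecoupledResponseIterator(range(100))
for response in it:
    consumed.append(response)
    if len(consumed) == 3:  # the stub got enough responses
        it.cancel()
print(len(consumed))  # → 3
```

The design point this models is the one stated above: cancellation is scoped to one response iterator (one server-side request), leaving sibling requests from the same stub InferenceRequest untouched.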

Checklist

  • PR title reflects the change and is of format <commit_type>: <Title>
  • Changes are described in the pull request.
  • Related issues are referenced.
  • Populated github labels field
  • Added test plan and verified test passes.
  • Verified that the PR passes existing CI.
  • Verified copyright is correct on all changed files.
  • Added succinct git squash message before merging.
  • All template sections are filled out.
  • Optional: Additional screenshots for behavior/output changes with before/after.

Commit Type:

Check the conventional commit type box below and add the matching label to the GitHub PR.

  • build
  • ci
  • docs
  • feat
  • fix
  • perf
  • refactor
  • revert
  • style
  • test

Related PRs:

triton-inference-server/server#8097

Where should the reviewer start?

The major changes are in the Python backend stub process, essentially the request executor.

Test plan:

  • CI Pipeline ID:
    25879687

Caveats:

Background

This feature is useful for stopping long inference requests, such as those from auto-generative large language models, which may run for an indeterminate amount of time and consume significant server resources.

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

https://jirasw.nvidia.com/browse/DLIS-7831

Commits:

  • adjust the comments
  • title
  • add finished check
  • rename function
  • adjust
  • comment
  • comment
  • comment
  • fix
  • fix
  • fix
  • readme
  • fix comments
  • fix leak
@kthui (Contributor) left a comment:

Nice work! Left some minor comments on docs and a few questions.

@Tabrizian (Member) left a comment:

LGTM. Thanks!

@richardhuo-nv richardhuo-nv merged commit 7f21b67 into main Apr 8, 2025
3 checks passed
Labels
None yet

3 participants