[rollout] feat: Fix partial load problem, Add vlm support for trtllm rollout #5149
SchumiDing wants to merge 12 commits into verl-project:main
Conversation
#5042 roadmap of trtllm rollout
from tensorrt_llm.logger import logger
...
class WorkerExtension:
Here is the latest WorkerExtension in the TensorRT-LLM repo. Is there any motivation for implementing a new one in the verl repo? I am thinking about how to unify both. Ideally, we would update the one in the TensorRT-LLM codebase, but if we need a minor change on it before the next trtllm version bump, @hchings do you have a suggestion?
Yeah. Ideally, we should still use the worker extension from the tensorrt-llm repo. But to support models that do not allow partial loading, I suppose self.engine.model_engine.model_loader.reload should be able to be used with the param allow_partial_loading=False.
I added this new worker extension to support allow_partial_loading=False, because tensorrt-llm always sets this param to True, but some models do not support partial loading.
I'd prefer that we keep this in the TensorRT-LLM repo instead and make it generic for other RL frameworks to reuse in the future.
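For reference, a minimal sketch of what such an extension could look like. Only the reload entry point (self.engine.model_engine.model_loader.reload) and the allow_partial_loading parameter come from the discussion above; the method name, the weight container, and the surrounding attributes are illustrative, not the exact code in this PR.

```python
# Hedged sketch only: reload() and allow_partial_loading are taken from the
# discussion above; everything else here is illustrative.
class WorkerExtension:
    """verl-side extension that forces a full (non-partial) weight reload."""

    def update_weights(self, named_tensors, allow_partial_loading: bool = False):
        # Collect the incoming trainer weights into a single state dict.
        weights = dict(named_tensors)
        # With allow_partial_loading=False the engine expects the full
        # parameter set rather than an incremental patch.
        self.engine.model_engine.model_loader.reload(
            weights, allow_partial_loading=allow_partial_loading
        )
```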
if self.is_vlm_model:
    from tensorrt_llm.inputs.multimodal import MultimodalServerConfig
    ...
    multimodal_config = MultimodalServerConfig(
Can you add a unittest for this new feature? There is a test_trtllm_async_server.py.
sure, I'm adding one
The test script and the related test workflow have been added.
I didn't find test_trtllm_async_server.py in the verl repo, so I wrote a test script that covers both the LLM rollout and the VLM rollout of the tensorrt-llm rollout worker.
I didn't find the test_trtllm_async_server.py in verl repo
We have a unit-test MR that should be merged shortly; it contains test_trtllm_async_server.py.
Sorry, there may be some errors when using it with the latest tensorrt-llm; I'm fixing it.
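For context, a minimal sketch of the VLM branch shown in the diff excerpt above. Only the is_vlm_model check and the MultimodalServerConfig import come from the PR; the constructor arguments and how the resulting config is consumed are left unspecified because they depend on the tensorrt-llm version.

```python
def build_multimodal_config(is_vlm_model: bool):
    """Return a MultimodalServerConfig for VLM models, else None.

    Hedged sketch: only the import and the is-VLM check come from the diff
    above; the constructor arguments of MultimodalServerConfig depend on the
    installed tensorrt-llm version and are intentionally omitted here.
    """
    if not is_vlm_model:
        return None
    from tensorrt_llm.inputs.multimodal import MultimodalServerConfig

    # Arguments omitted on purpose; fill in per the tensorrt-llm version in use.
    return MultimodalServerConfig()
```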
I'd prefer that we keep this in the TensorRT-LLM repo instead and make it generic for other RL frameworks to reuse in the future.
Yeah, it's sensible to keep the logic in verl/workers/rollout/trtllm_rollout/trtllm_worker_extension.py in the TensorRT-LLM repo instead.
What does this PR do?
Some models do not support partial loading; when this happens, the rollout manager falls back to the original full parameter update.
Add VLM support for the trtllm rollout.
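As a rough illustration of the fallback path described above (the function, parameter, and attribute names below are assumptions for illustration, not the exact verl implementation):

```python
# Hedged sketch of the fallback described above; names are illustrative and
# do not mirror the exact verl code.
def sync_rollout_weights(rollout_worker, actor_module, updated_named_tensors,
                         model_supports_partial_loading: bool):
    if model_supports_partial_loading:
        # Fast path: patch only the parameters that changed this step.
        rollout_worker.update_weights(updated_named_tensors,
                                      allow_partial_loading=True)
    else:
        # Fallback: the model cannot be patched incrementally, so push the
        # full parameter set, as was done before this PR.
        rollout_worker.update_weights(actor_module.state_dict().items(),
                                      allow_partial_loading=False)
```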
Checklist Before Starting
- Format the PR title as [{modules}] {type}: {description} (This will be checked by the CI)
  - {modules} include fsdp, megatron, veomni, sglang, vllm, rollout, trainer, ci, training_utils, recipe, hardware, deployment, ray, worker, single_controller, misc, perf, model, algo, env, tool, ckpt, doc, data, cfg, reward, like [megatron, fsdp, doc]
  - {type} is in feat, fix, refactor, chore, test
  - If the PR introduces a breaking change, add [BREAKING] to the beginning of the title, e.g. [BREAKING][fsdp, megatron] feat: dynamic batching

Test
Succeeded with the llama-3-11b-vision model using the trtllm rollout.
API and Usage Example
# Add code snippet or script demonstrating how to use this

Design & Code Changes
Checklist Before Submitting
Important
Please check all the following items before requesting a review, otherwise the reviewer might deprioritize this PR for review.
- Run pre-commit install && pre-commit run --all-files --show-diff-on-failure --color=always
- Request a CI run in the ci-request channel in the verl Slack workspace. (If not accessible, please try the Feishu group (飞书群).)
- If this PR changes the recipe submodule, please also update the reference to the submodule commit via git submodule update --remote or cd recipe && git pull origin main.