fix(agent_loop): handle batch size smaller than num_workers #5231
aoshen524 wants to merge 1 commit into verl-project:main
Conversation
…te_sequences

When the batch size is smaller than the number of agent loop workers, `prompts.chunk(len(self.agent_loop_workers))` produces fewer chunks than workers, causing `zip(..., strict=True)` to raise a `ValueError`. This fix caps the chunk count at `min(len(prompts), len(self.agent_loop_workers))` and uses index-based worker dispatch instead.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
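For context, the failure mode is easy to reproduce in isolation. Everything in the sketch below is a stand-in: plain lists replace `DataProto` and the Ray actor handles, and the even-split `chunk` only approximates `prompts.chunk` (these are assumptions, not verl's real implementations):

```python
def chunk(items, n):
    # torch.chunk-style split (a stand-in for DataProto.chunk, not the
    # real implementation): never yields empty chunks, so fewer than n
    # chunks come back when len(items) < n
    size = -(-len(items) // n)  # ceil division
    return [items[i : i + size] for i in range(0, len(items), size)]

workers = ["w0", "w1", "w2", "w3"]  # 4 agent loop workers
prompts = ["p0", "p1"]              # batch of 2 < 4 workers

chunks = chunk(prompts, len(workers))  # only 2 chunks come back
try:
    list(zip(workers, chunks, strict=True))
except ValueError as err:
    print(err)  # zip() argument 2 is shorter than argument 1
```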
Code Review
This pull request fixes a crash that occurs when the batch size is smaller than the number of agent loop workers. The fix correctly caps the number of chunks at the number of workers needed and adjusts the chunking of prompts accordingly. However, the changed code can still crash if no workers are available or if the input `prompts` is empty. I've added a suggestion to handle this edge case gracefully.
```diff
+        num_workers_needed = min(len(prompts), len(self.agent_loop_workers))
+        chunkes = prompts.chunk(num_workers_needed)
         outputs = ray.get(
             [
-                worker.generate_sequences.remote(chunk)
-                for worker, chunk in zip(self.agent_loop_workers, chunkes, strict=True)
+                self.agent_loop_workers[i % len(self.agent_loop_workers)].generate_sequences.remote(chunk)
+                for i, chunk in enumerate(chunkes)
             ]
         )
```
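The fixed dispatch pattern can be seen end to end in a self-contained sketch. `StubWorker` and `dispatch` are hypothetical stand-ins (no Ray involved), and `chunk` is the stand-in from the repro above; only the `min(...)` cap and the index-based dispatch mirror the actual patch:

```python
class StubWorker:
    """Hypothetical stand-in for an AgentLoopWorker Ray actor."""

    def __init__(self, name):
        self.name = name

    def generate_sequences(self, batch):
        return f"{self.name} handled {batch}"

def dispatch(prompts, workers):
    # Core of the fix: never request more chunks than there are prompts.
    num_workers_needed = min(len(prompts), len(workers))
    chunks = chunk(prompts, num_workers_needed)  # chunk() from the repro above
    # Index-based dispatch replaces zip(..., strict=True): each chunk
    # picks its worker by index, so a short batch simply leaves some
    # workers idle instead of raising ValueError.
    return [workers[i % len(workers)].generate_sequences(c) for i, c in enumerate(chunks)]

workers = [StubWorker(f"w{i}") for i in range(4)]
print(dispatch(["p0", "p1"], workers))
# ["w0 handled ['p0']", "w1 handled ['p1']"] -- w2 and w3 stay idle
```

With the modulo index, the dispatch would also stay correct if there were ever more chunks than workers, though the `min(...)` cap prevents that case here.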
The logic here can lead to a crash if `num_workers_needed` is 0, which happens when `prompts` or `self.agent_loop_workers` is empty. In that case `prompts.chunk(0)` is called, which raises an error because the number of chunks must be positive.

To prevent this, we should add a check that handles `num_workers_needed == 0` by returning an empty `DataProto`.
```diff
         num_workers_needed = min(len(prompts), len(self.agent_loop_workers))
+        if num_workers_needed == 0:
+            return DataProto.concat([])
         chunkes = prompts.chunk(num_workers_needed)
         outputs = ray.get(
             [
                 self.agent_loop_workers[i % len(self.agent_loop_workers)].generate_sequences.remote(chunk)
                 for i, chunk in enumerate(chunkes)
             ]
         )
```
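To make the concern concrete, here is the empty-input case in the same stub setting as the earlier sketches (again with the stand-in `chunk`, not `DataProto`'s real method, which would raise its own error for a non-positive chunk count):

```python
prompts = []
num_workers_needed = min(len(prompts), len(workers))  # 0
try:
    chunk(prompts, num_workers_needed)  # chunk count must be positive
except ZeroDivisionError as err:
    print("chunk(0) fails:", err)
```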
Both cases (empty workers / empty prompts) are pre-existing: the original code would also crash with `chunk(0)` or produce undefined behavior on empty input. These are initialization-time invariants guaranteed by the training loop, so adding a guard here would be over-engineering.
Summary
- When `batch_size < num_workers`, `prompts.chunk(len(self.agent_loop_workers))` produces fewer chunks than workers, causing `zip(..., strict=True)` to raise `ValueError`
- Caps the chunk count at `min(len(prompts), len(self.agent_loop_workers))` and uses index-based worker dispatch instead of a strict zip

Test plan
🤖 Generated with Claude Code