log all trajectory repetitions per training step (not just first) by arteemg · Pull Request #1388 · NovaSky-AI/SkyRL

arteemg · 2026-03-25T20:03:41Z

Problem

During training, log_example was called only on response_ids[0] and rewards[0], meaning only the first of n_samples_per_prompt trajectories was ever logged per training step. With n_samples_per_prompt=8, 7 out of 8 rollouts were invisible, making it impossible to debug agent behavior across repetitions.

Change

Loop over all trajectories in the batch and log each one, labeled with its instance_id and repetition_id from TrajectoryID (when available), or a fallback index.

Example

Run after the fix, with all rollouts for a single contest (4316) being correctly logged:

code_contests_4316_all_reps.log

## Problem During training, `log_example` was called only on `response_ids[0]` and `rewards[0]`, meaning only the first of `n_samples_per_prompt` trajectories was ever logged per training step. With `n_samples_per_prompt=8`, 7 out of 8 rollouts were invisible, making it impossible to debug agent behavior across repetitions. ## Change Loop over all trajectories in the batch and log each one, labeled with its `instance_id` and `repetition_id` from `TrajectoryID` (when available), or a fallback index.

arteemg · 2026-03-26T14:48:43Z

@SumanthRH could you take a look! thank you in advance!

This comment was marked as resolved.

Sign in to view

update for step-wise training mode

dd76564

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

log all trajectory repetitions per training step (not just first)#1388

log all trajectory repetitions per training step (not just first)#1388
arteemg wants to merge 2 commits intoNovaSky-AI:mainfrom
arteemg:main

arteemg commented Mar 25, 2026 •

edited

Loading

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

arteemg commented Mar 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

arteemg commented Mar 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Problem

Change

Example

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

arteemg commented Mar 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

arteemg commented Mar 25, 2026 •

edited

Loading