pufferl: log obs max/min/mean per eval step by eugenevinitsky · Pull Request #458 · Emerge-Lab/PufferDrive

eugenevinitsky · 2026-05-30T16:37:01Z

Summary

Adds three wandb scalars — `obs/max`, `obs/min`, `obs/mean` — computed each env step in `PuffeRL.evaluate()` from the same tensor that's about to enter the policy

Three new wandb keys (`obs/max`, `obs/min`, `obs/mean`) collected every env step in evaluate() and rolled up by mean_and_log. Each value is the scalar reduction across the full batch and obs-dim, so the wandb panel shows the absolute range of every feature the policy sees over the eval window. Useful for catching clipping, mis-normalization, or unbounded features post-training-tweak. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Copilot

Pull request overview

Adds three per-env-step observation statistics (obs/max, obs/min, obs/mean) to PuffeRL.evaluate() so they are surfaced in W&B (as environment/obs/{max,min,mean}) to help catch clipping/normalization regressions.

Changes:

Append o_device.max/min/mean().item() into self.stats on each evaluate step, before the policy forward pass.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot AI review requested due to automatic review settings May 30, 2026 16:37

Copilot started reviewing on behalf of eugenevinitsky May 30, 2026 16:37 View session

Copilot AI reviewed May 30, 2026

View reviewed changes

vcharraut approved these changes May 30, 2026

View reviewed changes

eugenevinitsky merged commit 943da10 into emerge/temp_training May 30, 2026
14 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pufferl: log obs max/min/mean per eval step#458

pufferl: log obs max/min/mean per eval step#458
eugenevinitsky merged 1 commit into
emerge/temp_trainingfrom
ev/log-obs-stats

eugenevinitsky commented May 30, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

eugenevinitsky commented May 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

eugenevinitsky commented May 30, 2026 •

edited

Loading