Skip to content

Local tripletriad newbind dev#211

Merged
jsuarez5341 merged 9 commits into
PufferAI:devfrom
xinpw8:local_tripletriad_newbind_dev
May 19, 2025
Merged

Local tripletriad newbind dev#211
jsuarez5341 merged 9 commits into
PufferAI:devfrom
xinpw8:local_tripletriad_newbind_dev

Conversation

@xinpw8
Copy link
Copy Markdown
Contributor

@xinpw8 xinpw8 commented May 3, 2025

Pushed as-is. Removed \ from env_binding.h. No changes to any other env besides tripletriad. Conversion to new bindings only. Some issues with tripletriad still exist: e.g. env reaches terminal or errantly resets after only a few turns. Can debug in future.

xinpw8 and others added 9 commits April 27, 2025 04:47
…so training may not match pufferbox. indeed, baseline did seem pretty shaky. baseline: https://wandb.ai/xinpw8/pufferlib/runs/m3frlbjf  refactor: https://wandb.ai/xinpw8/pufferlib/runs/58ro6521 also, some offsets weren't adjusted when NOOP was removed (commit e39332b); i updated those as well. interestingly, it doesn't seem to really have improved training, however.
…ad.h was off-by-one; c perf test added + local eval updated
@jsuarez5341 jsuarez5341 merged commit 13a245b into PufferAI:dev May 19, 2025
6 of 12 checks passed
@jsuarez5341
Copy link
Copy Markdown
Contributor

Merged - thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants