You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Updates SB3 ppo cfg so it trains under reasonable amount of time (#3726)
# Description
This PR fixes the sb3_ppo_cfg for task Isaac-Ant-v0
the parameter before had 4096 num_envs + horizon 512 + batch size 128 +
n_epoch 20,
that means the training one cycle it needs to for loop (20 * 512 * 4096)
/ 128 = 327680 times!
which appears as if it is hanging forever
the new config matches more closely with that of rl_games.
I verified it will trains under 5 min
[Screencast from 2025-10-15
13-56-21.webm](https://github.com/user-attachments/assets/2bc7bcd8-0063-46b9-adb0-67a6aa686732)
## Type of change
<!-- As you go through the list, delete the ones that are not
applicable. -->
- Bug fix (non-breaking change which fixes an issue)
## Screenshots
Please attach before and after screenshots of the change if applicable.
<!--
Example:
| Before | After |
| ------ | ----- |
| _gif/png before_ | _gif/png after_ |
To upload images to a PR -- simply drag and drop an image while in edit
mode and it should upload the image directly. You can then paste that
source into the above before/after sections.
-->
## Checklist
- [x] I have read and understood the [contribution
guidelines](https://isaac-sim.github.io/IsaacLab/main/source/refs/contributing.html)
- [x] I have run the [`pre-commit` checks](https://pre-commit.com/) with
`./isaaclab.sh --format`
- [ ] I have made corresponding changes to the documentation
- [x] My changes generate no new warnings
- [ ] I have added tests that prove my fix is effective or that my
feature works
- [ ] I have updated the changelog and the corresponding version in the
extension's `config/extension.toml` file
- [x] I have added my name to the `CONTRIBUTORS.md` or my name already
exists there
<!--
As you go through the checklist above, you can mark something as done by
putting an x character in it
For example,
- [x] I have done this task
- [ ] I have not done this task
-->
0 commit comments