-
Notifications
You must be signed in to change notification settings - Fork 2.5k
Open
Labels
bugSomething isn't workingSomething isn't working
Description
Describe the Bug
When evaluating a recurrent policy in the RSL-RL framework, its hidden state is not reset upon episode termination.
This leads to an unexpected behavior due to residual hidden states from the previous episode, causing the policy to act inconsistently at the start of new episodes.
Steps to Reproduce
- Train a recurrent policy using the RSL-RL framework.
- Evaluate the trained policy using
play.py. - Let an episode terminate and a new one start.
- Observe that the recurrent memory retains information from the old episode, affecting initial and subsequent actions of the next episode.
Expected Behavior:
Each new episode should start with a fully reset recurrent state.
Actual Behavior:
Recurrent state persists between episodes during evaluation.
System Info
| Component | Version / Info |
|---|---|
| Commit | 1103a0f |
| Isaac Sim Version | 4.5.0 |
| OS | Ubuntu 22.04 |
| GPU | RTX 3060 |
| CUDA | 12.9 |
| GPU Driver | 575.64.03 |
Checklist
- I have checked that there is no similar issue in the repo (required)
- I have confirmed that the issue is not related to Isaac Sim itself but to this repository
Acceptance Criteria
- Hidden state of recurrent policy is reset when an episode terminates during evaluation.
Metadata
Metadata
Assignees
Labels
bugSomething isn't workingSomething isn't working