You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello, the author's paper mentions that the range of one-dimensional action is from 0 to 1, where 0 to 0.5 represents a left turn and 0.5 to 1 represents a right turn. Therefore, I think it is okay to set the lower bound of the parameter to 0.
Hello,
I find your work is really helpful and I really appreciate it, however I found a bug at critic update stage which affect the final performance.
It is in TD3HUG.py at L80-L81
I think the first parameter in clamp should be the lower limit rather than 0, and I think noise1 should use the NOISE_CLIP hyperparameter.
I change these two lines into
There might be also bugs about clamp elsewhere, but I didn't check.
Would be very appreciated if you look into this.
The text was updated successfully, but these errors were encountered: