Skip to content

Commit

Permalink
Merge pull request #575 from carschandler/patch-1
Browse files Browse the repository at this point in the history
Confusing wording in self-play.mdx
  • Loading branch information
simoninithomas authored Dec 20, 2024
2 parents d411cea + 1c1cf48 commit dba30dd
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion units/en/unit7/self-play.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -37,7 +37,7 @@ The theory behind self-play is not something new. It was already used by Arthur

Self-Play is integrated into the MLAgents library and is managed by multiple hyperparameters that we’re going to study. But the main focus, as explained in the documentation, is the **tradeoff between the skill level and generality of the final policy and the stability of learning**.

Training against a set of slowly changing or unchanging adversaries with low diversity **results in more stable training. But a risk to overfit if the change is too slow.**
Training against a set of slowly changing or unchanging adversaries with low diversity **results in more stable training. But there is a risk of overfitting if the change is too slow.**

So we need to control:

Expand Down

0 comments on commit dba30dd

Please sign in to comment.