Fix line length in Reinforcement Learning
moschmdt committed Aug 2, 2024
1 parent 912b562 commit 31d2660
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions docs/soar_manual/05_ReinforcementLearning.md
@@ -282,8 +282,8 @@ An example walkthrough of a Sarsa update with $\alpha = 0.3$ and $\gamma = 0.9$
a_t) \right] = 0.3 \times [ 1.0 + 0.9 \times 0.5 - 1.3 ] = 0.045
$$
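The arithmetic in this walkthrough can be checked with a short Python sketch. It uses the values that appear in the equation (reward 1.0, next Q-value 0.5, current Q-value 1.3) and assumes an even split of the update across the two contributing rules `rl-1` and `rl-2`. The function name `sarsa_delta` is hypothetical, and the pre-update value 2.3 for `rl-1` is inferred from the updated value shown in the hunk rather than quoted from the manual.

```python
def sarsa_delta(alpha, gamma, reward, q_next, q_current):
    """Return the scaled Sarsa update alpha * (reward + gamma * q_next - q_current)."""
    return alpha * (reward + gamma * q_next - q_current)


# Values taken from the equation in the walkthrough.
delta = sarsa_delta(alpha=0.3, gamma=0.9, reward=1.0, q_next=0.5, q_current=1.3)

# Two rules contributed to the Q-value, so each receives half of the update.
num_rules = 2
share = delta / num_rules

# Assumption: rl-1 held 2.3 before the update (inferred, not quoted).
rl_1_old = 2.3
rl_1_new = rl_1_old + share

print(f"delta = {delta:.4f}")
print(f"share per rule = {share:.4f}")
print(f"rl-1 updated = {rl_1_new:.4f}")
```

Under these assumptions the update of 0.045 splits into 0.0225 per rule, which matches the updated `rl-1` value of 2.3225 in the diff.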
-Since `rl-1` and `rl-2` both contributed to the $Q$-value of `O1`, $\delta_t$ is
-evenly divided amongst them, resulting in updated values of
+Since `rl-1` and `rl-2` both contributed to the $Q$-value of `O1`,
+$\delta_t$ is evenly divided amongst them, resulting in updated values of
```Soar
rl-1: (<s> ^operator <o> = 2.3225)