Reinforcement Learning Basics

python 3.7

gym 0.18.0

tensorflow 1.15.0

torch 1.7.1

Model-free (tabular setting)

1. Find Value Function

Monte Carlo
Temporal Difference (TD)
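
As a rough illustration of the two value-estimation methods listed above, the sketch below evaluates a fixed random policy on a small random-walk chain. The environment, the step size `alpha`, and all function names are assumptions made for this example; they are not taken from the repository code, and only the standard library is used instead of gym.

```python
import random

N_STATES = 5   # non-terminal states 1..5; states 0 and 6 are terminal (assumed toy chain)
GAMMA = 1.0

def run_episode(rng):
    """Random walk from the middle state; returns (state, reward, next_state) transitions."""
    s, transitions = 3, []
    while s not in (0, N_STATES + 1):
        s_next = s + rng.choice((-1, 1))
        r = 1.0 if s_next == N_STATES + 1 else 0.0   # reward only at the right end
        transitions.append((s, r, s_next))
        s = s_next
    return transitions

def mc_evaluate(episodes, alpha=0.02, seed=0):
    """Every-visit, constant-alpha Monte Carlo: update toward the full return G."""
    rng = random.Random(seed)
    V = [0.0] * (N_STATES + 2)
    for _ in range(episodes):
        G = 0.0
        for s, r, _ in reversed(run_episode(rng)):
            G = r + GAMMA * G
            V[s] += alpha * (G - V[s])
    return V

def td0_evaluate(episodes, alpha=0.02, seed=0):
    """TD(0): update toward the bootstrapped target r + gamma * V(s')."""
    rng = random.Random(seed)
    V = [0.0] * (N_STATES + 2)
    for _ in range(episodes):
        for s, r, s_next in run_episode(rng):
            V[s] += alpha * (r + GAMMA * V[s_next] - V[s])
    return V
```

For this chain the true values of states 1 to 5 are 1/6, 2/6, ..., 5/6, so both estimates should drift toward those numbers. Monte Carlo waits for the episode return, while TD(0) bootstraps from the current estimate of the next state.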

2. Find Optimal Policy

Monte Carlo Control
SARSA
Q-Learning
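
The control methods above differ mainly in the target they update toward. The following is a minimal sketch on a hypothetical six-state corridor (not an environment from this repository): SARSA bootstraps from the action actually chosen next, while Q-learning bootstraps from the greedy action. The hyperparameters are illustrative assumptions.

```python
import random

N = 6              # states 0..5; state 5 is terminal (assumed toy corridor)
ACTIONS = (0, 1)   # 0 = left, 1 = right

def step(s, a):
    """Deterministic move; reward 1 only when the right end is reached."""
    s_next = max(0, s - 1) if a == 0 else s + 1
    done = s_next == N - 1
    return s_next, (1.0 if done else 0.0), done

def eps_greedy(Q, s, eps, rng):
    if rng.random() < eps:
        return rng.choice(ACTIONS)
    best = max(Q[s])
    return rng.choice([a for a in ACTIONS if Q[s][a] == best])  # random tie-break

def train(method, episodes=500, alpha=0.1, gamma=0.9, eps=0.1, seed=0):
    """method is 'sarsa' (on-policy) or anything else for Q-learning (off-policy)."""
    rng = random.Random(seed)
    Q = [[0.0, 0.0] for _ in range(N)]
    for _ in range(episodes):
        s, done = 0, False
        a = eps_greedy(Q, s, eps, rng)
        while not done:
            s_next, r, done = step(s, a)
            # For brevity the next action is sampled before the update in both methods.
            a_next = eps_greedy(Q, s_next, eps, rng)
            if done:
                target = r
            elif method == "sarsa":
                target = r + gamma * Q[s_next][a_next]   # action actually taken next
            else:
                target = r + gamma * max(Q[s_next])      # greedy action
            Q[s][a] += alpha * (target - Q[s][a])
            s, a = s_next, a_next
    return Q
```

Calling `train("sarsa")` or `train("q_learning")` returns a table in which the "right" action should end up with the higher value in every non-terminal state.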

Deep RL (non-tabular setting)

1. Value-Based

DQN
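
Two ingredients commonly associated with DQN are an experience replay buffer and a bootstrapped target computed from a separate target network. The sketch below shows both in plain Python; `ReplayBuffer`, `td_targets`, and the `target_q` callable are hypothetical names for illustration and are not the repository's tensorflow or torch implementation.

```python
import random
from collections import deque

class ReplayBuffer:
    """Fixed-size store of (state, action, reward, next_state, done) transitions."""

    def __init__(self, capacity, seed=0):
        self.buffer = deque(maxlen=capacity)   # oldest transitions are dropped first
        self.rng = random.Random(seed)

    def push(self, state, action, reward, next_state, done):
        self.buffer.append((state, action, reward, next_state, done))

    def sample(self, batch_size):
        """Uniform sample without replacement."""
        return self.rng.sample(list(self.buffer), batch_size)

    def __len__(self):
        return len(self.buffer)

def td_targets(batch, target_q, gamma=0.99):
    """y = r for terminal transitions, else r + gamma * max_a' Q_target(s', a').

    target_q is assumed to map a state to a list of action values.
    """
    return [
        r if done else r + gamma * max(target_q(s_next))
        for (_, _, r, s_next, done) in batch
    ]
```

In a full agent the online network would be regressed toward these targets on each sampled batch, with the target network refreshed only periodically.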

2. Policy-Based (policy gradient)

REINFORCE
PPO
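
To make the policy-gradient idea concrete, here is a small sketch of REINFORCE on a two-armed bandit with a softmax policy, plus the clipped surrogate term used by PPO for a single sample. The bandit payoffs, learning rate, and function names are assumptions for this example, not values from the repository.

```python
import math
import random

def softmax(prefs):
    m = max(prefs)                       # subtract the max for numerical stability
    exps = [math.exp(p - m) for p in prefs]
    z = sum(exps)
    return [e / z for e in exps]

def reinforce_bandit(steps=2000, lr=0.1, seed=0):
    """REINFORCE without a baseline on a hypothetical two-armed Bernoulli bandit."""
    rng = random.Random(seed)
    win_prob = (0.2, 0.8)                # assumed arm payoffs
    theta = [0.0, 0.0]                   # action preferences
    for _ in range(steps):
        probs = softmax(theta)
        a = 0 if rng.random() < probs[0] else 1
        G = 1.0 if rng.random() < win_prob[a] else 0.0
        # d/d theta[k] of log pi(a) equals 1[k == a] - probs[k] for a softmax policy
        for k in range(2):
            theta[k] += lr * G * ((1.0 if k == a else 0.0) - probs[k])
    return softmax(theta)

def ppo_clip_objective(ratio, advantage, clip_eps=0.2):
    """PPO clipped surrogate for one sample: min(r * A, clip(r, 1 - eps, 1 + eps) * A)."""
    clipped = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
    return min(ratio * advantage, clipped * advantage)
```

`reinforce_bandit()` returns the final action probabilities, which should favor the better arm. The clipping in `ppo_clip_objective` removes the incentive to move the probability ratio far from 1 in a single update.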

3. Actor-Critic (value-based + policy-based)

TD Actor-Critic
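
A one-step TD actor-critic can be sketched in tabular form: the critic learns state values with TD(0), and the actor uses the same TD error as its learning signal. The corridor environment, step sizes, and names below are assumptions for illustration, and the sketch omits the discount-power factor that some textbook versions apply to the actor update.

```python
import math
import random

N = 5   # states 0..4; state 4 is terminal (assumed toy corridor)

def step(s, a):
    """Action 0 moves left, action 1 moves right; reward 1 on reaching the right end."""
    s_next = max(0, s - 1) if a == 0 else s + 1
    done = s_next == N - 1
    return s_next, (1.0 if done else 0.0), done

def policy(theta, s):
    """Softmax over the two action preferences of state s."""
    m = max(theta[s])
    exps = [math.exp(t - m) for t in theta[s]]
    z = sum(exps)
    return [e / z for e in exps]

def td_actor_critic(episodes=1000, alpha_v=0.1, alpha_pi=0.1, gamma=0.9, seed=0):
    rng = random.Random(seed)
    V = [0.0] * N                             # critic: state values
    theta = [[0.0, 0.0] for _ in range(N)]    # actor: action preferences
    for _ in range(episodes):
        s, done = 0, False
        while not done:
            probs = policy(theta, s)
            a = 0 if rng.random() < probs[0] else 1
            s_next, r, done = step(s, a)
            # TD error: both the critic's update and the actor's advantage estimate
            delta = r + (0.0 if done else gamma * V[s_next]) - V[s]
            V[s] += alpha_v * delta
            for k in range(2):
                theta[s][k] += alpha_pi * delta * ((1.0 if k == a else 0.0) - probs[k])
            s = s_next
    return theta, V
```

After training, `policy(theta, s)` should put most of its probability on moving right in each non-terminal state.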

Folders and files

.idea
ActorCritic
DQN
ModelFree_FindOptimalPolicy
ModelFree_FindValueFunction
PolicyGradient
README.md

sumin123/RLstudy