Reinforcement Learning

Welcome to this repository dedicated to the exploration of Reinforcement Learning (RL). We'll journey through the fundamentals of RL, starting from the basic exploration/exploitation dilemma, all the way to employing deep neural networks to master complex games.

Overview

Reinforcement Learning is a type of machine learning where an agent learns by interacting with an environment, receiving feedback in the form of rewards or penalties. The main challenge is to find the best strategy, or policy, that will result in the maximum cumulative reward for the agent over time.

PART 1: Exploration vs. Exploitation

The journey begins with the classical problem of Multi-Armed Bandits. This problem elegantly captures the trade-off between exploring new options and exploiting known ones. In this section:

Understand the Exploration/Exploitation dilemma.
Implement a naive solution to the Multi-Armed Bandits problem.
Dive into advanced algorithms such as Upper Confidence Bound (UCB) and Thompson Sampling to optimize our choices.

PART 2: Dynamic Programming

Dynamic Programming offers powerful techniques to solve complex problems by breaking them down into simpler sub-problems. In the context of RL:

Learn about optimizing the value of each state with the Bellman equation.
Understand the idea of knowing the reward of each state and how it affects decision-making.
Get hands-on with the gym library, a toolkit for developing and comparing reinforcement learning algorithms.

PART 3: Q-Learning

Q-Learning provides a way to estimate the value of actions, not just states. Using the Taxi Game as a hands-on example:

Delve into the concept of Q-values and how they help agents make decisions.
Learn the process to optimize these Q-values for better performance.

PART 4: Deep Q-Learning

Taking Q-learning to the next level, we introduce neural networks to deal with high-dimensional input spaces, like images from video games.

Explore the combination of Q-learning with Deep Learning, resulting in Deep Q-Learning.
Implement Deep Q-Learning to master classic Atari games, showcasing the power of modern RL techniques.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
tp1-multi-armed-bandits		tp1-multi-armed-bandits
tp2-dynamic-programming		tp2-dynamic-programming
tp3-q-learning-sarsa		tp3-q-learning-sarsa
tp4-deep-q-learning		tp4-deep-q-learning
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Reinforcement Learning

Overview

Table of Contents

PART 1: Exploration vs. Exploitation

PART 2: Dynamic Programming

PART 3: Q-Learning

PART 4: Deep Q-Learning

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Ekliipce/Reinforcement-Learning

Folders and files

Latest commit

History

Repository files navigation

Reinforcement Learning

Overview

Table of Contents

PART 1: Exploration vs. Exploitation

PART 2: Dynamic Programming

PART 3: Q-Learning

PART 4: Deep Q-Learning

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages