PixelCopter Bot

Reinforcement learning agent that learns to play PixelCopter using Q-learning with neural network function approximations.

Developed this project to learn the basics of RL, Q-learning with NNs, and using Keras (Theano backend). As for running simulations, I used the Python Learning Environment, which has a number of pre-built games in Python (like Flappy Bird, Pong, PixelCopter, etc). This way, I didn't have to build the game but instead focus more on designing the agent, which is awesome!

Resources

I read a variety of articles and Medium blog posts to get familiar with RL. There are also a lot of Github implementations of other bots, like Flappy Bird, which I also used as a reference (but I couldn't find any for PixelCopter...)

Here are some links:

Q-learning with Neural Networks - this is the one I used the most, it's amazing. I recommend understanding parts 1 and 2 also.
An introduction to Deep Q-Learning: let’s play Doom
Playing Atari with Deep Reinforcement Learning - DeepMind Paper
David Silver's RL Course - I watched a few of these
Python Learning Environment - used to build the game

Notes on the model

The bot uses a simple 2-hidden layer NN built with Keras with Theano backend (5 input nodes, 20 nodes in first hidden layer, 10 nodes in second hidden layer, and 2 output nodes).
5 input units are given through PLE's getGameState method, which returns features like y position, velocity, distance to ceil, etc.
Initially, PixelCopter has random map generation per new episode and green blocks that the white pixel has to dodge. I removed both of these because training was taking too long to converge with the added complexity.
Training took around 8 hours for ~10,000 episodes on an old Dell Precision. It might have taken less but I just left it overnight.

How to run

Make sure to have Keras installed and running with Theano backend. Also have pygame and PLE installed. Then run the notebook on Jupyter. The game will run by using the most recently updated weights for the NN. To train the model from scratch again, follow uncomment/comment directions in the notebook.

Next steps

Expand PixelCopter map to run longer
Replicate RL agent but use a CNN to use pixel-by-pixel game frames as input instead of relying on PLE's methods to provide inputs

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.idea		.idea
.ipynb_checkpoints		.ipynb_checkpoints
ple		ple
README.md		README.md
heli.gif		heli.gif
heli.ipynb		heli.ipynb
heliVid.mov		heliVid.mov
w_2018-08-16_10_48_56.830225.h5py		w_2018-08-16_10_48_56.830225.h5py
w_2018-08-16_11_04_35.646263.h5py		w_2018-08-16_11_04_35.646263.h5py
w_2018-08-16_11_44_21.189374.h5py		w_2018-08-16_11_44_21.189374.h5py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PixelCopter Bot

Resources

Notes on the model

How to run

Next steps

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

PixelCopter Bot

Resources

Notes on the model

How to run

Next steps

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages