GitHub - vermashresth/diagnose-damage-RL: One Shot Damage diagnosis through LSTM based network.

One Shot Damage Diagnosis

This is the code for a portion of my B.Tech Thesis Project Deep Reinforcement Learning for Stability and Safe Adaptation in Damaged Robots. It can diagnoze damage in any locomatory OpenAI gym agent using only one rollout of 20 timesteps.

Requirements

Keras
Tensorflow
OpenAI gym and Mujoco (See installation instructions here)
Joblib

How to use

First collect samples of damaged robot data using

python sampler.py experts/Ant-v1.pkl Ant-v1 --num_rollouts 2000

(Note that this step is paraellized over multiple threads. I have written this code for 4 threads, it can easily be scaled up for clusters having large number of available threads.)

Then load the pickled data and train the LSTM network

python rnn_train.py data_pickles/Ant-v1_4joints20diff102type1.dict -s saved_models/myclean.h5 -e 50

Generate some test data again using sampler and run testing network

python rnn_test.py -m saved_models/my_modelant4jointsday32_eff_div_2type.h5 -d data_pickles/Ant-v1_4joints20diff1002type1.dict

This work is still in progress. Feel free to contact me if you are interested in this kind of architecture or want to discuss any ideas.

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
error_logs		error_logs
experts		experts
misc		misc
samplers		samplers
train_old		train_old
.gitignore		.gitignore
README.md		README.md
load_policy.py		load_policy.py
load_policy.pyc		load_policy.pyc
plot_model.py		plot_model.py
rnn_test.py		rnn_test.py
rnn_train.py		rnn_train.py
sampler.py		sampler.py
tf_util.py		tf_util.py
tf_util.pyc		tf_util.pyc

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

One Shot Damage Diagnosis

Requirements

How to use

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

One Shot Damage Diagnosis

Requirements

How to use

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages