Skip to content

Releases: mthrok/luchador

Final release before split

20 May 14:50
Compare
Choose a tag to compare

Releasing before splitting NN module and RL module.

Stack ALE state

22 Mar 00:46
Compare
Choose a tag to compare
Move stacking from agent to env (#294)

* Move stacking from Agent to Env

* Add docstring

* Simplify preprocessor

Fix DQN loss function 2

16 Feb 07:01
Compare
Choose a tag to compare
v0.10.1

Get rid of Cost from DQN (#203)

Fix DQN loss function

15 Feb 23:23
Compare
Choose a tag to compare
Move clipping from cost to agent (#194)

* Move clipping from cost to agent

In the original form

- The form of the loss function was wrong

- It prevents us from computing correct error value

* Add gradient clip test

* Loose cost test

* Fix int test

Add Double DQN

10 Feb 16:26
Compare
Choose a tag to compare
v0.8.0

Add double dqn (#153)

v0.7.0: Update format (#160)

04 Feb 08:55
Compare
Choose a tag to compare

v0.6.0: Unify DQN Implementation (#151)

02 Feb 05:12
Compare
Choose a tag to compare
  • Cleaned up DQN implementation and equations
  • Add bunch of element-wise operations to Tensor

v0.5.0

26 Jan 04:35
Compare
Choose a tag to compare
Fix test log (#132)

Add CartPole

31 Dec 06:52
Compare
Choose a tag to compare

Add CartPole environment

v0.3.0

27 Oct 22:23
Compare
Choose a tag to compare
  • Fix Theano Initializers
  • Make compatibility between Theano and Tensorflow (almost) 100 %
  • Change DQN summary timing unit from episode to update
  • Refactor NN components.
  • Add layers to run simple MNIST classifier.