[WIP] Added BCQ #378

sampreet-arthi · 2020-10-09T18:57:42Z

Stuff implemented:

Added BCQ under genrl/agents/offline
BCQ inherits from OffPolicyAgentAC. Architecture was very similar to TD3. Major differences were that the actor took in both state and action as input and the VAE obviously.
OfflineTrainer inherits from OffPolicyTrainer. Only difference is that it loads the buffer.
Refactored buffers and rollouts to inherit from BaseBuffer and remove redundant functions and converted all code to torch. No numpy is used in any of the buffer files now.

Stuff to do:

Haven't tested properly yet if it works. Currently created a toy replay buffer from DDPG on Pendulum-v0 with only 100 experiences
Will have to find a simple way to make the actor take in both state and action.

…save_buffers

sampreet-arthi · 2020-10-09T19:02:27Z

Buffers have been tested but not after the addition of BCQ so tests are failing rn

lgtm-com · 2020-10-09T19:22:16Z

This pull request introduces 3 alerts when merging 6c271ef into b8a45ab - view on LGTM.com

new alerts:

1 for Unused local variable
1 for Unused import
1 for Wrong number of arguments in a call

codecov · 2020-10-10T18:26:26Z

Codecov Report

Merging #378 into master will decrease coverage by 2.76%.
The diff coverage is 58.76%.

@@            Coverage Diff             @@
##           master     #378      +/-   ##
==========================================
- Coverage   91.28%   88.51%   -2.77%     
==========================================
  Files          90       93       +3     
  Lines        3809     3944     +135     
==========================================
+ Hits         3477     3491      +14     
- Misses        332      453     +121

Impacted Files	Coverage Δ
genrl/agents/deep/base/base.py	`93.75% <ø> (ø)`
genrl/agents/deep/base/onpolicy.py	`96.15% <ø> (ø)`
genrl/trainers/onpolicy.py	`92.00% <ø> (ø)`
genrl/agents/offline/bcq/bcq.py	`23.86% <23.86%> (ø)`
genrl/trainers/offline.py	`27.77% <27.77%> (ø)`
genrl/core/models.py	`33.33% <33.33%> (ø)`
genrl/trainers/base.py	`81.30% <47.05%> (-6.87%)`	⬇️
genrl/core/buffers.py	`92.94% <91.80%> (-2.30%)`	⬇️
genrl/core/rollouts.py	`96.77% <96.77%> (ø)`
genrl/agents/__init__.py	`100.00% <100.00%> (ø)`
... and 13 more

lgtm-com · 2020-10-10T18:42:15Z

This pull request introduces 4 alerts when merging b28c1e6 into a2c8c7e - view on LGTM.com

new alerts:

3 for Unused import
1 for Signature mismatch in overriding method

lgtm-com · 2020-10-16T19:22:08Z

This pull request introduces 4 alerts when merging 3db4733 into 25eb018 - view on LGTM.com

new alerts:

3 for Unused import
1 for Signature mismatch in overriding method

lgtm-com · 2020-10-16T20:07:07Z

This pull request introduces 4 alerts when merging 43a483e into 25eb018 - view on LGTM.com

new alerts:

3 for Unused import
1 for Signature mismatch in overriding method

sampreet-arthi added 13 commits August 22, 2020 20:02

minor error fixes

5062268

Merge remote-tracking branch 'upstream/master'

5f22a16

Merge remote-tracking branch 'upstream/master'

0392e53

Merge remote-tracking branch 'upstream/master'

0c09a53

Merge remote-tracking branch 'upstream/master'

985ea1f

Merge remote-tracking branch 'upstream/master'

7b03555

initial commit, all work except DQN

bfb41c7

Merge remote-tracking branch 'upstream/master' into save_buffers

e3fd7cb

Merge remote-tracking branch 'upstream/master'

848c186

fixed actions shape error

424eb0b

Rollouts now support BaseBuffer

94ad00d

Introduced BCQ

e46653e

Merge branch 'master', remote-tracking branch 'upstream/master' into …

6c271ef

…save_buffers

sampreet-arthi added 2 commits October 10, 2020 13:18

functional BCQ now

b28c1e6

Merge remote-tracking branch 'upstream/master'

03e7f69

sampreet-arthi added 4 commits October 12, 2020 13:07

Merge remote-tracking branch 'upstream/master'

f4e782d

Merge branch 'master' into save_buffers

a0b7ff4

Made VAE more general and cleaned code

07a8153

added checkpointing for buffers

3db4733

fix errors

43a483e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] Added BCQ #378

[WIP] Added BCQ #378

sampreet-arthi commented Oct 9, 2020 •

edited

Loading

sampreet-arthi commented Oct 9, 2020

lgtm-com bot commented Oct 9, 2020

codecov bot commented Oct 10, 2020 •

edited

Loading

lgtm-com bot commented Oct 10, 2020

lgtm-com bot commented Oct 16, 2020

lgtm-com bot commented Oct 16, 2020

[WIP] Added BCQ #378

Are you sure you want to change the base?

[WIP] Added BCQ #378

Conversation

sampreet-arthi commented Oct 9, 2020 • edited Loading

sampreet-arthi commented Oct 9, 2020

lgtm-com bot commented Oct 9, 2020

codecov bot commented Oct 10, 2020 • edited Loading

Codecov Report

lgtm-com bot commented Oct 10, 2020

lgtm-com bot commented Oct 16, 2020

lgtm-com bot commented Oct 16, 2020

sampreet-arthi commented Oct 9, 2020 •

edited

Loading

codecov bot commented Oct 10, 2020 •

edited

Loading