ExaRL tasks by tcfuji · Pull Request #241 · exalearn/EXARL

tcfuji · 2022-11-02T18:16:21Z

Made the update_target method agent-specific and no longer exposed to the workflow. It is now called inside the train call.
Added a common folder and moved the models and replay buffers into it. The agent vault now contains only agents.
Moved epsilon into the agents.
Added a beta parameter for prioritized experience replay (PER).
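For reviewers, a minimal sketch of the pattern described above: the agent owns epsilon and refreshes its target network from inside train, so the workflow never calls update_target directly. Everything here is hypothetical (the class name `SketchAgent`, the list-based "weights", the per-call epsilon decay, and the `update_target_every` parameter); it is not the EXARL agent API.

```python
import random


class SketchAgent:
    """Hypothetical agent; names and update rule are illustrative only."""

    def __init__(self, n_actions, epsilon=1.0, epsilon_min=0.05,
                 epsilon_decay=0.9, update_target_every=4):
        self.n_actions = n_actions
        # Assumption: exploration state lives on the agent, not the workflow.
        self.epsilon = epsilon
        self.epsilon_min = epsilon_min
        self.epsilon_decay = epsilon_decay
        self.update_target_every = update_target_every
        self.train_calls = 0
        # Stand-ins for the online and target models.
        self.online_weights = [0.0] * n_actions
        self.target_weights = list(self.online_weights)

    def action(self, rng=random):
        # Epsilon-greedy choice using the agent's own epsilon.
        if rng.random() < self.epsilon:
            return rng.randrange(self.n_actions)
        return max(range(self.n_actions), key=self.online_weights.__getitem__)

    def train(self, batch):
        # batch is a list of (action, reward) pairs in this toy example.
        for action, reward in batch:
            error = reward - self.online_weights[action]
            self.online_weights[action] += 0.1 * error
        self.train_calls += 1
        # Assumption: epsilon decays once per train call, with a floor.
        self.epsilon = max(self.epsilon_min, self.epsilon * self.epsilon_decay)
        # The target refresh is triggered here, not by the workflow.
        if self.train_calls % self.update_target_every == 0:
            self._update_target()

    def _update_target(self):
        # Private by convention: copy online weights into the target.
        self.target_weights = list(self.online_weights)
```

With this shape a workflow only calls `action` and `train`; how often the target is refreshed becomes an agent-level setting.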

This PR builds off of the develop_sept_22 branch.

2. Fixed support for -A option (previously not being respected)
3. Ensured TensorFlow is not loaded when using PyTorch
4. Fixed log/results directory to increment instead of defaulting to RUN001
5. Added convergence cutoffs to workflows
6. Fixed/simplified step counting in workflows
7. Added hyper-parameter tuning using sbatch and optuna
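As an illustration of the incrementing log/results directory mentioned above, here is a minimal sketch. The function name `next_run_dir`, the `RUN` prefix with three digits, and starting at index 0 are assumptions made for this example; the actual EXARL logic may differ.

```python
import os
import re


def next_run_dir(base_dir, prefix="RUN", width=3):
    """Create and return the next unused run directory under base_dir.

    Hypothetical helper: scans for names like RUN000, RUN001, ... and
    picks one past the highest index found (or index 0 if none exist).
    """
    os.makedirs(base_dir, exist_ok=True)
    pattern = re.compile(rf"^{re.escape(prefix)}(\d+)$")
    used = []
    for name in os.listdir(base_dir):
        match = pattern.match(name)
        if match:
            used.append(int(match.group(1)))
    next_index = max(used) + 1 if used else 0
    path = os.path.join(base_dir, f"{prefix}{next_index:0{width}d}")
    # Fails loudly if another process created the same directory first.
    os.makedirs(path)
    return path
```

Calling it twice on an empty base directory yields two distinct directories rather than reusing the first one.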

Jodasue and others added 8 commits September 24, 2022 15:39

1. Flake8
2. Added fix to sync with episode block leading to deadlock
3. Commented hyper-parameter tuning script
4. Fixed RUN000 to the next dir if logs exist
5. Adjusted output to match hyper-parameter script

a526e11

Fixed refactoring update_target. Found some superfluous variables (dst in send_model method and model_count in sync_learning.py

ddcf6eb

1. Updating docs for workflows
2. Minor fix to bsuite_batch help message

224a672

Merge branch 'develop_sept_22' of github.com:exalearn/EXARL into develop_sept_22

1bd1194

Finished 'Clean directory' task. Also found that exarl.utils.renderDMC should not be in driver/__main__.py

5de2edd

1. Added ability to load external agents and environments
2. Fixed print and end of run
3. Async convergence call had too many parameters

53e4aa6

Merge branch 'develop_sept_22' of github.com:exalearn/EXARL into develop_sept_22

dda9cf7

tcfuji requested a review from Jodasue November 2, 2022 18:19

Jodasue self-assigned this Nov 8, 2022

Jodasue requested a review from Himscipy November 8, 2022 18:46

tcfuji and others added 5 commits November 10, 2022 15:17

Finished removing epsilon from sync_learner.py and incorporated it into dqn.py. Need to resolve sample weight issue (line 392).

3ba9870

Merge branch 'develop' into develop_tf

eaf3735

Fixed the part of PER where (1-epsilon) is used as the exponent instead of a beta parameter. Also mentioned other troubling areas in dqn.py using TODOs.

d222485

1. Adding some update for epsilon and priority scale fixes
2. Updating analyze reward to match the logs without epsilon
3. Fixing plotille to ignore data before complete roll.

281c2e5

Fixed DQN epsilon.

56c6b01
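For context on the PER fix in commit d222485, here is a sketch of the standard prioritized experience replay formulation, in which the importance-sampling exponent is a dedicated beta parameter rather than something derived from epsilon. Sampling probabilities are P(i) = p_i^alpha / sum_k p_k^alpha and weights are w_i = (N * P(i))^(-beta), normalized by the largest weight. The function names and default values below are assumptions for illustration, and the code in dqn.py may be organized differently (for example, it may normalize over a sampled batch rather than the whole buffer).

```python
def per_probabilities(priorities, alpha=0.6):
    """Turn raw priorities into sampling probabilities.

    alpha = 0 gives uniform sampling; alpha = 1 samples in direct
    proportion to priority.
    """
    scaled = [p ** alpha for p in priorities]
    total = sum(scaled)
    return [s / total for s in scaled]


def importance_weights(probabilities, beta=0.4):
    """Importance-sampling weights with an explicit beta exponent.

    beta = 0 applies no correction (all weights are 1); beta = 1 fully
    corrects for the non-uniform sampling. Weights are divided by the
    maximum so that they only ever scale updates down.
    """
    n = len(probabilities)
    raw = [(n * p) ** (-beta) for p in probabilities]
    max_weight = max(raw)
    return [w / max_weight for w in raw]
```

In this formulation beta is typically annealed toward 1 over training on its own schedule, independent of the exploration rate.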

Jodasue approved these changes Dec 8, 2022

This has a lot of changes but is ultimately broken.

87f077e

tcfuji wants to merge 14 commits into develop from develop_tf

2 participants
