THIX: THICK world models in JAX

A reimplementation of THICK, our algorithm for learning hierarchical world models with adaptive temporal abstractions, in JAX based on DreamerV3.

Installation

The code has been tested on Linux and Mac and requires Python 3.11+.

Install JAX and then the other dependencies:

pip install -U -r embodied/requirements.txt
pip install -U -r dreamerv3/requirements.txt \
  -f https://storage.googleapis.com/jax-releases/jax_cuda_releases.html

Running experiments

Example training script for running DreamerV3 with a THICK world model

python dreamerv3/main.py \
  --logdir ~/logdir/{timestamp} \
  --configs crafter thick \
  --loss_scales.sparse 1.0

The thick config is needed for a THICK world model. Otherwise a flat RSSM is used.

THICK uses one hyperparameter $\beta^{\mathrm{sparse}}$, or loss_scales.sparse, which scales the sparsity regularization term in the loss function. This controls the frequency of context changes and the granularity of temporal abstractions. Typically, this hyperparameter needs to be tuned for every environment or suite. We provide a short tutorial for determining $\beta^{\mathrm{sparse}}$ for new environments.

The following settings have been tested:

Atari:

python dreamerv3/main.py \
  --logdir ~/logdir/{timestamp} \
  --configs atari100k thick size25m\
  --task atari100k_qbert
  --loss_scales.sparse 10.0
  --run.steps 1e6

Pokémon Red: For running Pokémon Red, you need to copy your legally obtained Pokémon Red ROM into the embodied/envs/pokemon_emulator/ directory. You can find the ROM using Google. It should be 1MB of size. Rename it to PokemonRed.gb.

python dreamerv3/main.py \
  --logdir ~/logdir/{timestamp} \
  --configs pokemon thick size25m\
  --loss_scales.sparse 1.0
  --run.steps 1e6

See the DreamerV3 code base for more tips on running Dreamer and our paper for details on THICK.

Acknowledgements

The code was primarily developed by Tomáš Daniš, with support from Christian Gumbsch. This code was developed based on the DreamerV3 code base. From the SENSEI code base we took the modified version of PokeGym. We provide the Pokémon game states from AI plays Pokemon .

Citation

@inproceedings{gumbsch2024thick,
title={Learning Hierarchical World Models with Adaptive Temporal Abstractions from Discrete Latent Dynamics},
author={Christian Gumbsch and Noor Sajid and Georg Martius and Martin V. Butz},
booktitle={The Twelfth International Conference on Learning Representations},
year={2024},
url={https://openreview.net/forum?id=TjCDNssXKU}
}

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
docs		docs
dreamerv3		dreamerv3
embodied		embodied
utils		utils
README.md		README.md
tune_sparsity.md		tune_sparsity.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

THIX: THICK world models in JAX

Installation

Running experiments

Acknowledgements

Citation

About

Uh oh!

Releases

Packages

Languages

CognitiveModeling/thix

Folders and files

Latest commit

History

Repository files navigation

THIX: THICK world models in JAX

Installation

Running experiments

Acknowledgements

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages