diffusion-policies

Diffusion Policy and other generative model policies

Getting Started

At the root level, first create the environment:

conda env create -f environment.yaml

and activate it:

conda activate diffpolicy

Run setup:

python setup.py && pip install -e .

For training data, you can download training data from the original Diffusion Policy repository. Below we use PushT:

cd data && wget https://diffusion-policy.cs.columbia.edu/data/training/pusht.zip && unzip pusht.zip && rm -f pusht.zip && cd ..

Navigate to diffusion_policy/config, select the config for architecture and modality you want to train for, and change the config as necessary (i.e., task). Make sure to wandb login if you have not already. Then, navigate back up to the network architecture folder that you chose and run the training script, for example

python train_unet_image_policy.py

Credit

The implementations here are not necessarily original. Please reference the original repositories (linked below) literature (cited at the bottom)!

Diffusion Policy: see the README.md under diffusion_policy/ for more details on the practical implementation in this repository.

Shape Suffixes and Dimension Key

This repository annotates tensors with shape suffixes, which simply indicate what you would expect to receive after calling .shape on the tensor. To read about why, see this blog post from Noam Shazeer. In this repository, we use the following dimension key:

"""
Dimension key:

B: batch size
T: prediction horizon
    To: observation horizon
    Ta: action horizon
F: feature dimension
    Fo: observation feature dimension
    Fa: action feature dimension
D: embedding dimension
C: (generic) conditioning dimension, sometimes channel dimension
G: global conditioning dimension
L: local conditioning dimension
I: (conv, generic) input channel dimension
O: (conv, generic) output channel dimension
H: (image) height
W: (image) width

Tensors are denoted with brackets [ ], i.e., [ B, T, L ] and suffixed,
i.e., x_BOT. If we are just using the variable as a dimension (int),
no brackets are present, and the name will additionally be suffixed
with _dim_, i.e., inp_dim_F.

To read tensors, i.e., cond_BTL, apply the last dimension as a prefix
when applicable (L, G). Our example would read "local conditioning
tensor of shape (B, T, L)". Otherwise, just read the tensor directly,
adding the shape as a suffix.
"""

Citations

@inproceedings{chi2023diffusionpolicy,
	title={Diffusion Policy: Visuomotor Policy Learning via Action Diffusion},
	author={Chi, Cheng and Feng, Siyuan and Du, Yilun and Xu, Zhenjia and Cousineau, Eric and Burchfiel, Benjamin and Song, Shuran},
	booktitle={Proceedings of Robotics: Science and Systems (RSS)},
	year={2023}
}

Name		Name	Last commit message	Last commit date
Latest commit History 61 Commits
data		data
diffusion_policy		diffusion_policy
shared		shared
shortcut_policy		shortcut_policy
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.yaml		environment.yaml
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

diffusion-policies

Getting Started

Credit

Shape Suffixes and Dimension Key

Citations

About

Releases

Packages

Languages

License

UCLA-Robot-Intelligence-Lab/diffusion-policies

Folders and files

Latest commit

History

Repository files navigation

diffusion-policies

Getting Started

Credit

Shape Suffixes and Dimension Key

Citations

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages