Diffusion Policy

[Project page] [Paper] [Data] [Colab (state)] [Colab (vision)]

Cheng Chi¹, Siyuan Feng², Yilun Du³, Zhenjia Xu¹, Eric Cousineau², Benjamin Burchfiel², Shuran Song¹

¹Columbia University, ²Toyota Research Institute, ³MIT

🛝 Try it out!

Our self-contained Google Colab notebooks is the easiest way to play with Diffusion Policy. We provide separate notebooks for state-based environment and vision-based environment.

🛠️ Installation

🖥️ Simulation

To reproduce our simulation benchmark results, install our conda environment on a Linux machine with Nvidia GPU. On Ubuntu 20.04 you need to install the following apt packages for mujoco:

$ sudo apt install -y libosmesa6-dev libgl1-mesa-glx libglfw3 patchelf

We recommend Mambaforge instead of the standard anaconda distribution for faster installation:

$ mamba env create -f conda_environment.yaml

but you can use conda as well:

$ conda env create -f conda_environment.yaml

The conda_environment_macos.yaml file is only for development on MacOS and does not have full support for benchmarks.

🦾 Real Robot

Hardware (for Push-T):

1x RB10
1x D405, D435I
1x VR tracker (for teleop)

Software:

Ubuntu 20.04.3 (tested)

🖥️ Reproducing Simulation Benchmark Results

Download Training Data

Under the repo root, create data subdirectory:

[diffusion_policy]$ mkdir data && cd data

Download the corresponding zip file from https://diffusion-policy.cs.columbia.edu/data/training/

[data]$ wget https://diffusion-policy.cs.columbia.edu/data/training/pusht.zip

Extract training data:

[data]$ unzip pusht.zip && rm -f pusht.zip && cd ..

Grab config file for the corresponding experiment:

[diffusion_policy]$ wget -O image_pusht_diffusion_policy_cnn.yaml https://diffusion-policy.cs.columbia.edu/data/experiments/image/pusht/diffusion_policy_cnn/config.yaml

Running for a single seed

Activate conda environment and login to wandb (if you haven't already).

[diffusion_policy]$ conda activate robodiff
(robodiff)[diffusion_policy]$ wandb login

Launch training with seed 42 on GPU 0.

(robodiff)[diffusion_policy]$ python train.py --config-dir=. --config-name=image_pusht_diffusion_policy_cnn.yaml training.seed=42 training.device=cuda:0 hydra.run.dir='data/outputs/${now:%Y.%m.%d}/${now:%H.%M.%S}_${name}_${task_name}'

This will create a directory in format data/outputs/yyyy.mm.dd/hh.mm.ss_<method_name>_<task_name> where configs, logs and checkpoints are written to. The policy will be evaluated every 50 epochs with the success rate logged as test/mean_score on wandb, as well as videos for some rollouts.

(robodiff)[diffusion_policy]$ tree data/outputs/2023.03.01/20.02.03_train_diffusion_unet_hybrid_pusht_image -I wandb
data/outputs/2023.03.01/20.02.03_train_diffusion_unet_hybrid_pusht_image
├── checkpoints
│   ├── epoch=0000-test_mean_score=0.134.ckpt
│   └── latest.ckpt
├── .hydra
│   ├── config.yaml
│   ├── hydra.yaml
│   └── overrides.yaml
├── logs.json.txt
├── media
│   ├── 2k5u6wli.mp4
│   ├── 2kvovxms.mp4
│   ├── 2pxd9f6b.mp4
│   ├── 2q5gjt5f.mp4
│   ├── 2sawbf6m.mp4
│   └── 538ubl79.mp4
└── train.log

3 directories, 13 files

🦾 Demo, Training and Eval on a Real Robot

Make hdf5 dataset for RB10 robot. Press 's' to start saving data and 'q' to quit. Then, a prompt will appear asking whether to save this demo data: y/n. When you have collected the desired number of demos, press 't' to terminate.

(robodiff)[diffusion_policy]$ python bae_hdf_maker_abs.py

Data format

data
 - demo_0
   - obs
     - robot_eef_pos (3)
     - robot_eef_quat (4)
     - image0 (240, 320)
     - image1 (240, 320)
   - actions (9)
 
 - demo_1
   ...

Train a Diffusion Policy. You can train on various tasks and settings by specifying different configurations. And you can also adjust hyperparameters or dataset in config.

(robodiff)[diffusion_policy]$ python train.py --config-name=bae_train_diffusion_transformer_real_hybrid_workspace task=bae_push_image_abs

Assuming the training has finished and you have a checkpoint at data/outputs/blah/checkpoints/latest.ckpt, launch the evaluation script with:

python bae_eval_real_robot.py --input data/outputs/blah/checkpoints/latest.ckpt --output data/results

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.beads		.beads
diffusion_policy		diffusion_policy
media		media
son_utils		son_utils
tests		tests
.gitattributes		.gitattributes
.gitignore		.gitignore
AGENTS.md		AGENTS.md
LICENSE		LICENSE
OOD_CHANGE_LOG.md		OOD_CHANGE_LOG.md
README.md		README.md
conda_environment.yaml		conda_environment.yaml
conda_environment_macos.yaml		conda_environment_macos.yaml
conda_environment_real.yaml		conda_environment_real.yaml
dataset_gen.py		dataset_gen.py
demo_pusht.py		demo_pusht.py
demo_real_robot.py		demo_real_robot.py
eval.py		eval.py
eval_real_robot.py		eval_real_robot.py
image_pusht_diffusion_policy_cnn.yaml		image_pusht_diffusion_policy_cnn.yaml
pyrightconfig.json		pyrightconfig.json
rb10_angle_test.py		rb10_angle_test.py
rb10_eval_real_robot.py		rb10_eval_real_robot.py
rb10_eval_real_robot_ood.py		rb10_eval_real_robot_ood.py
rb10_eval_vr_collect copy.py		rb10_eval_vr_collect copy.py
rb10_eval_vr_collect.py		rb10_eval_vr_collect.py
rb10_infer_label_demo.py		rb10_infer_label_demo.py
rb10_robot_test.py		rb10_robot_test.py
requirement.txt		requirement.txt
setup.py		setup.py
test.py		test.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Diffusion Policy

🛝 Try it out!

🛠️ Installation

🖥️ Simulation

🦾 Real Robot

🖥️ Reproducing Simulation Benchmark Results

Download Training Data

Running for a single seed

🦾 Demo, Training and Eval on a Real Robot

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Diffusion Policy

🛝 Try it out!

🛠️ Installation

🖥️ Simulation

🦾 Real Robot

🖥️ Reproducing Simulation Benchmark Results

Download Training Data

Running for a single seed

🦾 Demo, Training and Eval on a Real Robot

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages