> [!TIP]
> OpenPi Comet is the submission of Team Comet for the 2025 BEHAVIOR Challenge. This repository provides a unified framework for pre-training, post-training, data generation, and evaluation of π0.5 (Pi05) models on BEHAVIOR-1K.
Our [submission] achieved a Q-score of 0.2514, securing 2nd place overall and finishing just behind the winning team by a narrow margin, highlighting both the strong competitiveness of our approach and the effectiveness of our end-to-end VLA training strategy.
This codebase contains:
- Distributed OpenPi training infrastructure
- Various pre-training setups, including hierarchical instructions (global, subtask, skill) and multimodal observations (RGB, depth, point cloud, segmentation, bounding boxes, human pointing)
- Post-training via Rejection Sampling Fine-Tuning (RFT) with automated dataset construction
- Data generation scripts, such as teleoperation and simulation rollouts using an existing policy
- Model zoo of pretrained VLA checkpoints trained on 1M+ robot interactions
Please check our [Report] for more details.
demo_low_res.mp4
- [Dec 6, 2025] Released the full submission codebase and pre-trained weights.
- [TODO] Upload our RFT dataset.
To run the models in this repository, you will need an NVIDIA GPU with at least the following specifications. These estimates assume a single GPU; you can also use multiple GPUs with model parallelism to reduce per-GPU memory requirements by configuring `fsdp_devices` in the training config (see the sketch after the table below). Note that the current training script does not yet support multi-node training.
| Mode | Memory Required | Example GPU |
|---|---|---|
| Inference | > 8 GB | RTX 4090 |
| Fine-Tuning (LoRA) | > 22.5 GB | RTX 4090 |
| Fine-Tuning (Full) | > 70 GB | A100 (80GB) / H100 |
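For example, a minimal sketch of enabling model parallelism, assuming `base_config` is one of the existing `TrainConfig` entries in `src/openpi/training/config.py`:

```python
import dataclasses

# Minimal sketch: take an existing TrainConfig (e.g. the fine-tuning example
# shown later in this README) and shard each model replica across 2 GPUs to
# reduce per-GPU memory. `base_config` is an assumed placeholder here.
config = dataclasses.replace(base_config, fsdp_devices=2)
```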
The repo has been tested on Ubuntu 22.04; we do not currently support other operating systems.
```bash
git clone https://github.com/mli0603/openpi-comet.git
git clone https://github.com/StanfordVL/BEHAVIOR-1K.git
```

These fine-tuning instructions are adapted from the original openpi repo.
Openpi uses uv to manage Python dependencies. See the uv installation instructions to set it up. Once uv is installed, run the following to set up the environment:
```bash
cd openpi-comet
GIT_LFS_SKIP_SMUDGE=1 uv sync
GIT_LFS_SKIP_SMUDGE=1 uv pip install -e .
source .venv/bin/activate

# Install behavior for server deploy
cd $PATH_TO_BEHAVIOR_1K
uv pip install -e bddl
uv pip install -e "OmniGibson[eval]"
```

We provide a suite of base VLA model checkpoints trained on 1M+ robot trajectories, ideal for BEHAVIOR-1K fine-tuning.
| Model Name | Description | HuggingFace URL |
|---|---|---|
| pi05-b1kpt12-cs32 | Pretrained model on tasks 0, 1, 6, 17, 18, 22, 30, 32, 34, 35, 40, 45 with action chunk size 32 | openpi_comet/pi05-b1kpt12-cs32 |
| pi05-b1kpt50-cs32 | Pretrained model on tasks 0-49 with action chunk size 32 | openpi_comet/pi05-b1kpt50-cs32 |
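To fetch a checkpoint locally, one option is `huggingface_hub.snapshot_download`; a minimal sketch is shown below, where the local directory is an arbitrary example and the repo IDs are the ones listed above.

```python
from huggingface_hub import snapshot_download

# Download a pretrained checkpoint from the model zoo
# (the local_dir path is just an example).
snapshot_download(
    repo_id="openpi_comet/pi05-b1kpt50-cs32",
    local_dir="checkpoints/pi05-b1kpt50-cs32",
)
```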
Full Model Zoo (Per Task)
| Model | Use Case | Task ID | Task Name | HF URL |
|---|---|---|---|---|
| pi05-turning_on_radio-sft | SFT | 0 | turning_on_radio | pi05-turning_on_radio-sft |
| pi05-pt12-pretrain-75k | Pretrain | 1 | picking_up_trash | pi05-pt12-pretrain-75k |
| pi05-putting_away_Halloween_decorations-sft | SFT | 2 | putting_away_Halloween_decorations | pi05-putting_away_Halloween_decorations-sft |
| pi05-pt50-pretrain-40k | Pretrain | 3, 4, 7, 9, 13, 15, 18, 20, 21, 23, 24, 25, 26, 28, 29, 31, 36, 41, 44, 48, 49 | cleaning_up_plates_and_food, can_meat, picking_up_toys, putting_up_Christmas_decorations_inside, loading_the_car, bringing_in_wood, tidying_bedroom, sorting_vegetables, collecting_childrens_toys, boxing_books_up_for_storage, storing_food, clearing_food_from_table_into_fridge, assembling_gift_baskets, getting_organized_for_work, clean_up_your_desk, clean_boxing_gloves, clean_a_patio, cook_cabbage, chopping_wood, canning_food, make_pizza | openpi_comet/pi05-b1kpt50-cs32 |
| pi05-pt50-pretrain-20k | Pretrain | 5 | setting_mousetraps | pi05-pt50-pretrain-20k |
| pi05-hiding_Easter_eggs-rft | RFT | 6 | hiding_Easter_eggs | pi05-hiding_Easter_eggs-rft |
| pi05-pt50-pretrain-45k | Pretrain | 8, 42 | rearranging_kitchen_furniture, chop_an_onion | pi05-pt50-pretrain-45k |
| pi05-set_up_a_coffee_station_in_your_kitchen-rft | RFT | 10 | set_up_a_coffee_station_in_your_kitchen | pi05-set_up_a_coffee_station_in_your_kitchen-rft |
| pi05-putting_dishes_away_after_cleaning-rft | RFT | 11 | putting_dishes_away_after_cleaning | pi05-putting_dishes_away_after_cleaning-rft |
| pi05-preparing_lunch_box-rft | RFT | 12 | preparing_lunch_box | pi05-preparing_lunch_box-rft |
| pi05-pt50-pretrain-35k | Pretrain | 14, 47 | carrying_in_groceries, freeze_pies | pi05-pt50-pretrain-35k |
| pi05-pt10-pretrain-40k | Pretrain | 16 | moving_boxes_to_storage | pi05-pt10-pretrain-40k |
| pi05-pt10-pretrain-45k | Pretrain | 17 | bringing_water | pi05-pt10-pretrain-45k |
| pi05-outfit_a_basic_toolbox-rft | RFT | 19 | outfit_a_basic_toolbox | pi05-outfit_a_basic_toolbox-rft |
| pi05-pt12-pretrain-85k | Pretrain | 22 | putting_shoes_on_rack | pi05-pt12-pretrain-85k |
| pi05-pt50-pretrain-15k | Pretrain | 27 | sorting_household_items | pi05-pt50-pretrain-15k |
| pi05-pt12-pretrain-40k | Pretrain | 30, 32 | setting_the_fire, wash_a_baseball_cap | openpi_comet/pi05-b1kpt12-cs32 |
| pi05-wash_dog_toys-sft | SFT | 33 | wash_dog_toys | pi05-wash_dog_toys-sft |
| pi05-hanging_pictures-rft | RFT | 34 | hanging_pictures | pi05-hanging_pictures-rft |
| pi05-pt12-pretrain-25k | Pretrain | 35 | attach_a_camera_to_a_tripod | pi05-pt12-pretrain-25k |
| pi05-clean_a_trumpet-sft | SFT | 37 | clean_a_trumpet | pi05-clean_a_trumpet-sft |
| pi05-spraying_for_bugs-rft | RFT | 38 | spraying_for_bugs | pi05-spraying_for_bugs-rft |
| pi05-pt50-pretrain-50k | Pretrain | 39, 46 | spraying_fruit_trees, cook_bacon | pi05-pt50-pretrain-50k |
| pi05-pt7-pretrain-50k | Pretrain | 40 | make_microwave_popcorn | pi05-pt7-pretrain-50k |
| pi05-slicing_vegetables-rft | RFT | 43 | slicing_vegetables | pi05-slicing_vegetables-rft |
| pi05-pt10-pretrain-40k-re | Pretrain | 45 | cook_hot_dogs | pi05-pt10-pretrain-40k-re |
Before each training launch, we need to compute the normalization statistics for the training data:
```bash
uv run scripts/compute_norm_stats.py --config-name pi05_b1k-turning_on_radio
```

This will create `norm_stats.json` under `assets/pi0_b1k/behavior-1k/2025-challenge-demos`, which will be used to normalize the training data.
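Conceptually, these statistics map state and action values into a normalized range before training. The snippet below is only an illustrative sketch of that idea (simple z-scoring), not openpi's exact transform or the exact schema of `norm_stats.json`.

```python
import numpy as np

# Illustrative only: normalize an action array with precomputed mean/std stats.
def normalize(actions: np.ndarray, mean: np.ndarray, std: np.ndarray) -> np.ndarray:
    return (actions - mean) / (std + 1e-6)
```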
Update the config in `src/openpi/training/config.py` with the task names you want (or `None` to include all tasks). For example, you can update the config as follows for the `turning_on_radio` task:

```python
TrainConfig(
name="pi05_b1k-turning_on_radio",
exp_name="openpi",
project_name="B1K",
model=pi0_config.Pi0Config(pi05=True, action_horizon=32),
data=LeRobotB1KDataConfig(
repo_id="behavior-1k/2025-challenge-demos",
base_config=DataConfig(
prompt_from_task=True,
episodes_index=list(range(200)),
behavior_dataset_root="../DATASETS/behavior/2025-challenge-demos",
tasks=["turning_on_radio"],
fine_grained_level=0, # 0: global instruction, 1: subtask instruction, 2: skill instruction
),
),
weight_loader=weight_loaders.CheckpointWeightLoader(
"The Model Path you want to finetune from, e.g., gs://openpi-assets/checkpoints/pi05_base/params\
or the checkpoint from our model zoo"
),
num_train_steps=30_000,
lr_schedule=_optimizer.CosineDecaySchedule(
peak_lr=2.5e-5,
decay_steps=30_000,
),
freeze_filter=pi0_config.Pi0Config(pi05=True, action_horizon=32).get_freeze_filter(),
ema_decay=None,
checkpoint_base_dir=".",
num_workers=8,
batch_size=8 * 32,
),
```

Then run the following command to fine-tune OpenPi:

```bash
export XLA_PYTHON_CLIENT_MEM_FRACTION=0.9
uv run scripts/train.py \
pi05_b1k-turning_on_radio \
--exp_name="openpi_$(date +%Y%m%d_%H%M%S)"To support distributed training, we update src/openpi/training/data_loader.py for data sharding, and the src/openpi/training/checkpoints_dist.py and scripts/train_dist.py for distributed checkpointing management and training. To launch the pretrain, run the following command:
# set dist training envs
export MASTER_ADDR=${SERVICE_PREFIX}-0.${SUBDOMAIN}
export WORLD_SIZE=${LEPTON_JOB_TOTAL_WORKERS}
export WORLD_RANK=${LEPTON_JOB_WORKER_INDEX}
export CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7
export MASTER_PORT=12350
config_name=pi05_b1k-pt50_cs32_bs64_lr2.5e-5_step50k_gpu40
exp_name=pi05_b1k-pt50_pretrain
python scripts/compute_norm_stats.py --config-name ${config_name}
python scripts/train_dist.py ${config_name} --exp_name=${exp_name} --overwrite
```

To perform RFT, first deploy the fine-tuned checkpoint and then roll out episodes in the BEHAVIOR-1K simulator. We also observe that the pose perturbator helps improve the robustness of the RFT algorithm (see the illustrative sketch below).
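As an illustration of what pose perturbation means here, the sketch below adds small random translation and yaw noise to an object pose before a rollout. The function name and noise magnitudes are hypothetical; the actual logic lives in `pose_perturbator.py` from the checklist below.

```python
import numpy as np

# Hypothetical sketch of pose perturbation for RFT rollouts: jitter an object's
# initial position and yaw so the policy sees slightly varied scenes.
def perturb_pose(position: np.ndarray, yaw: float,
                 pos_noise: float = 0.02, yaw_noise: float = 0.05) -> tuple[np.ndarray, float]:
    position = position + np.random.uniform(-pos_noise, pos_noise, size=3)
    yaw = yaw + np.random.uniform(-yaw_noise, yaw_noise)
    return position, yaw
```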
- Copy the files in `openpi_comet/src/behavior/learning` to `BEHAVIOR-1K/OmniGibson/omnigibson/learning`. Make sure you are on the latest commit of the BEHAVIOR-1K repo and replace the files in the checklist:
| Name | Description |
|---|---|
| `configs/base_config.yaml` | Hydra config with additional params, e.g., `env_wrapper`, `perturb_pose`, and `parallel_evaluator`. |
| `wrappers/rgb_wrapper.py` | Full-resolution RGB wrapper, helpful for evaluation. |
| `wrappers/__init__.py` | Registers the `RGBWrapper`. |
| `pose_perturbator.py` | Pose perturbator for RFT rollouts. |
| `eval_custom.py` | Custom evaluation script. |
- Run the RFT rollout in parallel:

```bash
python OmniGibson/omnigibson/learning/eval_custom.py policy=websocket \
save_rollout=true \
perturb_pose=true \
task.name=$TASK_NAME \
log_path=./outputs/rft \
use_parallel_evaluator=false \
parallel_evaluator_start_idx=0 \
parallel_evaluator_end_idx=10 \
model.port=8000 \
env_wrapper._target_=omnigibson.learning.wrappers.RGBWrapper
```

where `parallel_evaluator_start_idx` and `parallel_evaluator_end_idx` are the start and end indices of the parallel rollout; we can distribute the rollout across multiple GPUs by splitting the total number of instances into multiple parts.
- Build the RFT dataset: After the rollout, you can build the RFT dataset by running the following command:

```bash
python data_generation/rollout/create_rft_dataset.py \
--rollout_dir $PATH_TO_ROLLOUT_DATASET \
--rft_dir $PATH_TO_RFT_DATASET
```

Then, we can perform RFT training on the RFT dataset. Please refer to the RFT training config for more details.
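For intuition, rejection sampling fine-tuning keeps only the successful rollouts and turns them into a training dataset. The sketch below is a hypothetical illustration of that filtering step; the directory layout and the `success` field are assumptions, and the actual logic is in `create_rft_dataset.py`.

```python
import json
import shutil
from pathlib import Path

# Hypothetical sketch of rejection sampling: copy only successful rollout
# episodes into the RFT dataset directory. Assumes each episode folder
# contains a metadata.json with a boolean "success" field.
def build_rft_dataset(rollout_dir: str, rft_dir: str) -> None:
    rollout_root, rft_root = Path(rollout_dir), Path(rft_dir)
    rft_root.mkdir(parents=True, exist_ok=True)
    for episode in sorted(rollout_root.iterdir()):
        meta_file = episode / "metadata.json"
        if not meta_file.exists():
            continue
        if json.loads(meta_file.read_text()).get("success", False):
            shutil.copytree(episode, rft_root / episode.name, dirs_exist_ok=True)
```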
After fine-tuning, you can run evaluation by following the steps below:
- Deploy the fine-tuned checkpoint:

```bash
source .venv/bin/activate
uv run scripts/serve_b1k.py \
    --task_name=$TASK_NAME \
    policy:checkpoint \
    --control_mode=receeding_horizon \
    --max_len=32 \
    --policy.config=pi05_b1k-base \
    --policy.dir=$PATH_TO_CKPT
```

This opens a connection listening on 0.0.0.0:8000. Please check `scripts/serve_b1k.py` for more details.
- Run the evaluation on BEHAVIOR:

Assuming you have the behavior env installed (check https://github.com/StanfordVL/BEHAVIOR-1K for more details), run the following command within the BEHAVIOR-1K directory:

```bash
conda activate behavior
python OmniGibson/omnigibson/learning/eval.py \
    policy=websocket \
    task.name=$TASK_NAME \
    log_path=$LOG_PATH
    # env_wrapper._target_=omnigibson.learning.wrappers.RGBWrapper
```
NOTE: We recommend using the `RGBWrapper` for evaluation. Please follow the instructions in "Post-train OpenPi using Rejection Sampling Fine-Tuning (RFT)" to add the `RGBWrapper` or customize the evaluation script.
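If you want to query the deployed policy server directly (outside the BEHAVIOR evaluator), a minimal sketch using the `openpi_client` websocket policy is shown below; the observation keys and shapes are placeholders and must match what the policy served by `scripts/serve_b1k.py` actually expects.

```python
import numpy as np
from openpi_client import websocket_client_policy

# Minimal sketch: connect to the policy server started by serve_b1k.py and
# request an action chunk. The observation dict is a placeholder; the actual
# keys and shapes depend on the deployed policy's inputs.
client = websocket_client_policy.WebsocketClientPolicy(host="localhost", port=8000)
observation = {
    "observation/image": np.zeros((224, 224, 3), dtype=np.uint8),  # placeholder image
    "prompt": "turning_on_radio",
}
result = client.infer(observation)
actions = result["actions"]  # action chunk, e.g. shape (32, action_dim)
```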
If you encounter any issues, feel free to open an issue on GitHub or reach out through discussions. We appreciate your feedback and contributions!
If you find this work useful, please consider citing:
```bibtex
@article{bai2025openpicometcompetitionsolution,
  title={Openpi Comet: Competition Solution For 2025 BEHAVIOR Challenge},
  author={Junjie Bai and Yu-Wei Chao and Qizhi Chen and Jinwei Gu and Moo Jin Kim and Zhaoshuo Li and Xuan Li and Tsung-Yi Lin and Ming-Yu Liu and Nic Ma and Kaichun Mo and Delin Qu and Shangkun Sun and Hongchi Xia and Fangyin Wei and Xiaohui Zeng},
  journal={arXiv preprint arXiv:2512.10071},
  year={2025},
  url={https://arxiv.org/abs/2512.10071},
}
```
