Adversarial Attacks on Robotic Vision-Language-Action Models

This repository contains the experiments for the paper "Adversarial Attacks on Robotic Vision-Language-Action Models."

Installation

Requirements

Python 3.8+
CUDA-compatible GPU (for efficient optimization)
Dependencies listed in robo_env.yml

Setup

Clone this repository:

git clone https://github.com/eliotjones1/robogcg.git
cd robogcg

Create and activate the conda environment:

conda env create -f robo_env.yml
conda activate robo_env

Install the package:
```
pip install -e .
```

Install cotracker for TraceVLA:

git clone https://github.com/facebookresearch/co-tracker
cd co-tracker
pip install -e .
pip install matplotlib flow_vis tqdm tensorboard

Running Experiments

Single-Step GCG Experiments

To run the main gradient-based adversarial attacks:

python -m experiments.single_step.run_experiment \
    --config experiments/single_step/configs/libero_10/libero_10_0.json \
    --num-gpus 1

Persistence Experiments

To evaluate how well adversarial prompts persist across multiple frames:

./scripts/run_persistence_experiment.sh

Transfer Experiments

To run these experiments, you will need to clone two additional repositories:

For CogACT, simply do the following:

git clone https://github.com/microsoft/CogACT
cd CogACT
pip install -e .

For OpenPi0, you will need to clone the repo into the models directory:

git clone https://github.com/allenzren/open-pi-zero
mv open-pi-zero experiments/models/OpenPi0
cd experiments/models/OpenPi0
pip install -e .

To test transferability of attacks across different models:

./scripts/run_transfer_experiment.sh

TraceVLA-Specific Experiments

To run experiments specifically targeting the TraceVLA model architecture:

./scripts/run_trace_experiment.sh

Defense Mechanisms

Perplexity-Based Defense

./scripts/run_perplexity_defense.sh --perplexity_mode vla
# Or use LLM-only perplexity
./scripts/run_perplexity_defense.sh --perplexity_mode llm
# Run all variants
./scripts/run_perplexity_defense.sh --run_all_variants

Perturbation-Based Defense

./scripts/run_perturbations_defense.sh

System Prompt Defense

./scripts/run_sysprompt_defense.sh

Project Structure

robogcg/
├── experiments/
│   ├── defenses/           # Defense mechanism implementations
│   ├── models/             # Model wrapper implementations
│   └── single_step/        # Single-step experiment code
├── images/                 # Test images for experiments
│   ├── libero_10/          # LIBERO task images
│   ├── libero_goal/        # Goal-oriented task images
│   ├── libero_object/      # Object manipulation task images
│   ├── libero_spatial/     # Spatial reasoning task images
│   └── seed/               # Seed images for experiments
├── roboGCG/                # Core implementation of the RoboGCG framework
├── README.md               # This file
├── robo_env.yml            # Conda environment specification
├── pyproject.toml          # Project metadata
└── setup.py                # Package installation configuration

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Adversarial Attacks on Robotic Vision-Language-Action Models

Installation

Requirements

Setup

Running Experiments

Single-Step GCG Experiments

Persistence Experiments

Transfer Experiments

TraceVLA-Specific Experiments

Defense Mechanisms

Perplexity-Based Defense

Perturbation-Based Defense

System Prompt Defense

Project Structure

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
experiments		experiments
images		images
roboGCG		roboGCG
scripts		scripts
README.md		README.md
pyproject.toml		pyproject.toml
robo_env.yml		robo_env.yml
setup.py		setup.py

eliotjones1/robogcg

Folders and files

Latest commit

History

Repository files navigation

Adversarial Attacks on Robotic Vision-Language-Action Models

Installation

Requirements

Setup

Running Experiments

Single-Step GCG Experiments

Persistence Experiments

Transfer Experiments

TraceVLA-Specific Experiments

Defense Mechanisms

Perplexity-Based Defense

Perturbation-Based Defense

System Prompt Defense

Project Structure

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages