ExPO-HM: Learning to Explain-then-Detect for Hateful Meme Detection

This is the official repository for ExPO-HM (ICLR 2026).

Resources:

Paper (OpenReview): https://openreview.net/forum?id=bEejbORUI5
Reproducibility notes: docs/reproducibility.md
Environment setup: docs/environment_setup.md

Updates

[2026] ExPO-HM accepted at ICLR 2026.
[2026] Open-source release with GRPO and SFT training pipelines.

Overview

ExPO-HM provides two training lanes:

scripts/grpo/: GRPO/CDE scripts (data prep, training, merge, eval) on top of verl
sft/: supervised fine-tuning via LLaMA-Factory

Environment Setup

Use two separate conda environments:

verl for GRPO (verl)
expohm-sft for SFT (LLaMA-Factory)

Setup guide:

docs/environment_setup.md

Quick Start

Run all commands from repo root.

1) Prepare GRPO Data

bash scripts/grpo/data_prep/generate_hatefulmemes.sh
bash scripts/grpo/data_prep/generate_cde_all.sh

2) Run GRPO Training

bash scripts/grpo/train/run_qwen2_5_vl-7b-baseline.sh
bash scripts/grpo/train/run_qwen2_5_vl-7b_cde_paper.sh

3) Evaluate

bash scripts/grpo/eval/fb_inference_grpo.sh
python3 eval/judge_reasoning/llm_judge_eval.py --help

Project Structure

scripts/grpo/: GRPO data prep, training, merge, and eval entrypoints
scripts/grpo/train/: GRPO training entrypoints
scripts/sft/: SFT entry scripts
data/gt/: source metadata
data/image/: source images
data/verl/: generated parquet for GRPO
eval/: inference and reasoning judge code
docs/: setup, layout, and reproducibility documents

For detailed layout and data conventions:

docs/repo_layout.md
docs/data_layout.md
docs/README.md

Reproducibility

Reproducibility mapping is documented in:

docs/reproducibility.md

Citation

If this repository helps your research, please cite:

@inproceedings{
EXPOHM2026Mei,
title={Ex{PO}-{HM}: Learning to Explain-then-Detect for Hateful Meme Detection},
author={Jingbiao Mei and Mingsheng Sun and Jinghong Chen and Pengda Qin and Yuhong Li and Da Chen and Bill Byrne},
booktitle={The Fourteenth International Conference on Learning Representations},
year={2026},
url={https://openreview.net/forum?id=bEejbORUI5}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ExPO-HM: Learning to Explain-then-Detect for Hateful Meme Detection

Updates

Table of Contents

Overview

Environment Setup

Quick Start

1) Prepare GRPO Data

2) Run GRPO Training

3) Evaluate

Project Structure

Reproducibility

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
data/gt		data/gt
docs		docs
eval		eval
paper		paper
scripts		scripts
sft		sft
verl		verl
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Folders and files

Latest commit

History

Repository files navigation

ExPO-HM: Learning to Explain-then-Detect for Hateful Meme Detection

Updates

Table of Contents

Overview

Environment Setup

Quick Start

1) Prepare GRPO Data

2) Run GRPO Training

3) Evaluate

Project Structure

Reproducibility

Citation

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages