This is the official repository for ExPO-HM (ICLR 2026).
Resources:
- Paper (OpenReview): https://openreview.net/forum?id=bEejbORUI5
- Reproducibility notes:
docs/reproducibility.md - Environment setup:
docs/environment_setup.md
- [2026] ExPO-HM accepted at ICLR 2026.
- [2026] Open-source release with GRPO and SFT training pipelines.
ExPO-HM provides two training lanes:
scripts/grpo/: GRPO/CDE scripts (data prep, training, merge, eval) on top ofverlsft/: supervised fine-tuning viaLLaMA-Factory
Use two separate conda environments:
verlfor GRPO (verl)expohm-sftfor SFT (LLaMA-Factory)
Setup guide:
docs/environment_setup.md
Run all commands from repo root.
bash scripts/grpo/data_prep/generate_hatefulmemes.sh
bash scripts/grpo/data_prep/generate_cde_all.shbash scripts/grpo/train/run_qwen2_5_vl-7b-baseline.sh
bash scripts/grpo/train/run_qwen2_5_vl-7b_cde_paper.shbash scripts/grpo/eval/fb_inference_grpo.sh
python3 eval/judge_reasoning/llm_judge_eval.py --helpscripts/grpo/: GRPO data prep, training, merge, and eval entrypointsscripts/grpo/train/: GRPO training entrypointsscripts/sft/: SFT entry scriptsdata/gt/: source metadatadata/image/: source imagesdata/verl/: generated parquet for GRPOeval/: inference and reasoning judge codedocs/: setup, layout, and reproducibility documents
For detailed layout and data conventions:
docs/repo_layout.mddocs/data_layout.mddocs/README.md
Reproducibility mapping is documented in:
docs/reproducibility.md
If this repository helps your research, please cite:
@inproceedings{
EXPOHM2026Mei,
title={Ex{PO}-{HM}: Learning to Explain-then-Detect for Hateful Meme Detection},
author={Jingbiao Mei and Mingsheng Sun and Jinghong Chen and Pengda Qin and Yuhong Li and Da Chen and Bill Byrne},
booktitle={The Fourteenth International Conference on Learning Representations},
year={2026},
url={https://openreview.net/forum?id=bEejbORUI5}
}