GitHub - hustvl/PixelHacker: PixelHacker: Image Inpainting with Structural and Semantic Consistency

PixelHacker: Image Inpainting with Structural and Semantic Consistency

SOTA performance on Places2, CelebA-HQ, and FFHQ & Superior structural and semantic consistency

Ziyang Xu¹, Kangsheng Duan¹, Xiaolei Shen², Zhifeng Ding², Wenyu Liu¹, Xiaohu Ruan²,
Xiaoxin Chen², Xinggang Wang^{1 📧}

(^📧) Corresponding Author.

¹ Huazhong University of Science and Technology. ² VIVO AI Lab.

🌟Highlights

Latent Categories Guidance (LCG): Simple yet effective inpainting paradigm with superior structural and semantic consistency. Let's advance inpainting research to challenge more complex scenarios!
PixelHacker: Diffusion-based inpainting model trained with LCG, outperforming SOTA performance across multiple natural-scene (Places2) and human-face (CelebA-HQ, and FFHQ) benchmarks!
Comprehensive SOTA Performance：
- Places2 (Natural Scene)
  - Evaluated at 512 resolution using 10k test set images with 40-50% masked regions, PixelHacker achieved the best performance with FID 8.59 and LPIPS 0.2026.
  - Evaluated at 512 resolution using 36.5k validation set images with large and small mask settings, PixelHacker achieved the best performance on FID (large: 2.05, small: 0.82) and U-IDS (large:36.07, small:42.21), and the second best performance on LPIPS (large:0.169, small:0.088).
  - Evaluated at 256 and 512 resolutions using validation set images with a highly randomised masking strategy, PixelHacker achieved the best performance at 512 resolution with FID 5.75 and LPIPS 0.305, and the second best performance at 256 resolution with FID 9.25 and LPIPS 0.367.
- CelebA-HQ (Human-Face Scene)
  - Evaluated at 512 resolution, PixelHacker achieved the best performance with FID 4.75 and LPIPS 0.115.
- FFHQ (Human-Face Scene)
  - Evaluated at 256 resolution, PixelHacker achieved the best performance with FID 6.35 and LPIPS 0.229.

🔥Updates

May 1, 2025: 🔥 We have released the project page with 63+ demos on natural and human-face scenes. Have fun! 🤗
April 30, 2025: 🔥 We have released the arXiv paper for PixelHacker. The code and project page will be released soon.

🏕️Performance on Natural Scene

🤗Performance on Human-Face Scene

🎓Citation

@misc{xu2025pixelhacker,
      title={PixelHacker: Image Inpainting with Structural and Semantic Consistency}, 
      author={Ziyang Xu and Kangsheng Duan and Xiaolei Shen and Zhifeng Ding and Wenyu Liu and Xiaohu Ruan and Xiaoxin Chen and Xinggang Wang},
      year={2025},
      eprint={2504.20438},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2504.20438}, 
}

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
assets		assets
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PixelHacker: Image Inpainting with Structural and Semantic Consistency

🌟Highlights

🔥Updates

🏕️Performance on Natural Scene

🤗Performance on Human-Face Scene

🎓Citation

About

Releases

Packages

Contributors 2

License

hustvl/PixelHacker

Folders and files

Latest commit

History

Repository files navigation

PixelHacker: Image Inpainting with Structural and Semantic Consistency

🌟Highlights

🔥Updates

🏕️Performance on Natural Scene

🤗Performance on Human-Face Scene

🎓Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Packages