{CertPHash}: Towards Certified Perceptual Hashing via Robust Training

Yuchen Yang; Qichang Liu; Christopher Brix; Huan Zhang; Yinzhi Cao

Yuchen Yang and Qichang Liu, The Johns Hopkins University; Christopher Brix, RWTH Aachen University; Huan Zhang, University of Illinois at Urbana–Champaign; Yinzhi Cao, The Johns Hopkins University

Perceptual hashing (PHash) systems—e.g., Apple's NeuralHash, Microsoft's PhotoDNA, and Facebook's PDQ—are widely employed to screen illicit content. Such systems generate hashes of image files and match them against a database of known hashes linked to illicit content for filtering. One important drawback of PHash systems is that they are vulnerable to adversarial perturbation attacks leading to hash evasion or collision. It is desirable to bring provable guarantees to PHash systems to certify their robustness under evasion or collision attacks. However, to the best of our knowledge, there are no existing certified PHash systems, and more importantly, the training of certified PHash systems is challenging because of the unique definition of model utility and the existence of both evasion and collision attacks.

In this paper, we propose CertPHash, the first certified PHash system with robust training. CertPHash includes three different optimization terms, anti-evasion, anti-collision, and functionality. The anti-evasion term establishes an upper bound on the hash deviation caused by input perturbations, the anti-collision term sets a lower bound on the distance between a perturbed hash and those from other inputs, and the functionality term ensures that the system remains reliable and effective throughout robust training. Our results demonstrate that CertPHash not only achieves non-vacuous certification for both evasion and collision with provable guarantees but is also robust against empirical attacks. Furthermore, CertPHash demonstrates strong performance in real-world illicit content detection tasks.

Open Access Media

USENIX is committed to Open Access to the research presented at our events. Papers and proceedings are freely available to everyone once the event begins. Any video, audio, and/or slides that are posted after the event are also free and open to everyone. Support USENIX and our commitment to Open Access.

BibTeX

@inproceedings {308050,
author = {Yuchen Yang and Qichang Liu and Christopher Brix and Huan Zhang and Yinzhi Cao},
title = {{CertPHash}: Towards Certified Perceptual Hashing via Robust Training},
booktitle = {34th USENIX Security Symposium (USENIX Security 25)},
year = {2025},
isbn = {978-1-939133-52-6},
address = {Seattle, WA},
pages = {7839--7856},
url = {https://www.usenix.org/conference/usenixsecurity25/presentation/yang-yuchen},
publisher = {USENIX Association},
month = aug
}

Download

Yang PDF

Yang Appendix PDF

Yang (Prepublication) PDF

CertPHash: Towards Certified Perceptual Hashing via Robust Training

Open Access Media