Quantifying Reidentification Risk for ML Models

Tuesday, June 10, 2025 - 2:30 pm2:50 pm

Nitin Agrawal and Laura Book, Snap Inc.

Machine learning models, in particular classification models, are used across a wide spectrum of products and applications. These models may be susceptible to attacks like model inversion and attribute inference attacks that could allow for reconstruction of the training data and re-indentification of the data subjects. However, not all models are attackable: well generalized ones specifically are less prone to memorize their training data, and privacy preserving techniques can be used to help ensure training is generalized rather than memorized. However, a key challenge at an industrial scale lies in identifying the attackability of a model as well as calibrating the need for privacy mitigations. Academic literature has established an order relationship between attacks, demonstrating that membership inference attacks are a precursor to the reconstruction and re-identification of training data. In this talk we'll discuss a mechanism to repurpose those attacks into a practical quantifiable metric for ML model attackability measurement. This could be critical in ensuring model privacy and ongoing monitoring of the model in the model deployment lifecycle.

Nitin Agrawal is currently a Privacy Engineer at Snap Inc., focussing on privacy validation, AI privacy, and data classification. Previously, he worked as an Applied Scientist for Alexa Privacy at Amazon. He holds a Ph.D. in Computer Science from the University of Oxford, where his research focused on advancing techniques for effective and equitable privacy-preserving machine learning.

Laura Book is a Privacy Engineer at Snap Inc., where she is currently focusing on validating privacy adherence across the product. Previously, she worked at Google as a software engineer with a focus on monetization, privacy and data governance. She holds a PhD in Physics from the California Institute of Technology.

BibTeX
@conference {306709,
author = {Nitin Agrawal and Laura Book},
title = {Quantifying Reidentification Risk for {ML} Models},
year = {2025},
address = {Santa Clara, CA},
publisher = {USENIX Association},
month = jun
}