Building an End-to-End De-Identification Pipeline for Advertising Activity Data at LinkedIn

Monday, June 09, 2025 - 3:45 pm–4:05 pm

Saikrishna Badrinarayanan and Chris Harris, LinkedIn

Advertising platforms rely heavily on activity data to measure and optimize ads performance. With current privacy regulations and platform requirements, LinkedIn is held to increasingly rigorous standards in the handling of our members' personal data. This is acute for our ads business, as we adhere to strict regulations that necessitate stringent measures when handling user data, including data minimization, which is almost expanding into a global requirement. These regulations continue to evolve, requiring constant adaptation to new standards, while our data pipelines were originally established in a time when the use of personal data was less regulated.

Motivated by the principle of building privacy by design, we undertook a comprehensive project involving numerous stakeholders to address these challenges, and built an end-to-end robust pipeline that de-identifies advertising activity data. The goal of this project was to ensure that user information is protected while still enabling processing on this de-identified data to generate valuable analytics and enable advertisers to learn the effectiveness of their ad spend. We have onboarded products such as performance reporting and billing as the hero use-cases onto this pipeline. This talk will cover the design, implementation and innovative aspects of this pipeline. We will discuss the various privacy enhancing technologies we applied, our system architecture, challenges faced such as scalability (to process billions of events a day) and balancing privacy with the needs of the business. Finally, we will also highlight the outcomes and practical insights gained from this project.

Saikrishna Badrinarayanan is a Staff Privacy Engineer at LinkedIn. He has spent the last two years building privacy-preserving systems for problems in ads measurement and responsible AI. Before LinkedIn, he worked on privacy/security teams at Snap and Visa. He is a cryptographer by training and holds a PhD from UCLA. He is passionate about using privacy enhancing technologies to derive value from data while protecting user privacy.

Chris Harris is a Senior Staff Engineer at LinkedIn, where they have spent the past nine years working on ads measurement, privacy, and data governance. Passionate about hands-on coding and system performance optimization, they focus on building scalable, privacy-conscious solutions that balance business needs with user trust. Beyond engineering, they are committed to fostering a culture of doing the right thing—ensuring that privacy and integrity remain at the forefront of innovation.

BibTeX
@conference {306677,
author = {Saikrishna Badrinarayanan and Chris Harris},
title = {Building an {End-to-End} {De-Identification} Pipeline for Advertising Activity Data at {LinkedIn}},
year = {2025},
address = {Santa Clara, CA},
publisher = {USENIX Association},
month = jun
}

Presentation Video