From Thundering Herd to Zero Outages: Building Reliable Inventory Sync

Tuesday, March 24, 2026 - 2:40 pm3:25 pm

Rushikesh Ghatpande, Broadcom Inc

Managing accurate inventory across distributed infrastructure is critical for security policy enforcement and operational reliability. Enterprise datacenter software requires centralized policy management across thousands of servers and hundreds of thousands of VMs and containers, yet resources are distributed across multiple data centers.

This talk shares a battle-tested inventory synchronization protocol that evolved over 6 years of production experience, handling real-world challenges from thundering herd problems during full datacenter restarts to fairness in queue processing. The protocol uses a 5-stage finite state machine to ensure reliable, consistent inventory sync while preventing system overload.

You'll learn how we evolved from a naive 3-step process to a robust 5-stage protocol, how we solved the thundering herd problem, ensured fairness in queue processing, and separated connection establishment from application readiness. We'll share empirical analysis that led to specific timeout values and demonstrate how bidirectional communication patterns eliminated message ordering complexity.

This protocol has been validated at scale across 10,000+ servers with zero customer escalations over 4 years. The patterns are immediately applicable to any distributed state synchronization challenge - whether managing VMs, containers, network devices, or any distributed resources.

Rushikesh Ghatpande is a Principal Engineer at Broadcom, where he leads critical architectural initiatives in network virtualization and security.

With over a decade of experience in distributed systems and software engineering, he has risen from a new graduate to a principal engineer through his work on VMware's flagship NSX product.

His expertise spans software architecture, cross-team engineering leadership, and building highly available, planet-scale systems. Beyond his engineering contributions, Rushikesh has published research papers and filed multiple patents in the networking and virtualization space, reflecting his passion for innovation in cloud infrastructure and distributed systems.

BibTeX
@conference {316258,
author = {Rushikesh Shashank Ghatpande},
title = {From Thundering Herd to Zero Outages: Building Reliable Inventory Sync},
year = {2026},
address = {Seattle, WA},
publisher = {USENIX Association},
month = mar
}

Presentation Video