Graphite@Scale or How to Store Millions of Metrics per Second

Monday, May 22, 2017 - 2:05pm-3:00pm

Vladimir Smirnov, System Administrator, Booking.com

Abstract: 

This is a story about dealing with metrics at scale. A lot of metrics.

This is our story of the challenges we’ve faced at Booking.com and how we made our Graphite system handle millions of metrics per second.

You will learn about one of the most heavily loaded Graphite-compatible stacks, the problems it poses, and the challenges of maintaining it and scaling it further, pushing Graphite to its limits and beyond.

System Administrators and SREs who are interested in monitoring and scalability will find this useful.

Vladimir Smirnov, Booking.com

I've dealt with large-scale systems design and administration in IT for over 6 years. For the last 1.5 years I've been working at Booking.com, specializing in scaling our Graphite stack and improving its reliability and performance.

We at Booking.com have hundreds of backend servers and hundreds of TB of data, and our Graphite stack handles millions of metrics per second. The volume is already enormous and keeps growing.


BibTeX
@conference {202737,
author = {Vladimir Smirnov},
title = {{Graphite@Scale} or How to Store Millions of Metrics per Second},
year = {2017},
publisher = {USENIX Association},
month = may
}
