Availability—Thinking beyond 9s

Wednesday, June 12, 2019 - 9:55 am–10:15 am

Kumar Srinivasamurthy, Bing, Microsoft Corp

Abstract: 

Metrics are easy and convenient to build at the service level, but they often hide a wide array of issues that users might face. Having the right metrics is a key component of building a sustainable SRE culture.

In this talk, you will learn:

  • How to measure availability for your product, not just a service
  • How to think beyond just 9s
  • Common pitfalls for a beginner engineer
  • Mistakes in metric calculations
  • Some examples of issues faced by our product and lessons learnt

Kumar Srinivasamurthy, Bing, Microsoft Corp

Kumar works at Microsoft and is currently a Group Engineering Manager for the Bing Team. For the last several years, he has focused on building reliable, high-scale systems, as well as on availability, performance, capacity engineering, online safety, data mining, metrics, and educating teams on how to build services that run at scale.

BibTeX
@conference {233311,
author = {Kumar Srinivasamurthy},
title = {Availability{\textemdash}Thinking beyond 9s},
year = {2019},
address = {Singapore},
publisher = {{USENIX} Association},
}