Availability—Thinking beyond 9s

Wednesday, June 12, 2019 - 9:55 am–10:15 am

Kumar Srinivasamurthy, Bing, Microsoft Corp


It's easy and convenient to build metrics at the service level, but such metrics often hide a wide array of issues that users might face. Having the right metrics is a key component of building a sustainable SRE culture.

In this talk, you will learn:

  • How to measure availability for your product, not just a service
  • How to think beyond just 9s
  • Common pitfalls for a beginner engineer
  • Mistakes in metric calculations
  • Some examples of issues faced by our product and lessons learnt

Kumar works at Microsoft and is currently a Group Engineering Manager for the Bing Team. For the last several years, he has focused on building reliable, high-scale systems, working on availability, performance, capacity engineering, online safety, data mining, and metrics, and educating teams on how to build services that run at scale.

