Azure SREBot: More than a Chatbot—an Intelligent Bot to Crush Mitigation Time

Tuesday, May 23, 2017 - 4:20pm4:45pm

Cezar Guimaraes, Microsoft


Azure SREBot is more than just a Chat Bot. Azure SREBot is a knowledgeable and intelligent engine that replaces tribal knowledge and automates incident response activities. It is also extensible, allowing other teams to add their own knowledge.

In this talk you will hear how SREBot is being developed and used to reduce the Time to Mitigate (TTM) Azure incidents. We will explain how it was designed and the share the main issues we are facing.

Cezar Guimaraes, Microsoft

Cezar Guimaraes is a Site Reliability Engineer Lead on the Microsoft Azure team. He has more than 15 year of experience and has worked at Microsoft for 11 years as a Software Engineer. Currently he is working on Azure to identify and resolve problems that stand in the way of service uptime through engineering solutions such as bots and intelligence/correlation engines.

Open Access Media

USENIX is committed to Open Access to the research presented at our events. Papers and proceedings are freely available to everyone once the event begins. Any video, audio, and/or slides that are posted after the event are also free and open to everyone. Support USENIX and our commitment to Open Access.

@conference {202775,
author = {Cezar Guimaraes},
title = {Azure {SREBot}: More than a {Chatbot{\textemdash}an} Intelligent Bot to Crush Mitigation Time},
year = {2017},
publisher = {USENIX Association},
month = may

Presentation Video 

Presentation Audio