SREBot—More Than a Chatbot—An Intelligent Bot to Crush Mitigation Time

Wednesday, November 01, 2017 - 2:00 pm2:30 pm

Cezar Alevatto Guimaraes, Microsoft

Abstract: 

SREBot is a knowledgeable and intelligent engine that replaces tribal knowledge and automates incident management activities. It is also extensible, allowing other teams to add their own knowledge. In this talk you will hear how SREBot is being developed and used to reduce the Time to Mitigate (TTM) Microsoft incidents. We will explain how it was designed and then share the main issues we are facing.

Cezar Alevatto Guimaraes, Microsoft

Cezar Guimaraes is a Site Reliability Engineer Lead on the Microsoft Azure team. He has more than 15 years of experience and has worked at Microsoft for 11 years as a Software Engineer. Currently, he is working on Azure to identify and resolve problems that stand in the way of service uptime through engineering solutions such as bots and intelligence/correlation engines.

Open Access Media

USENIX is committed to Open Access to the research presented at our events. Papers and proceedings are freely available to everyone once the event begins. Any video, audio, and/or slides that are posted after the event are also free and open to everyone. Support USENIX and our commitment to Open Access.

BibTeX
@conference {207191,
author = {Cezar Alevatto Guimaraes},
title = {{SREBot{\textemdash}More} Than a {Chatbot{\textemdash}An} Intelligent Bot to Crush Mitigation Time},
year = {2017},
address = {San Francisco, CA},
publisher = {USENIX Association},
month = oct
}

Presentation Video 

Presentation Audio