Build vs. Buy in the Midst of Armageddon

Monday, March 18, 2024 - 1:50 pm–2:35 pm

Reggie Davis


What happens when you've got a team of SREs who get downsized and smacked with an increasing workload? This is the story of my small team's wild ride of revamping our in-house incident tool.

Spoiler: It turned into a "make or buy" debate real quick!

Picture this: a team of 10 becomes a merry band of 3, drowning in a sea of issues, requests, and feedback. Unexpectedly, our team changed like a game of musical chairs, skills shuffling all over. In our pursuit to be the heroes, we hit the brakes and pitched buying a ready-made tool for a change. It wasn't just a business decision; it was a cultural rollercoaster for everyone involved! So, let's take a stroll down memory lane, exploring how we switched gears to evaluate third-party tools and pave the way for improving the reliability of our platform.

Reggie Davis

Reggie's a laid-back, jovial, and curious Senior SRE and Technical Lead for the Platform Core SRE team at Elastic. Focusing on service management, incident management, and operational excellence in cloud-native environments, Reggie enjoys working with leaders across the platform to develop processes that help service teams "shift left" on their reliability efforts. Outside of work, Reggie's an avid yogi, classic hip-hop vinyl collector, and a frequenter of coffee shops in whatever city he finds himself in.

