"Disorganizing" Your SRE Organization

Tuesday, June 01, 2021 - 9:45 am10:30 am

Leonid Belkind, StackPulse

Abstract: 

More than a year ago, COVID-19 has presented us with a challenge on how to establish and grow our SRE practice under new conditions of working from home (while doubling and then tripling our team). Instead of trying to treat the situation in a "business as usual" manner, we embarked on a journey to reorganize our SRE practices, tools, and culture. In this talk, we will share the assumptions we started the process with, our learnings during the process, and the state we ended in, after more than a year of a change.

Leonid Belkind, CTO - StackPulse

Leonid Belkind is a Co-Founder and CTO at StackPulse, a Site Reliability Engineering orchestration platform. Prior to StackPulse, Leonid co-founded (and was CTO of) Luminate where he guided this enterprise-grade service from inception to widespread Fortune 500 adoption to acquisition by Symantec. Before Luminate, Leonid managed software development organizations at CheckPoint.

Through his career, Leonid has witnessed modern Software Engineering practices come and replace the traditional ones, first around Continuous Integration and Delivery pipelines, then Infrastructure Management and Monitoring, and onwards as software services have replaced on-premise products. Throughout this journey Leonid has become passionate about building reliability-first architectures, methodologies, and organizational culture.

BibTeX
@conference {272791,
author = {Leonid Belkind},
title = {"Disorganizing" Your {SRE} Organization},
year = {2021},
address = {Anaheim, CA},
publisher = {USENIX Association},
month = jun
}

Presentation Video