Unconference: Developing Effective SRE Teams

Friday, 31 August, 2018 - 11:0012:40

Kurt Andersen, LinkedIn

Space is limited: add this to your schedule if you plan to attend.


Skill acquisition applies to both individuals and teams. For a team, it is helpful to understand where the team is currently executing and potential changes to move to greater levels of effectiveness. For convenience, we can usefully divide skill levels into five ranges: novice, advanced beginner, competent, proficient, and expert.

The precise agenda will be decided by the participants on the day, but the idea in this unconference is to take aspects of SRE practice (such as monitoring, measurement against SLOs, incident management and postmortems, managing toil, etc) and discuss what these look like at the different skill levels - not so much at an individual level as at an organizational one.

We would also like to discuss how attendees can gauge their own team and company's state and progress, and develop a plan for growing weak areas. This unconference session is a follow-up to Kurt's talk The Never-Ending Story of Site Reliability from SREcon17 Europe and a later version presented at SREcon Asia 2018 Characterizing and Phases of SRE Practice.

Kurt Andersen, LinkedIn

Kurt Andersen is one of the co-chairs for SREcon18Americas and has been active in the anti-abuse community for over 15 years. He is currently the senior IC for the Product SRE (site reliability engineering) team at LinkedIn. He also works as one of the Program Committee Chairs for the Messaging, Malware, and Mobile Anti-Abuse Working Group (M3AAWG.org). He has spoken at M3AAWG, Velocity, SREcon, and SANOG on various aspects of reliability, authentication, and security.

@conference {222413,
author = {Kurt Andersen},
title = {Unconference: Developing Effective {SRE} Teams},
year = {2018},
address = {Dusseldorf},
publisher = {USENIX Association},
month = aug,