SRE Classroom, Or, How to Design a Distributed System in 3 Hours

Wednesday, 2018, August 29 - 14:0017:30

Salim Virji, Fabian Geisberger, and Jean Joswig, Google

Space is limited: add this to your schedule if you plan to attend.

Abstract: 

This workshop ties together academic and practical aspects of systems engineering, with an emphasis on applying principles of systems design to a production service. We will analyze the service to quantify its performance, and iteratively improve the design. Participants will work together in small groups to sketch out the design, identify components and their relationships, and to assess the suitability of the design to the system’s Service Level Objective (SLO).

Participants will have a system design and bill of materials at the conclusion of this workshop.

Participants will not need laptops or specific coding experience; participants will need enthusiasm for collaborating in small groups, and for discussion-based problem-solving. Participants will come away with an understanding of the principles of iterative systems engineering, popularly known as “Non-abstract large systems design.”

This workshop covers material critical for SRE, an increasingly-broad field that combines software engineering and systems design.

Fabian Geisberger, Google

Fabian is a Site Reliability Engineer at Google in New York, where he currently works on monitoring systems. He previously worked on the Ganeti SRE team, the Production Monitoring team, and several other Google services. Fabian received a Masters (Diploma) in Computer Science from the Karlsruhe Institute of Technology (KIT), Germany, in 2012.

Open Access Media

USENIX is committed to Open Access to the research presented at our events. Papers and proceedings are freely available to everyone once the event begins. Any video, audio, and/or slides that are posted after the event are also free and open to everyone. Support USENIX and our commitment to Open Access.

BibTeX
@inproceedings {218873,
author = {Salim Virji and Fabian Geisberger and Jean Joswig},
title = {{SRE} Classroom, Or, How to Design a Distributed System in 3 Hours},
booktitle = {SREcon18 Europe/Middle East/Africa (SREcon18 Europe)},
year = {2018},
address = {Dusseldorf},
url = {https://www.usenix.org/node/218874},
publisher = {{USENIX} Association},
}