| Trustworthy Graceful Degradation: Fault Tolerance across Service Boundaries | SREcon21 | Daniel Rodgers-Pryor |
| A Principled Approach to Monitoring Streaming Data Infrastructure at Scale | SREcon21 | Eric Schow, Praveen Yedidi |
| Watching the Watchers: Generating Absent Alerts for Prometheus | SREcon21 | Nick Spain |
| SLX: An Extended SLO Framework to Expedite Incident Recovery | SREcon21 | Qian Ding, Xuan Zhang |
| MySQL and InnoDB Performance for the Rest of Us | SREcon21 | Shaun O'Keefe |
| Need for SPEED: Site Performance Efficiency, Evaluation and Decision | SREcon21 | Kingsum Chow, Zhihao Chang |
| Take Me Down to the Paradise City Where the Metric Is Green and Traces Are Pretty | SREcon21 | Ricardo Ferreira |
| Elephant in the Blameless War Room—Accountability | SREcon21 | Christina Tan, Emily Arnott |
| Latency Distributions and Micro-Benchmarking to Identify and Characterize Kernel Hotspots | SREcon21 | Danny Chen |
| What To Do When SRE is Just a New Job Title? | SREcon21 | Benjamin Bütikofer |
| Sparking Joy for Engineers with Observability | SREcon21 | Zac Delagrange |
| Let the Chaos Begin—SRE Chaos Engineering Meets Cybersecurity | SREcon21 | Francesco Sbaraglia, Adriana Petrich |
| Spike Detection in Alert Correlation at LinkedIn | SREcon21 | Nishant Singh |
| 10 Lessons Learned in 10 Years of SRE | SREcon21 | Andrea Spadaccini |
| What's the Cost of a Millisecond? | SREcon21 | Avishai Ish-Shalom |
| Of Mice & Elephants | SREcon21 | Koon Seng Lim, Sandeep Hooda |
| When Systems Flatline—Enhancing Incident Response with Learnings from the Medical Field | SREcon21 | Sarah Butt |
| Panel: Observability | SREcon21 | Daria Barteneva, Liz Fong-Jones, Gabe Wishnie, Štěpán Davidovic, Richard Waid, Partha Kanuparthy |
| Panel: Engineering Onboarding | SREcon21 | Daria Barteneva, Jennifer Petoff, Anne Hamilton, Sandi Friend, Ilse White |
| Achieving Mutual TLS: Secure Pod-to-Pod Communication Without the Hassle | SREcon20 Americas | Mark Hahn, Thomas Hahn |
| Automatically Detect the Top Performance & Scalability Issues in Distributed Architectures | SREcon20 Americas | Andreas Grabner |
| The Evolution of Traffic Routing in a Streaming World | SREcon20 Americas | Abhishek Srikanth |
| Latency and Availability Error Budgets Done Right at Scale | SREcon20 Americas | Fred Moyer |
| 9 Years of Failure: How Racing Crappy Cars Made Me a Better SRE | SREcon20 Americas | Ryan Doherty |
| The Smallest Possible SRE Team | SREcon20 Americas | Zach Thomas |