| Beyond Goldilocks Reliability | SREcon21 | Narayan Desai |
| SRE "Power Words"—the Lexicon of SRE as an Industry | SREcon21 | Dave O'Connor |
| Learning More from Complex Systems | SREcon21 | Andrew Hatch |
| User Uptime in Practice | SREcon21 | Anika Mukherji |
| Practical TLS Advice for Large Infrastructure | SREcon21 | Mark Hahn, Ted Hahn |
| Improving Observability in Your Observability: Simple Tips for SREs | SREcon21 | Dan Shoop |
| Evolution of Incident Management at Slack | SREcon21 | D. Brent Chapman |
| Automating Performance Tuning with Machine Learning | SREcon21 | Stefano Doni |
| Hacking ML into Your Organization | SREcon21 | Cathy Chen |
| SRE for ML: The First 10 Years and the Next 10 | SREcon21 | Todd Underwood |
| What If the Promise of AIOps Was True? | SREcon21 | Niall Murphy |
| Demystifying Machine Learning in Production: Reasoning about a Large-Scale ML Platform | SREcon21 | Mary McGlohon |
| How We Built Out Our SRE Department to Support over 100 Million Users for the World's 3rd Biggest Mobile Marketplace | SREcon21 | Sinéad O'Reilly |
| Horizontal Data Freshness Monitoring in Complex Pipelines | SREcon21 | Alexey Skorikov |
| Microservices above the Cloud—Designing the International Space Station for Reliability | SREcon21 | Robert Barron |
| Grand National 2021: Managing Extreme Online Demand at William Hill | SREcon21 | Matthew Berridge, Josh Allenby |
| DevOps Ten Years After: Review of a Failure with John Allspaw and Paul Hammond | SREcon21 | Thomas Depierre, John Allspaw, Paul Hammond |
| From 15,000 Database Connections to under 100—A Tech Debt Tale | SREcon21 | Sunny Beatteay |
| Let's Bring System Dynamics Back to CS! | SREcon21 | Marianne Bellotti |
| Trustworthy Graceful Degradation: Fault Tolerance across Service Boundaries | SREcon21 | Daniel Rodgers-Pryor |
| A Principled Approach to Monitoring Streaming Data Infrastructure at Scale | SREcon21 | Eric Schow, Praveen Yedidi |
| Watching the Watchers: Generating Absent Alerts for Prometheus | SREcon21 | Nick Spain |
| SLX: An Extended SLO Framework to Expedite Incident Recovery | SREcon21 | Qian Ding, Xuan Zhang |
| MySQL and InnoDB Performance for the Rest of Us | SREcon21 | Shaun O'Keefe |
| Need for SPEED: Site Performance Efficiency, Evaluation and Decision | SREcon21 | Kingsum Chow, Zhihao Chang |