Search results

    Title | Conference | Speaker(s)
    Case Study: Lessons Learned from Our First Worldwide Outage | SREcon17 Europe | Yoav Cohen
    Building Shopify's PaaS on Kubernetes | SREcon18 Americas | Karan Thukral
    Error Budgets and Risks | SREcon15 | Marc Alvidrez
    Distributed Consensus Algorithms for Extreme Reliability | SREcon15 Europe | Laura Nolan
    Automated Troubleshooting of Live Site Issues | SREcon17 Asia | Sriram Srinivasan
    Bootstrapping an SRE Team: Effecting Culture Change and Leveraging Diverse Skill Sets | SREcon18 Americas | Aaron Wieczorek
    Antics, Drift, and Chaos | SREcon18 Americas | Lorin Hochstein
    Architecting a Technical Post Mortem | SREcon18 Americas | Will Gallego
    Comprehensive Container-Based Service Monitoring with Kubernetes and Istio | SREcon18 Asia | Fred Moyer
    Leading without Managing: Becoming an SRE Technical Leader | SREcon19 Asia/Pacific | Todd Palino
    A Fresh Look at Operational Debt | SREcon22 Americas | David Owczarek
    Dark Sky Camping: Reducing Alert Pollution with Modern Observability Practices | SREcon22 Americas | Kristin Smith
    Lifecycle of Reusable Automations: Track, Maintain, Deprecate | SREcon22 Asia/Pacific | Renisha Fernandes, Bharat P
    Implementing SRE in a Telco with Reliability Enhancing Procedures | SREcon23 Europe/Middle East/Africa | Florian Kammermann, Romain Bonjour
    Lessons from Unix History | SREcon24 Europe/Middle East/Africa | Diomidis Spinellis
    SRE Saga: The Song of Heroes and Villains | SREcon24 Europe/Middle East/Africa | Daria Barteneva
    Mean Time to WTF: Why Developer Experience Frameworks Belong in Your Incident Retrospectives | SREcon26 Americas | Nicole Forsgren
    Panel: Engineering Onboarding | SREcon21 | Daria Barteneva, Jennifer Petoff, Anne Hamilton, Sandi Friend, Ilse White
    MLOps 2025: A Journey into the Past and the Future | SREcon25 Europe/Middle East/Africa | Alejandro Saucedo
    Gaining Insights from a Black Box System | SREcon25 Europe/Middle East/Africa | Thiara Ortiz
    Auto-Instrumentation for GPU Performance using eBPF | SREcon25 Europe/Middle East/Africa | Nikola Grcevski
    Open-Falcon: A Distributed and High-Performance Monitoring System | SREcon17 Asia | Yao-Wei Ou, Wei Lai
    SRE's Critical Role in the COVID-19 Pandemic Response in Government | SREcon23 Americas | Amy Quispe, Marc Alvidrez, Rick Hawes
    Alerting for Distributed Systems—A Tale of Symptoms and Causes, Signals and Noise | SREcon16 Europe | Björn Rabenstein
    Autopsy of a MySQL Automation Disaster | SREcon19 Europe/Middle East/Africa | Jean-François Gagné
