Donate
Log In
Menu
Toggle menu visibility
About
About
About Us
Our Board of Directors
Board Meeting Minutes
Board Elections
Updates & Announcements
Our Staff
Governance & Financials
Lifetime Achievement Award
Events
Events
Upcoming
Past
Conference FAQ
Conference Policies
Code of Conduct
Calls for Papers
Author Resources
Grant Opportunities
Best Papers
Test of Time Awards
Join & Support
Join & Support
Become a Member
Ways to Give
Our Supporters
Student Opportunities
Sponsorship Opportunities
Archive
Archive
Proceedings
Multimedia
;login: Archive
Short Topics in System Administration Series
Journal of Education in System Administration (JESA)
Journal of Election Technology and Systems (JETS)
Computing Systems Journal
Search
Multimedia
Enter terms
Retain current filters
Search results
Title
Conference
Speaker(s)
From Thundering Herd to Zero Outages: Building Reliable Inventory Sync
SREcon26 Americas
Rushikesh Shashank Ghatpande
Three Lies We Tell Ourselves about Disaster Recovery and What to Do about Them
SREcon26 Americas
Colette Alexander
AI Agents for Incident Investigation: The Good, The Bad, and The Ugly
SREcon26 Americas
Vladyslav Budichenko
The Critical Resource Is You: Practical Destressing for On-Call Engineers
SREcon26 Americas
Beth Adele Long
Escaping Version Skew: Formalizing Compatibility in a World of Partial Rollouts
SREcon26 Americas
Robbie Ostrow
Stop Reading Changelogs: Safer Kubernetes Upgrades with Simulation
SREcon26 Americas
David Morrison
The Case of the Misnamed Cities: CAST Analysis of a Google Maps Incident
SREcon26 Americas
Ruben Barroso
When the Cure Is Worse than the Disease: Metastability in Recovery
SREcon26 Americas
Todd Porter, Aleksey Charapko
Beyond Loss and Accuracy: Closing the Observability Gaps in AI Training with TrainCheck
SREcon26 Americas
Yuxuan Jiang, Ryan Huang
From Chaos to Confidence: How SREs Can Leverage 50 (and Counting) Failure Scenarios to Test AI Readiness
SREcon26 Americas
Rohan Arora, Bhavya
Resilient Observability at the Retail Edge: A Lightweight, Scalable, and Cost-Efficient Framework
SREcon26 Americas
Prakash Velusamy
Epistemology of Incidents and Problem Solving
SREcon26 Americas
Jack Kingsman
So You Want a New Incident Commander—Lessons from Building Incident Response Teams
SREcon26 Americas
Vanessa Huerta Granda
Human Factors in the Age of AI Ops: Re-Engineering Trust between Humans and Machines
SREcon26 Americas
Edward Redick
It's Not Always the Network (But Here's How to Prove It): Kubernetes Packet Capture for SREs
SREcon26 Americas
Mitsuhiro Shibuya
The Unconspicuous Role of Conntrack in Kubernetes Networking
SREcon26 Americas
Ricard Bejarano
Observability for LLMs: Understanding What’s Happening Under the Hood
SREcon26 Americas
Salman Munaf
Precision Over Proliferation: SRE Approach for Leaner, Smarter and Data-Driven Observability
SREcon26 Americas
Md Shaghil
Lightning Talks
SREcon26 Americas
How Security Incidents Are Different ... and How They're Exactly the Same
SREcon26 Americas
Laura de Vesine, Alec Randazzo
How We Built Protosockets to Go Beyond HTTP and gRPC
SREcon26 Americas
Pratik Agarwal
Operationalizing Key Management for Regulatory Compliance and Emergency Response
SREcon26 Americas
Swetha Srinivasan
Unlock High-Frequency Deployments without Blowing Up Prometheus
SREcon26 Americas
Ganesh Vernekar
The Gashlycrumb Tinies of AI Networking You Must Know (or Languish!)
SREcon26 Americas
Lerna Ekmekcioglu
Reliability Engineering for Hybrid Robot-Cloud Systems
SREcon26 Americas
Jeff Corpuz, Rian Bogle
Pages
« first
‹ previous
…
40
41
42
43
44
45
46
47
48
next ›
last »
Printable Calendar
|
Google Calendar