Multimedia
Search results
Title | Conference | Speaker(s)
Observability in the Cambrian Stack Era | SREcon17 Americas | Charity Majors
Traps and Cookies | SREcon17 Americas | Tanya Reilly
Panel: Training New SREs (continued) | SREcon17 Americas | Ruth Grace Wong, Katie Ballinger, Saravanan Loganathan, Rita Lu, Craig Sebenik, Andrew Widdowson
So You Want to Be a Wizard | SREcon17 Americas | Julia Evans
Reliability When Everything Is a Platform: Why You Need to SRE Your Customers | SREcon17 Americas | Dave Rensin
Next Generation of DevOps: AIOps in Practice @Baidu | SREcon17 Asia | Xianping Qu, Jingjing Ha
How Could Small Teams Get Ready for SRE | SREcon17 Asia | Zehua Liu
Focal Impact: The Service Pyramid | SREcon17 Asia | Michael Elkin
Smart Monitoring System for Anomaly Detection on Business Trends in Alibaba | SREcon17 Asia | Zhaogang Wang
Merou: A Decentralized, Audited Authorization Service | SREcon17 Asia | Luke Faraone
Graphite@Scale or How to Store Millions of Metrics per Second | SREcon17 Asia | Vladimir Smirnov
Open-Falcon: A Distributed and High-Performance Monitoring System | SREcon17 Asia | Yao-Wei Ou, Wei Lai
Talking to an OpenStack Cluster in Plain English | SREcon17 Asia | Wei Xu
A Scheduling Framework For Large-Scale Based on Ansible | SREcon17 Asia | AiZhen Chen
Draining the Flood—A Combat against Alert Fatigue | SREcon17 Asia | Yu Chen
Reliable Launches at Scale | SREcon17 Asia | Sebastian Kirsch
Didi: How to Provide a Reliable Ridesharing Service | SREcon17 Asia | Ming Hua, Lin Tan
Measuring the Success of Incident Management at Atlassian | SREcon17 Asia | Gerry Millar
Good, Better, Best, Mobile User Experience | SREcon17 Asia | Fred Wu
Event Correlation: A Fresh Approach towards Reducing MTTR | SREcon17 Asia | Renjith Rajan, Rajneesh
"A Unit Test Would Have Caught This:" Small, Cheap, and Effective Testing for Production Engineers | SREcon17 Asia | Andrew Ryan
Automated Troubleshooting of Live Site Issues | SREcon17 Asia | Sriram Srinivasan
Testing for DR Failover Testing | SREcon17 Asia | Zehua Liu
Accept Partial Failures, Minimize Service Loss | SREcon17 Asia | Daxin Wang
Data Checking at Dropbox | SREcon17 Asia | David Mah