Dan Shoop is a Systems Reliability Engineering Manager with over 30 years of experience building distributed systems and infrastructure that are performant, highly available, scalable and fault-tolerant. He was at HBO, solving Internet problems at Game of Thrones scale, when the Google SRE book came out. He and his team realized they had actually been SREs for most of their engineering careers, so they embraced the new paradigm to restructure their production operations teams and focused heavily on telemetry and observability as a critical component for understanding, measuring, monitoring and alerting on key indicators of system health as they relate to service impact and architectural improvements. He went on to lead SRE at Venmo and has also worked at Sesame Street and United States Technical Services, and has operated his own consulting company. Having taken Edward Tufte's course on Information Visualization three times, he recognizes and enjoys sharing techniques for improving the cognition and presentation of our observability data in terms of SLI/SLO narratives, and for enhancing our engineering retrospectives and reports. He lives in New York City and enjoys good food, mountaineering, photography and UAVs.
Dan Shoop