Taking Control of Metrics Growth and Cardinality: Tips for Maximizing Your Observability Function

Note: Presentation times are in Coordinated Universal Time (UTC).

Friday, 15 October, 2021 - 03:45–04:15

Rob Skillington, Chronosphere

Abstract: 

As companies transition to cloud-native architectures, the volume of metrics data being produced is growing exponentially, and SRE teams are being forced to adapt to these increased demands, including finding ways to limit or control the cardinality of metrics. As this growth continues, it's critical that cloud-native companies (and their SRE teams) find ways to manage it sustainably and reliably.

During this session, Rob will discuss best practices and tips for efficiently taming metrics data growth and cardinality at scale. He will also share KPIs and metrics, proven at scale, to keep in mind when running, maintaining, and growing a world-class observability function. The session focuses on real-life examples from leaders and engineers across the observability space, and the audience will leave with a better understanding of how to apply these lessons with their existing SRE resources, including some ways to track and measure these efforts.

Rob Skillington is the co-founder and CTO of Chronosphere. He was previously at Uber, where he was the technical lead of the Observability team and creator of M3DB, the time series database at the core of M3.

He has worked at both large engineering organizations, such as Microsoft and Groupon, and a handful of startups. He and his family are based in NYC, where he mainly spends weekends exploring all of New York's playgrounds and following his wife's jazz adventures.

SREcon21 Open Access Sponsored by Indeed

BibTeX
@conference {276767,
author = {Rob Skillington},
title = {Taking Control of Metrics Growth and Cardinality: Tips for Maximizing Your Observability Function},
year = {2021},
publisher = {USENIX Association},
month = oct
}
