{BTrDB}: Optimizing Storage System Design for Timeseries Processing

Michael P Andersen; David E. Culler

help promote

FAST '17 CFP

Get
Help Promote graphics!

USENIX Conference Policies

BTrDB: Optimizing Storage System Design for Timeseries Processing

Michael P Andersen and David E. Culler, University of California, Berkeley

The increase in high-precision, high-sample-rate telemetry timeseries poses a problem for existing timeseries databases which can neither cope with the throughput demands of these streams nor provide the necessary primitives for effective analysis of them. We present a novel abstraction for telemetry timeseries data and a data structure for providing this abstraction: a time-partitioning version-annotated copy-on-write tree. An implementation in Go is shown to outperform existing solutions, demonstrating a throughput of 53 million inserted values per second and 119 million queried values per second on a four-node cluster. The system achieves a 2.9x compression ratio and satisfies statistical queries spanning a year of data in under 200ms, as demonstrated on a year-long production deployment storing 2.1 trillion data points. The principles and design of this database are generally applicable to a large variety of timeseries types and represent a significant advance in the development of technology for the Internet of Things.

Michael P Andersen, University of California, Berkeley

David E. Culler, University of California, Berkeley

Open Access Media

USENIX is committed to Open Access to the research presented at our events. Papers and proceedings are freely available to everyone once the event begins. Any video, audio, and/or slides that are posted after the event are also free and open to everyone. Support USENIX and our commitment to Open Access.

BibTeX

@inproceedings {194398,
author = {Michael P Andersen and David E. Culler},
title = {{BTrDB}: Optimizing Storage System Design for Timeseries Processing},
booktitle = {14th USENIX Conference on File and Storage Technologies (FAST 16)},
year = {2016},
isbn = {978-1-931971-28-7},
address = {Santa Clara, CA},
pages = {39--52},
url = {https://www.usenix.org/conference/fast16/technical-sessions/presentation/andersen},
publisher = {USENIX Association},
month = feb
}

help promote

USENIX Conference Policies

BTrDB: Optimizing Storage System Design for Timeseries Processing

Michael P Andersen, University of California, Berkeley

David E. Culler, University of California, Berkeley

Open Access Media

Presentation Audio

Gold Sponsors

Silver Sponsors

Bronze Sponsors

Media Sponsors & Industry Partners

Open Access Publishing Partners

sponsors

help promote

USENIX Conference Policies

BTrDB: Optimizing Storage System Design for Timeseries Processing

Michael P Andersen, University of California, Berkeley

David E. Culler, University of California, Berkeley

Open Access Media

Presentation Audio

Gold Sponsors

Silver Sponsors

Bronze Sponsors

Media Sponsors & Industry Partners

Open Access Publishing Partners