OPTIMUSCLOUD: Heterogeneous Configuration Optimization for Distributed Databases in the Cloud

Authors: 

Ashraf Mahgoub and Alexander Michaelson Medoff, Purdue University; Rakesh Kumar, Microsoft; Subrata Mitra, Adobe Research; Ana Klimovic, Google Research; Somali Chaterji and Saurabh Bagchi, Purdue University

Abstract: 

Achieving cost and performance efficiency for cloud-hosted databases requires exploring a large configuration space, including the parameters exposed by the database along with the variety of VM configurations available in the cloud. Even small deviations from an optimal configuration have significant consequences on performance and cost. Existing systems that automate cloud deployment configuration can select near-optimal instance types for homogeneous clusters of virtual machines and for stateless, recurrent data analytics workloads. We show that to find optimal performance-per-$ cloud deployments for NoSQL database applications, it is important to (1) consider heterogeneous cluster configurations, (2) jointly optimize database and VM configurations, and (3) dynamically adjust configuration as workload behavior changes. We present OPTIMUSCLOUD, an online reconfiguration system that can efficiently perform such joint and heterogeneous configuration for dynamic workloads. We evaluate our system with two clustered NoSQL systems: Cassandra and Redis, using three representative workloads and show that OPTIMUSCLOUD provides 40% higher throughput/$ and 4.5× lower 99-percentile latency on average compared to state-of-the-art prior systems, CherryPick, Selecta, and SOPHIA.

Open Access Media

USENIX is committed to Open Access to the research presented at our events. Papers and proceedings are freely available to everyone once the event begins. Any video, audio, and/or slides that are posted after the event are also free and open to everyone. Support USENIX and our commitment to Open Access.

BibTeX
@inproceedings {254410,
author = {Ashraf Mahgoub and Alexander Michaelson Medoff and Rakesh Kumar and Subrata Mitra and Ana Klimovic and Somali Chaterji and Saurabh Bagchi},
title = {{OPTIMUSCLOUD}: Heterogeneous Configuration Optimization for Distributed Databases in the Cloud},
booktitle = {2020 USENIX Annual Technical Conference (USENIX ATC 20)},
year = {2020},
isbn = {978-1-939133-14-4},
pages = {189--203},
url = {https://www.usenix.org/conference/atc20/presentation/mahgoub},
publisher = {USENIX Association},
month = jul
}

Presentation Video