DistCache: Provable Load Balancing for Large-Scale Storage Systems with Distributed Caching

Authors: 

Zaoxing Liu and Zhihao Bai, Johns Hopkins University; Zhenming Liu, College of William and Mary; Xiaozhou Li, Celer Network; Changhoon Kim, Barefoot Networks; Vladimir Braverman and Xin Jin, Johns Hopkins University; Ion Stoica, UC Berkeley

Best Paper at FAST '19: Link to Paper

Abstract: 

Load balancing is critical for distributed storage to meet strict service-level objectives (SLOs). It has been shown that a fast cache can guarantee load balancing for a clustered storage system. However, when the system scales out to multiple clusters, the fast cache itself would become the bottleneck. Traditional mechanisms like cache partition and cache replication either result in load imbalance between cache nodes or have high overhead for cache coherence.

We present DistCache, a new distributed caching mechanism that provides provable load balancing for large-scale storage systems. DistCache co-designs cache allocation with cache topology and query routing. The key idea is to partition the hot objects with independent hash functions between cache nodes in different layers, and to adaptively route queries with the power-of-two-choices. We prove that DistCache enables the cache throughput to increase linearly with the number of cache nodes, by unifying techniques from expander graphs, network flows, and queuing theory. DistCache is a general solution that can be applied to many storage systems. We demonstrate the benefits of DistCache by providing the design, implementation, and evaluation of the use case for emerging switch-based caching.

BibTeX
@inproceedings {238417,
author = {Zaoxing Liu and Zhihao Bai and Zhenming Liu and Xiaozhou Li and Changhoon Kim and Vladimir Braverman and Xin Jin and Ion Stoica},
title = {{DistCache}: Provable Load Balancing for {Large-Scale} Storage Systems with Distributed Caching},
booktitle = {2019 USENIX Annual Technical Conference (USENIX ATC 19)},
year = {2019},
address = {Renton, WA},
url = {https://www.usenix.org/conference/atc19/presentation/liu},
publisher = {USENIX Association},
month = jul
}

Presentation Video