Okapi: Decoupling Data Striping and Redundancy Grouping in Cluster File Systems

Sanjith Athlur and Timothy Kim, Carnegie Mellon University; Saurabh Kadekodi, Google; Francisco Maturana and Xavier Ramos, Carnegie Mellon University; Arif Merchant, Google; K. V. Rashmi and Gregory R. Ganger, Carnegie Mellon University

The Okapi cluster file system decouples how data is spread across disks (data striping) for IO efficiency from how data is erasure coded together (redundancy grouping) for durability. Existing systems couple these two mechanisms’ configurations, inducing significant inefficiencies. Decoupling allows grouping to be configured based on reliability and space efficiency goals, while simultaneously allowing striping to be configured based on performance goals. Decoupling also allows redundancy scheme changes from one EC scheme to another (e.g., to react to data temperature or disk failure rate changes) to occur without having to re-write data. Evaluation of an Okapi prototype shows that decoupling can be accomplished with <1% increase in metadata size and file manager memory, and minimal file creation and degraded read resource increase. Experiments demonstrate that decoupling can improve read throughput by 80% and reduce seeks per second by up to 70%, without yielding any data reliability, and reduce the overhead of redundancy transitions by up to 70%.

OSDI '25 Open Access Sponsored by
King Abdullah University of Science and Technology (KAUST)

Open Access Media

USENIX is committed to Open Access to the research presented at our events. Papers and proceedings are freely available to everyone once the event begins. Any video, audio, and/or slides that are posted after the event are also free and open to everyone. Support USENIX and our commitment to Open Access.

BibTeX
@inproceedings {308794,
author = {Sanjith Athlur and Timothy Kim and Saurabh Kadekodi and Francisco Maturana and Xavier Ramos and Arif Merchant and K. V. Rashmi and Gregory R. Ganger},
title = {Okapi: Decoupling Data Striping and Redundancy Grouping in Cluster File Systems},
booktitle = {19th USENIX Symposium on Operating Systems Design and Implementation (OSDI 25)},
year = {2025},
isbn = {978-1-939133-47-2},
address = {Boston, MA},
pages = {897--914},
url = {https://www.usenix.org/conference/osdi25/presentation/athlur},
publisher = {USENIX Association},
month = jul
}

Presentation Video