Fast Cloud Storage for AI Jobs via Grouped I/O API with Transparent Read/Write Optimizations

Yingyi Hao, Shanghai Jiao Tong University; Ting Yao, Huawei Cloud; Xingda Wei, Dingyan Zhang, and Tianle Sun, Shanghai Jiao Tong University; Yiwen Zhang, Zhiyong Fu, and Huatao Wu, Huawei Cloud; Rong Chen, Shanghai Jiao Tong University

The emergence of AI workloads has placed rigorous bandwidth requirements on cloud storage, which are challenging to meet due to inherent hardware restrictions in cost-efficient disaggregated storage architectures, as well as the non-triviality of implementing application-tailored optimizations.

This paper presents AITURBO, a cloud storage system for AI jobs with high bandwidth demands. AITURBO first utilizes the high-bandwidth compute fabric between accelerators to meet AI applications’ bandwidth demands without incurring additional storage cost. AITURBO further introduces a simple yet powerful grouped I/O API that allows AITURBO to automatically derive optimized read and write plans at the storage layer. These plans enable optimizations that are comparable or better than application-level ones, because they capture common I/O patterns in AI workloads and have a holistic view from the storage layer’s perspective. Under common AI workloads such as checkpoint reads and writes and KV-cache reads, AITURBO achieves comparable or better performance than state-of-the-art systems, with and without application-level optimizations, including systems such as Megatron, Gemini, and Mooncake, typically with minimal application-level code changes. AITURBO has been deployed in training jobs in HUAWEI’s production cloud to support efficient training workloads.

FAST '26 Open Access Sponsored by
NetApp

Open Access Media

USENIX is committed to Open Access to the research presented at our events. Papers and proceedings are freely available to everyone once the event begins. Any video, audio, and/or slides that are posted after the event are also free and open to everyone. Support USENIX and our commitment to Open Access.

BibTeX
@inproceedings {315963,
author = {Yingyi Hao and Ting Yao and Xingda Wei and Dingyan Zhang and Tianle Sun and Yiwen Zhang and Zhiyong Fu and Huatao Wu and Rong Chen},
title = {Fast Cloud Storage for {AI} Jobs via Grouped {I/O} {API} with Transparent {Read/Write} Optimizations},
booktitle = {24th USENIX Conference on File and Storage Technologies (FAST 26)},
year = {2026},
isbn = {978-1-939133-53-3},
address = {Santa Clara, CA},
pages = {255--270},
url = {https://www.usenix.org/conference/fast26/presentation/hao},
publisher = {USENIX Association},
month = feb
}

Presentation Video