Uncovering Access, Reuse, and Sharing Characteristics of I/O-Intensive Files on Large-Scale Production HPC Systems


Tirthak Patel, Northeastern University; Suren Byna, Glenn K. Lockwood, and Nicholas J. Wright, Lawrence Berkeley National Laboratory; Philip Carns and Robert Ross, Argonne National Laboratory; Devesh Tiwari, Northeastern University


Large-scale high-performance computing (HPC) applications running on supercomputers produce large amounts of data routinely and store it in files on multi-PB shared parallel storage systems. Unfortunately, storage community has a limited understanding of the access and reuse patterns of these files. This paper investigates the access and reuse patterns of I/O- intensive files on a production-scale supercomputer.

FAST '20 Open Access Sponsored by NetApp

Open Access Media

USENIX is committed to Open Access to the research presented at our events. Papers and proceedings are freely available to everyone once the event begins. Any video, audio, and/or slides that are posted after the event are also free and open to everyone. Support USENIX and our commitment to Open Access.

@inproceedings {246180,
author = {Tirthak Patel and Suren Byna and Glenn K. Lockwood and Nicholas J. Wright and Philip Carns and Robert Ross and Devesh Tiwari},
title = {Uncovering Access, Reuse, and Sharing Characteristics of {I/O-Intensive} Files on {Large-Scale} Production {HPC} Systems},
booktitle = {18th USENIX Conference on File and Storage Technologies (FAST 20)},
year = {2020},
isbn = {978-1-939133-12-0},
address = {Santa Clara, CA},
pages = {91--101},
url = {https://www.usenix.org/conference/fast20/presentation/patel-hpc-systems},
publisher = {USENIX Association},
month = feb

Presentation Video