Hao Huang, Yifeng Zhang, Yanqi Pan, Wen Xia, Xiangyu Zou, and Darong Yang, Harbin Institute of Technology, Shenzhen; Jubin Zhong and Hua Liao, Huawei Technologies Co., Ltd
Read-only compressed file systems have become increasingly popular in space-sensitive scenarios, such as IoT and Docker containers. To construct condensed images, they divide the data into blocks (e.g., 1 MB) and compress blocks separately. However, we observe that block-based compression cannot fully utilize the compression benefits due to the data mixture problem, while its performance issues hinder practical usage.
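The data mixture problem can be illustrated with a toy experiment. The sketch below is not RubikFS code: it assumes zlib as the compressor, a 4 KB block, and synthetic 2 KB chunks, and the helper names `compress_in_blocks` and `compressed_size` are hypothetical. It compresses the same four chunks twice, once with dissimilar chunks sharing a block and once with similar chunks placed together.

```python
import random
import zlib


def compress_in_blocks(data: bytes, block_size: int) -> list[bytes]:
    """Split data into fixed-size blocks and compress each block independently."""
    return [
        zlib.compress(data[off:off + block_size])
        for off in range(0, len(data), block_size)
    ]


def compressed_size(chunks: list[bytes], block_size: int) -> int:
    """Total compressed size when the chunks are laid out in the given order."""
    blocks = compress_in_blocks(b"".join(chunks), block_size)
    return sum(len(b) for b in blocks)


# Toy data (an assumption for illustration): two unrelated chunk "families".
# Chunks inside a family are identical; the families share nothing.
rng = random.Random(0)
CHUNK = 2048
family_a = bytes(rng.getrandbits(8) for _ in range(CHUNK))
family_b = bytes(rng.getrandbits(8) for _ in range(CHUNK))

mixed = [family_a, family_b, family_a, family_b]    # dissimilar chunks share a block
grouped = [family_a, family_a, family_b, family_b]  # similar chunks share a block

BLOCK = 2 * CHUNK  # each block holds exactly two chunks
mixed_size = compressed_size(mixed, BLOCK)
grouped_size = compressed_size(grouped, BLOCK)
print(f"mixed: {mixed_size} bytes, grouped: {grouped_size} bytes")
```

Because each block is compressed in isolation, redundancy between two chunks is only exploitable when both land in the same block, which is why the grouped layout comes out smaller here.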
We propose RubikFS, a sort-enhanced read-only file system. Our key idea is to solve data mixture by sorting and clustering similar data chunks at a file system-favored block granularity. This is achieved by a similarity sorter, which builds a similarity graph to measure the similarity of data chunks and clusters similar chunks by subgraph partitioning. Sorting can also group data with the same hotness to minimize read amplification. We then introduce an array of techniques, including a data grouper, a data chunker, and a hotness grouper, to implement a condensed and efficient RubikFS. Experiments suggest that, compared to existing read-only compressed file systems, RubikFS increases the compression ratio by up to 42.60% and reduces unnecessary reads by up to 70.70%.
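The idea of sorting chunks through a similarity graph can be sketched as follows. This is a minimal illustration under stated assumptions, not the paper's similarity sorter: similarity is taken to be the Jaccard index over 4-byte shingles, an edge is added at a hypothetical threshold of 0.5, and connected components stand in for subgraph partitioning. All function names are made up for the example.

```python
from itertools import combinations


def shingles(chunk: bytes, k: int = 4) -> set[bytes]:
    """Set of k-byte substrings, used here as a crude similarity feature."""
    return {chunk[i:i + k] for i in range(max(1, len(chunk) - k + 1))}


def jaccard(a: set, b: set) -> float:
    """Jaccard similarity of two feature sets (1.0 for two empty sets)."""
    union = len(a | b)
    return len(a & b) / union if union else 1.0


def build_similarity_graph(chunks, threshold=0.5):
    """Adjacency lists: an edge joins two chunks whose similarity >= threshold."""
    feats = [shingles(c) for c in chunks]
    graph = {i: [] for i in range(len(chunks))}
    for i, j in combinations(range(len(chunks)), 2):
        if jaccard(feats[i], feats[j]) >= threshold:
            graph[i].append(j)
            graph[j].append(i)
    return graph


def cluster(graph):
    """Connected components, a simple stand-in for subgraph partitioning."""
    seen, clusters = set(), []
    for start in graph:
        if start in seen:
            continue
        stack, component = [start], []
        seen.add(start)
        while stack:
            node = stack.pop()
            component.append(node)
            for nxt in graph[node]:
                if nxt not in seen:
                    seen.add(nxt)
                    stack.append(nxt)
        clusters.append(sorted(component))
    return clusters


# Two text-like chunks and two binary-like chunks, interleaved.
chunks = [
    b"GET /index.html HTTP/1.1",
    b"\x00\x01\x02\x03\x04\x05\x06\x07\x08\x09",
    b"GET /index.htm HTTP/1.1",
    b"\x00\x01\x02\x03\x04\x05\x06\x07\x08\x0a",
]
clusters = cluster(build_similarity_graph(chunks))
print(clusters)  # [[0, 2], [1, 3]]

# Emit chunks cluster by cluster so that similar chunks become neighbors.
sorted_chunks = [chunks[i] for group in clusters for i in group]
```

The all-pairs comparison is quadratic in the number of chunks, so a practical system would need a cheaper way to find candidate neighbors; the sketch only shows how a graph over chunks turns into a write order.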
FAST '26 Open Access sponsored by NetApp

author = {Hao Huang and Yifeng Zhang and Yanqi Pan and Wen Xia and Xiangyu Zou and Darong Yang and Jubin Zhong and Hua Liao},
title = {Towards Condensed and Efficient {Read-Only} File System via {Sort-Enhanced} Compression},
booktitle = {24th USENIX Conference on File and Storage Technologies (FAST 26)},
year = {2026},
isbn = {978-1-939133-53-3},
address = {Santa Clara, CA},
pages = {35--51},
url = {https://www.usenix.org/conference/fast26/presentation/huang},
publisher = {USENIX Association},
month = feb
}
