ParaSync: Exploiting Fine-Grained Parallelism for Efficient File Synchronization

Zhihao Zhang, NICE Lab, Xiamen University; and Alibaba Cloud; Lu Tang, NICE Lab, Xiamen University; Huiba Li, Alibaba Cloud; Yue Yu, Sun Yat-sen University; Guangtao Xue, Shanghai Jiao Tong University; Jiwu Shu, Tsinghua University; Yiming Zhang, Shanghai Jiao Tong University and NICE Lab, Xiamen University

File synchronization (sync) based on Content-Defined Chunking (CDC) is gaining increasing importance for data migration over networks owing to its effectiveness in detecting and eliminating duplicate data within synchronized files. CDC-based sync schemes typically comprise three phases: file chunking, chunk matching, and delta reconstruction. Unfortunately, existing sync schemes fail to exploit parallelism inherent in these phases due to two dependencies: a sequential bottleneck in chunking, where checksums are computed only after boundaries are finalized, and rigid client-server stalls that serialize matching and reconstruction. This paper presents ParaSync, a novel CDC-based file sync scheme that breaks these dependencies to exploit fine-grained parallelism. First, ParaSync’s multi-threaded chunking algorithm reduces checksum computation to a lightweight combination step, decoupling it from boundary identification while preserving invariability. Second, ParaSync designs a streaming chunk matching method that removes the all-or-nothing exchange dependency on both the client and the server sides. Finally, ParaSync introduces an efficient absolute-offset-based pipelined delta reconstruction process that maximizes the overlap between network and disk I/O operations. We have done extensive experiments over both WANs and LANs using diverse real-world datasets. The results show that compared to the state-of-the-art file sync schemes, ParaSync achieves up to 7.6× speedup for file chunking and significantly improves the overall sync performance by up to 3.7×, while maintaining a consistent level of network traffic.

FAST '26 Open Access Sponsored by
NetApp

Open Access Media

USENIX is committed to Open Access to the research presented at our events. Papers and proceedings are freely available to everyone once the event begins. Any video, audio, and/or slides that are posted after the event are also free and open to everyone. Support USENIX and our commitment to Open Access.

BibTeX
@inproceedings {315977,
author = {Zhihao Zhang and Lu Tang and Huiba Li and Yue Yu and Guangtao Xue and Jiwu Shu and Yiming Zhang},
title = {{ParaSync}: Exploiting {Fine-Grained} Parallelism for Efficient File Synchronization},
booktitle = {24th USENIX Conference on File and Storage Technologies (FAST 26)},
year = {2026},
isbn = {978-1-939133-53-3},
address = {Santa Clara, CA},
pages = {477--492},
url = {https://www.usenix.org/conference/fast26/presentation/zhang-zhihao-parasync},
publisher = {USENIX Association},
month = feb
}

Presentation Video