PD3: Prefetching Data with DPUs for Disaggregated Memory

Sidharth Sankhe, Felix Zhang, and Umayrah Chonee, University of Toronto; Sherman Lim, National University of Singapore; Jiasheng Hu, University of Toronto; Jialin Li, National University of Singapore; Qizhen Zhang, University of Toronto

We introduce PD3, a memory disaggregation solution that "avoids" cache misses on compute servers, and thus all their associated overhead, by prefetching. Unlike a traditional prefetcher, which may pollute the cache with false positives or miss preloading opportunities through false negatives, PD3 prevents mispredictions with network support and minimal yet critical application information. PD3 is enabled by data processing units (DPUs), which allow (1) parsing user requests before they are processed by the compute server, (2) fetching data from remote memory on the shortest path, (3) offloading expensive RDMA and DMA operations from the host, and (4) incorporating application knowledge to faithfully predict cache misses and act on those predictions. Designing PD3 requires reconciling DPU resource constraints with the scaling requirements of cloud data systems, as well as achieving high efficiency through a range of performance optimizations. Our experimental results on real hardware, applications, and workloads show that, with nominal compute-local memory, PD3 eliminates the performance gap between memory-disaggregated applications and their monolithic counterparts.
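The request-path idea in the abstract can be illustrated with a small simulation. This is only a toy sketch under stated assumptions, not PD3's implementation: the names `RemoteMemory`, `LocalCache`, `ToyDpu`, `Host`, `keys_of`, and `run` are all hypothetical, the "application knowledge" is reduced to a function mapping a request to the keys it will touch, and RDMA, DMA, and timing are not modelled at all. It shows only that a component which sees each request before the host can fill compute-local memory so the host's own lookups do not miss.

```python
from collections import OrderedDict


class RemoteMemory:
    """Stand-in for the memory server; counts how often it is read."""

    def __init__(self, data):
        self.data = dict(data)
        self.fetches = 0

    def read(self, key):
        self.fetches += 1
        return self.data[key]


class LocalCache:
    """Small LRU cache modelling the limited compute-local memory."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.entries = OrderedDict()

    def get(self, key):
        if key in self.entries:
            self.entries.move_to_end(key)  # mark as most recently used
            return self.entries[key]
        return None

    def put(self, key, value):
        self.entries[key] = value
        self.entries.move_to_end(key)
        while len(self.entries) > self.capacity:
            self.entries.popitem(last=False)  # evict least recently used


class ToyDpu:
    """Hypothetical DPU model: inspects a request before the host does."""

    def __init__(self, remote, cache, keys_of):
        self.remote = remote
        self.cache = cache
        self.keys_of = keys_of  # assumed application hint: request -> keys

    def forward(self, request):
        # Prefetch every key the request is known to touch, then pass it on.
        for key in self.keys_of(request):
            if self.cache.get(key) is None:
                self.cache.put(key, self.remote.read(key))
        return request


class Host:
    """Compute server: serves from the local cache, fetching on a miss."""

    def __init__(self, remote, cache):
        self.remote = remote
        self.cache = cache
        self.misses = 0

    def handle(self, request):
        key = request["key"]
        value = self.cache.get(key)
        if value is None:
            self.misses += 1  # host stalls on a remote fetch
            value = self.remote.read(key)
            self.cache.put(key, value)
        return value


def run(requests, use_dpu):
    """Replay requests and return (results, host-side cache misses)."""
    remote = RemoteMemory({f"k{i}": i * 10 for i in range(8)})
    cache = LocalCache(capacity=2)
    host = Host(remote, cache)
    dpu = ToyDpu(remote, cache, keys_of=lambda r: [r["key"]])
    results = []
    for request in requests:
        if use_dpu:
            request = dpu.forward(request)
        results.append(host.handle(request))
    return results, host.misses
```

In this toy, the remote reads still happen when the DPU is enabled; they are simply issued before the host handles the request, so the host never observes a miss. Whether that translates into lower latency depends on the prefetch completing off the host's critical path, which this sketch does not measure.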

NSDI '26 Open Access Sponsored by
King Abdullah University of Science and Technology (KAUST)


BibTeX
@inproceedings {316698,
author = {Sidharth Sankhe and Felix Zhang and Umayrah Chonee and Sherman Lim and Jiasheng Hu and Jialin Li and Qizhen Zhang},
title = {{PD3}: Prefetching Data with {DPUs} for Disaggregated Memory},
booktitle = {23rd USENIX Symposium on Networked Systems Design and Implementation (NSDI 26)},
year = {2026},
isbn = {978-1-939133-54-0},
address = {Renton, WA},
pages = {1207--1223},
url = {https://www.usenix.org/conference/nsdi26/presentation/sankhe},
publisher = {USENIX Association},
month = may
}
