Tailwind: Fast and Atomic RDMA-based Replication


Yacine Taleb, Univ Rennes, Inria, CNRS, IRISA; Ryan Stutsman, University of Utah; Gabriel Antoniu, Univ Rennes, Inria, CNRS, IRISA; Toni Cortes, BSC, UPC


Replication is essential for fault-tolerance. However, in in-memory systems, it is a source of high overhead. Remote direct memory access (RDMA) is attractive to create redundant copies of data, since it is low-latency and has no CPU overhead at the target. However, existing approaches still result in redundant data copying and active receivers. To ensure atomic data transfers, receivers check and apply only fully received messages. Tailwind is a zero-copy recovery-log replication protocol for scale-out in-memory databases. Tailwind is the first replication protocol that eliminates {\em all} CPU-driven data copying and fully bypasses target server CPUs, thus leaving backups idle. Tailwind ensures all writes are atomic by leveraging a protocol that detects incomplete RDMA transfers. Tailwind substantially improves replication throughput and response latency compared with conventional RPC-based replication. In symmetric systems where servers both serve requests and act as replicas, Tailwind also improves normal-case throughput by freeing server CPU resources for request processing. We implemented and evaluated Tailwind on RAMCloud, a low-latency in-memory storage system. Experiments show Tailwind improves RAMCloud's normal-case request processing throughput by 1.7$\times$. It also cuts down writes median and 99\textsuperscript{th} percentile latencies by 2x and 3x respectively.

Open Access Media

USENIX is committed to Open Access to the research presented at our events. Papers and proceedings are freely available to everyone once the event begins. Any video, audio, and/or slides that are posted after the event are also free and open to everyone. Support USENIX and our commitment to Open Access.

@inproceedings {216051,
author = {Yacine Taleb and Ryan Stutsman and Gabriel Antoniu and Toni Cortes},
title = {Tailwind: Fast and Atomic {RDMA-based} Replication},
booktitle = {2018 USENIX Annual Technical Conference (USENIX ATC 18)},
year = {2018},
isbn = {978-1-939133-01-4},
address = {Boston, MA},
pages = {851--863},
url = {https://www.usenix.org/conference/atc18/presentation/taleb},
publisher = {USENIX Association},
month = jul

Presentation Audio