Skip to main content
Back to USENIX
  • Conferences
  • Students
Sign in
  • Home
  • Attend
    • Registration Information
    • Registration Discounts
    • Venue, Hotel, and Travel
    • Co-Located Workshops
  • Program
  • Sponsorship
  • About
    • Summit Organizers
    • Services
    • Questions
    • Past Summits
  • Home
  • Attend
    • Registration Information
    • Registration Discounts
    • Students and Grants
    • Venue, Hotel, and Travel
    • Oakland Dining Guide
  • Activities
  • Program
    • At a Glance
    • Technical Sessions
  • Participate
    • Call for Papers
    • Call for Posters
    • Instructions for Participants
  • Sponsorship
  • About
    • Symposium Organizers
    • Past Symposia
    • Questions?
    • Help Promote!

sponsors

Gold Sponsor
Silver Sponsor
Silver Sponsor
Silver Sponsor
Bronze Sponsor
Bronze Sponsor
Bronze Sponsor
Bronze Sponsor
Bronze Sponsor
General Sponsor
General Sponsor
Media Sponsor
Media Sponsor
Media Sponsor
Media Sponsor
Media Sponsor
Media Sponsor
Media Sponsor
Media Sponsor
Media Sponsor
Media Sponsor
Industry Partner

help promote

NSDI '15 button

Get more
Help Promote graphics!

USENIX Conference Policies

  • Event Code of Conduct
  • Conference Network Policy
  • Statement on Environmental Responsibility Policy

CubicRing: Enabling One-Hop Failure Detection and Recovery for Distributed In-Memory Storage Systems

Yiming Zhang, National University of Defense Technology; Chuanxiong Guo, Microsoft; Dongsheng Li and Rui Chu, National University of Defense Technology; Haitao Wu, Microsoft; Yongqiang Xiong, Microsoft Research

In-memory storage has the benefits of low I/O latency and high I/O throughput. Fast failure recovery is cru- cial for large-scale in-memory storage systems, bringing network-related challenges including false detection due to transient network problems, traffic congestion during the recovery, and top-of-rack switch failures. This paper presents CubicRing, a distributed structure for cube- based networks which exploits network proximity to restrict failure detection and recovery within the small- est possible one-hop range. We leverage the Cubic- Ring structure to address the aforementioned challenges and design a network-aware in-memory key-value store called MemCube. In a 64-node 10GbE testbed, Mem- Cube recovers 48 GB of data for a single server failure in 3.1 seconds. The 14 recovery servers achieve 123.9 Gb/sec aggregate recovery throughput, which is 88.5% of the ideal aggregate bandwidth.

Yiming Zhang, National University of Defense Technology

Chuanxiong Guo, Microsoft

Dongsheng Li, National University of Defense Technology

Rui Chu, National University of Defense Technology

Haitao Wu, Microsoft

Yongqiang Xiong, Microsoft Research Asia

Open Access Media

USENIX is committed to Open Access to the research presented at our events. Papers and proceedings are freely available to everyone once the event begins. Any video, audio, and/or slides that are posted after the event are also free and open to everyone. Support USENIX and our commitment to Open Access.

BibTeX
@inproceedings {189022,
author = {Yiming Zhang and Chuanxiong Guo and Dongsheng Li and Rui Chu and Haitao Wu and Yongqiang Xiong},
title = {{CubicRing}: Enabling {One-Hop} Failure Detection and Recovery for Distributed {In-Memory} Storage Systems},
booktitle = {12th USENIX Symposium on Networked Systems Design and Implementation (NSDI 15)},
year = {2015},
isbn = {978-1-931971-218},
address = {Oakland, CA},
pages = {529--542},
url = {https://www.usenix.org/conference/nsdi15/technical-sessions/presentation/zhang},
publisher = {USENIX Association},
month = may
}
Download
Zhang PDF
View the slides

Presentation Video 

Presentation Audio

MP3 Download

Download Audio

  • Log in or register to post comments

Gold Sponsors

Silver Sponsors

Bronze Sponsors

General Sponsors

Media Sponsors & Industry Partners

© USENIX
EIN 13-3055038

  • Privacy Policy
  • Contact Us