{Crash-Only} Software

George Candea; Armando Fox

Crash-Only Software

Crash-only programs crash safely and recover quickly. There is only one way to stop such software—by crashing it—and only one way to bring it up—by initiating recovery. Crash-only systems are built from crash-only components, and the use of transparent component-level retries hides intra-system component crashes from end users. In this paper we advocate a crash-only design for Internet systems, showing that it can lead to more reliable code, easier failure prevention, and faster, more effective recovery. We present ideas on how to build such crash-only Internet services, taking successful techniques to their logical extreme.

George Candea, Stanford University

Armando Fox, Stanford University

BibTeX

@inproceedings {270286,
author = {George Candea and Armando Fox},
title = {{Crash-Only} Software},
booktitle = {9th Workshop on Hot Topics in Operating Systems (HotOS IX)},
year = {2003},
address = {Lihue, HI},
url = {https://www.usenix.org/conference/hotos-ix/crash-only-software},
publisher = {USENIX Association},
month = may
}

Download

Links

Paper:

http://www.usenix.org/events/hotos03/tech/full_papers/candea/candea.pdf

Paper (HTML):

http://www.usenix.org/events/hotos03/tech/full_papers/candea/candea_html/index.html

USENIX Conference Policies

Crash-Only Software

George Candea, Stanford University

Armando Fox, Stanford University

Links