High Availability Solution for Large Scale Database Systems

Friday, June 14, 2019 - 3:00 pm3:30 pm

Guowei Zeng, Baidu Inc


At Baidu, over 95% of the OLTP requests go to the MySQL database system, which processes over 100 billion queries each day, and supports important applications such as search engine, advertisements, finance, and so on. Its availability is therefore of paramount importance. As the master/slave architecture of MySQL, the database HA solution includes two key points: accurate failure detection and remediation maintaining data consistency. The accuracy of failure detection should be as high as possible because a false positive will activate the lossy slave promotion and a false negative could lead to disasters when the failure persists. The remediation procedure should fill the data gap to ensure data consistency.

This talk will introduce a complete solution for database HA. Specifically, we will share our successful experience at Baidu to fulfill different needs from applications, such as large scale clusters, or high consistency for financial systems.

Guowei Zeng, Baidu Inc

Guowei Zeng has been a Database Architect and Senior R&D Engineer in the DBA group of Baidu for seven years. He is now responsible for the technical infrastructure of the database management platform, and focuses on database high availability and resource management continuously.

Open Access Media

USENIX is committed to Open Access to the research presented at our events. Papers and proceedings are freely available to everyone once the event begins. Any video, audio, and/or slides that are posted after the event are also free and open to everyone. Support USENIX and our commitment to Open Access.

@conference {233329,
author = {Guowei Zeng},
title = {High Availability Solution for Large Scale Database Systems},
year = {2019},
address = {Singapore},
publisher = {USENIX Association},
month = jun

Presentation Video