Yoshinori Matsunobu, Meta
Operating online databases that serve user-facing workloads at scale is challenging. What can you do to prevent database outages? When incidents happen, how can you find root causes and mitigate them quickly?
Databases can fail for various reasons, and you need to find the root causes, such as offending queries or diffs, and mitigate quickly. Understanding database reliability and performance practices helps: knowing the common causes of database outages, indexing, and how the query optimizer works.
A database has another obvious, but hard to guarantee, requirement: it should not lose data or return wrong data. Users will treat the system as "unreliable" if database consistency or correctness is lost. How can we continuously verify that our data is correct in production?
In this session, the speaker will show several database reliability and correctness issues that may happen in production. For each issue, the speaker will explain which workarounds can help to debug or mitigate it, and will share common performance and reliability practices.
Yoshinori Matsunobu is a Production Engineer at Meta, specializing in online databases in production. Yoshinori has over 20 years of experience in the database industry, mainly with MySQL and, more recently, RocksDB. Yoshinori created several essential open source products and operated them in production, such as MyRocks, quickstack, and MHA. Yoshinori has spoken at many conferences and shared database practices with communities. Yoshinori has received multiple database industry awards, such as Lifetime Open Source Database Contributor from Percona (2022) and an Honorable Mention from VLDB for the MyRocks paper (2020).
author = {Yoshinori Matsunobu},
title = {Online Database Reliability, Performance, and Consistency Engineering},
year = {2022},
address = {Sydney},
publisher = {USENIX Association},
month = dec
}