Quake: Adaptive Indexing for Vector Search

Jason Mohoney, Devesh Sarda, and Mengze Tang, University of Wisconsin–Madison; Shihabur Rahman Chowdhury and Anil Pacaci, Apple; Ihab F. Ilyas, University of Waterloo; Theodoros Rekatsinas, Apple; Shivaram Venkataraman, University of Wisconsin–Madison

Vector search, the task of finding the k-nearest neighbors of a query vector against a database of high-dimensional vectors, underpins many machine learning applications, including retrieval-augmented generation, recommendation systems, and information retrieval. However, existing approximate nearest neighbor (ANN) methods perform poorly under dynamic and skewed workloads where data distributions evolve. We introduce Quake, an adaptive indexing system that maintains low latency and high recall in such environments. Quake employs a multi-level partitioning scheme that adjusts to updates and changing access patterns, guided by a cost model that predicts query latency based on partition sizes and access frequencies. Quake also dynamically sets query execution parameters to meet recall targets using a novel recall estimation model. Furthermore, Quake utilizes NUMA-aware intra-query parallelism for improved memory bandwidth utilization during search. To evaluate Quake, we prepare a Wikipedia vector search workload and develop a workload generator to create vector search workloads with configurable access patterns. Our evaluation shows that on dynamic workloads, Quake achieves query latency reductions of 1.5–38× and update latency reductions of 4.5–126× compared to state-of-the-art indexes such as SVS, DiskANN, HNSW, and SCANN.

OSDI '25 Open Access Sponsored by
King Abdullah University of Science and Technology (KAUST)

Open Access Media

USENIX is committed to Open Access to the research presented at our events. Papers and proceedings are freely available to everyone once the event begins. Any video, audio, and/or slides that are posted after the event are also free and open to everyone. Support USENIX and our commitment to Open Access.

BibTeX
@inproceedings {308702,
author = {Jason Mohoney and Devesh Sarda and Mengze Tang and Shihabur Rahman Chowdhury and Anil Pacaci and Ihab F. Ilyas and Theodoros Rekatsinas and Shivaram Venkataraman},
title = {Quake: Adaptive Indexing for Vector Search},
booktitle = {19th USENIX Symposium on Operating Systems Design and Implementation (OSDI 25)},
year = {2025},
isbn = {978-1-939133-47-2},
address = {Boston, MA},
pages = {153--169},
url = {https://www.usenix.org/conference/osdi25/presentation/mohoney},
publisher = {USENIX Association},
month = jul
}

Presentation Video