Serving Heterogeneous Machine Learning Models on Multi-GPU Servers with Spatio-Temporal Sharing

TitleServing Heterogeneous Machine Learning Models on Multi-GPU Servers with Spatio-Temporal Sharing
Publication TypeConference Paper
Year of Publication2022
AuthorsChoi S, Lee S, Kim Y, Park J, Kwon Y, Huh J
Conference Name2022 USENIX Annual Technical Conference (USENIX ATC 22)
Date Published07/2022
PublisherUSENIX Association
Conference LocationCarlsbad, CA
ISBN Number978-1-939133-29-53
URLhttps://www.usenix.org/conference/atc22/presentation/choi-seungbeom