AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving

TitleAlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving
Publication TypeConference Paper
Year of Publication2023
AuthorsLi Z, Zheng L, Zhong Y, Liu V, Sheng Y, Jin X, Huang Y, Chen Z, Zhang H, Gonzalez JE, Stoica I
Conference Name17th USENIX Symposium on Operating Systems Design and Implementation (OSDI 23)
Date Published07/2023
PublisherUSENIX Association
Conference LocationBoston, MA
ISBN Number978-1-939133-34-2
URLhttps://www.usenix.org/conference/osdi23/presentation/li-zhouhan