Biblio

Export 3 results:
Filters: Author is Zhuohan Li  [Clear All Filters]
2024
Sheng Y, Cao S, Li D, Zhu B, Li Z, Zhuo D, Gonzalez JE, Stoica I.  2024.  Fairness in Serving Large Language Models. 18th USENIX Symposium on Operating Systems Design and Implementation (OSDI 24). :965--988.
2023
Li Z, Zheng L, Zhong Y, Liu V, Sheng Y, Jin X, Huang Y, Chen Z, Zhang H, Gonzalez JE et al..  2023.  AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving. 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI 23). :663--679.
2022
Zheng L, Li Z, Zhang H, Zhuang Y, Chen Z, Huang Y, Wang Y, Xu Y, Zhuo D, Xing EP et al..  2022.  Alpa: Automating Inter- and Intra-Operator Parallelism for Distributed Deep Learning. 16th USENIX Symposium on Operating Systems Design and Implementation (OSDI 22). :559--578.