Biblio

Filters: Author is Wencong Xiao
2024
Sun B, Huang Z, Zhao H, Xiao W, Zhang X, Li Y, Lin W.  2024.  Llumnix: Dynamic Scheduling for Large Language Model Serving. 18th USENIX Symposium on Operating Systems Design and Implementation (OSDI 24). :173--191.
2022
Weng Q, Xiao W, Yu Y, Wang W, Wang C, He J, Li Y, Zhang L, Lin W, Ding Y.  2022.  MLaaS in the Wild: Workload Analysis and Scheduling in Large-Scale Heterogeneous GPU Clusters. 19th USENIX Symposium on Networked Systems Design and Implementation (NSDI 22). :945--960.
Jia X, Jiang L, Wang A, Xiao W, Shi Z, Zhang J, Li X, Chen L, Li Y, Zheng Z et al.  2022.  Whale: Efficient Giant Model Training over Heterogeneous GPUs. 2022 USENIX Annual Technical Conference (USENIX ATC 22). :673--688.
2021
Lim G, Ahn J, Xiao W, Kwon Y, Jeon M.  2021.  Zico: Efficient GPU Memory Sharing for Concurrent DNN Training. 2021 USENIX Annual Technical Conference (USENIX ATC 21). :161--175.
2020
Xiao W, Ren S, Li Y, Zhang Y, Hou P, Li Z, Feng Y, Lin W, Jia Y.  2020.  AntMan: Dynamic Scaling on GPU Clusters for Deep Learning. 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI 20). :533--548.
2018
Xiao W, Bhardwaj R, Ramjee R, Sivathanu M, Kwatra N, Han Z, Patel P, Peng X, Zhao H, Zhang Q et al.  2018.  Gandiva: Introspective Cluster Scheduling for Deep Learning. 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI 18). :595--610.
2017
Xiao W, Xue J, Miao Y, Li Z, Chen C, Wu M, Li W, Zhou L.  2017.  Tux²: Distributed Graph Computation for Machine Learning. 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI 17). :669--682.