Biblio
Filters: Author is Wencong Xiao
2024. Llumnix: Dynamic Scheduling for Large Language Model Serving. 18th USENIX Symposium on Operating Systems Design and Implementation (OSDI 24). pp. 173-191.
2022. MLaaS in the Wild: Workload Analysis and Scheduling in Large-Scale Heterogeneous GPU Clusters. 19th USENIX Symposium on Networked Systems Design and Implementation (NSDI 22). pp. 945-960.
2022. Whale: Efficient Giant Model Training over Heterogeneous GPUs. 2022 USENIX Annual Technical Conference (USENIX ATC 22). pp. 673-688.
2021. Zico: Efficient GPU Memory Sharing for Concurrent DNN Training. 2021 USENIX Annual Technical Conference (USENIX ATC 21). pp. 161-175.
2020. AntMan: Dynamic Scaling on GPU Clusters for Deep Learning. 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI 20). pp. 533-548.
2019. Analysis of Large-Scale Multi-Tenant GPU Clusters for DNN Training Workloads. 2019 USENIX Annual Technical Conference (USENIX ATC 19). pp. 947-960.
2018. Gandiva: Introspective Cluster Scheduling for Deep Learning. 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI 18). pp. 595-610.
2017. Tux²: Distributed Graph Computation for Machine Learning. 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI 17). pp. 669-682.