Biblio
6 results for author Hao Zhang:
2024. DistServe: Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving. 18th USENIX Symposium on Operating Systems Design and Implementation (OSDI 24). pp. 193-210.
2023. AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving. 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI 23). pp. 663-679.
2022. Alpa: Automating Inter- and Intra-Operator Parallelism for Distributed Deep Learning. 16th USENIX Symposium on Operating Systems Design and Implementation (OSDI 22). pp. 559-578.
2021. Pollux: Co-adaptive Cluster Scheduling for Goodput-Optimized Deep Learning. 15th USENIX Symposium on Operating Systems Design and Implementation (OSDI 21). pp. 1-18.
2018. Cavs: An Efficient Runtime System for Dynamic Neural Networks. 2018 USENIX Annual Technical Conference (USENIX ATC 18). pp. 937-950.
2017. Poseidon: An Efficient Communication Architecture for Distributed Deep Learning on GPU Clusters. 2017 USENIX Annual Technical Conference (USENIX ATC 17). pp. 181-193.