Biblio

Export 5 results:
Filters: Author is Zili Zhang  [Clear All Filters]
2024
Wu B, Zhu R, Zhang Z, Sun P, Liu X, Jin X.  2024.  dLoRA: Dynamically Orchestrating Requests and Adapters for LoRA LLM Serving. 18th USENIX Symposium on Operating Systems Design and Implementation (OSDI 24). :911--927.
Zhang Z, Liu F, Huang G, Liu X, Jin X.  2024.  Fast Vector Query Processing for Large Datasets Beyond GPU Memory with Reordered Pipelining. 21st USENIX Symposium on Networked Systems Design and Implementation (NSDI 24). :23--40.
Zhang Z, Jin C, Jin X.  2024.  Jolteon: Unleashing the Promise of Serverless for Serverless Workflows. 21st USENIX Symposium on Networked Systems Design and Implementation (NSDI 24). :167--183.
2023
Zhang Z, Jin C, Tang L, Liu X, Jin X.  2023.  Fast, Approximate Vector Queries on Very Large Unstructured Datasets. 20th USENIX Symposium on Networked Systems Design and Implementation (NSDI 23). :995--1011.
Wu B, Zhang Z, Bai Z, Liu X, Jin X.  2023.  Transparent GPU Sharing in Container Clouds for Deep Learning Workloads. 20th USENIX Symposium on Networked Systems Design and Implementation (NSDI 23). :69--85.