Biblio
Filters: Author is Fan Yang (17 results)
2024. Ladder: Enabling Efficient Low-Precision Deep Learning Computing through Hardware-aware Tensor Transformation. 18th USENIX Symposium on Operating Systems Design and Implementation (OSDI 24). pp. 307-323.
2024. nnScaler: Constraint-Guided Parallelization Plan Generation for Deep Learning Training. 18th USENIX Symposium on Operating Systems Design and Implementation (OSDI 24). pp. 347-363.
2024. Parrot: Efficient Serving of LLM-based Applications with Semantic Variable. 18th USENIX Symposium on Operating Systems Design and Implementation (OSDI 24). pp. 929-945.
2023. Cocktailer: Analyzing and Optimizing Dynamic Control Flow in Deep Learning. 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI 23). pp. 681-699.
2023. On Modular Learning of Distributed Systems for Predicting End-to-End Latency. 20th USENIX Symposium on Networked Systems Design and Implementation (NSDI 23). pp. 1081-1095.
2023. Optimizing Dynamic Neural Networks with Brainstorm. 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI 23). pp. 797-815.
2023. PROGRAPHER: An Anomaly Detection System based on Provenance Graph Embedding. 32nd USENIX Security Symposium (USENIX Security 23). pp. 4355-4372.
2023. VBASE: Unifying Online Vector Similarity Search and Relational Queries via Relaxed Monotonicity. 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI 23). pp. 377-395.
2023. Welder: Scheduling Deep Learning Memory Access via Tile-graph. 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI 23). pp. 701-718.
2022. PilotFish: Harvesting Free Cycles of Cloud Gaming with Deep Learning Training. 2022 USENIX Annual Technical Conference (USENIX ATC 22). pp. 217-232.
2022. ROLLER: Fast and Efficient Tensor Compilation for Deep Learning. 16th USENIX Symposium on Operating Systems Design and Implementation (OSDI 22). pp. 233-248.
2022. SparTA: Deep-Learning Model Sparsity via Tensor-with-Sparsity-Attribute. 16th USENIX Symposium on Operating Systems Design and Implementation (OSDI 22). pp. 213-232.
2020. HiveD: Sharing a GPU Cluster for Deep Learning with Guarantees. 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI 20). pp. 515-532.
2020. Rammer: Enabling Holistic Deep Learning Compiler Optimizations with rTasks. 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI 20). pp. 881-897.
2020. Retiarii: A Deep Learning Exploratory-Training Framework. 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI 20). pp. 919-936.
2019. Analysis of Large-Scale Multi-Tenant GPU Clusters for DNN Training Workloads. 2019 USENIX Annual Technical Conference (USENIX ATC 19). pp. 947-960.
2018. Gandiva: Introspective Cluster Scheduling for Deep Learning. 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI 18). pp. 595-610.