| Identifying On-/Off-CPU Bottlenecks Together with Blocked Samples | OSDI '24 | Minwoo Ahn, Jeongmin Han, Youngjin Kwon, Jinkyu Jeong |
| μSlope: High Compression and Fast Search on Semi-Structured Logs | OSDI '24 | Rui Wang, Devin Gibson, Kirk Rodrigues, Yu Luo, Yun Zhang, Kaibo Wang, Yupeng Fu, Ting Chen, Ding Yuan |
| Scaling AI Sustainably: An Uncharted Territory | USENIX ATC '24 | Carole-Jean Wu |
| dLoRA: Dynamically Orchestrating Requests and Adapters for LoRA LLM Serving | OSDI '24 | Bingyang Wu, Ruidong Zhu, Zili Zhang, Peng Sun, Xuanzhe Liu, Xin Jin |
| Parrot: Efficient Serving of LLM-based Applications with Semantic Variable | OSDI '24 | Chaofan Lin, Zhenhua Han, Chengruidong Zhang, Yuqing Yang, Fan Yang, Chen Chen, Lili Qiu |
| USHER: Holistic Interference Avoidance for Resource Optimized ML Inference | OSDI '24 | Sudipta Saha Shubha, Haiying Shen, Anand Iyer |
| Fairness in Serving Large Language Models | OSDI '24 | Ying Sheng, Shiyi Cao, Dacheng Li, Banghua Zhu, Zhuohan Li, Danyang Zhuo, Joseph E. Gonzalez, Ion Stoica |
| MonoNN: Enabling a New Monolithic Optimization Space for Neural Network Inference Tasks on Modern GPU-Centric Architectures | OSDI '24 | Donglin Zhuang, Zhen Zheng, Haojun Xia, Xiafei Qiu, Junjie Bai, Wei Lin, Shuaiwen Leon Song |
| Harmonizing Efficiency and Practicability: Optimizing Resource Utilization in Serverless Computing with Jiagu | USENIX ATC '24 | Qingyuan Liu, Yanning Yang, Dong Du, Yubin Xia, Ping Zhang, Jia Feng, James R. Larus, Haibo Chen |
| ALPS: An Adaptive Learning, Priority OS Scheduler for Serverless Functions | USENIX ATC '24 | Yuqi Fu, Ruizhe Shi, Haoliang Wang, Songqing Chen, Yue Cheng |
| Starburst: A Cost-aware Scheduler for Hybrid Cloud | USENIX ATC '24 | Michael Luo, Siyuan Zhuang, Suryaprakash Vengadesan, Romil Bhardwaj, Justin Chang, Eric Friedman, Scott Shenker, Ion Stoica |
| StreamBox: A Lightweight GPU SandBox for Serverless Inference Workflow | USENIX ATC '24 | Hao Wu, Yue Yu, Junxiao Deng, Shadi Ibrahim, Song Wu, Hao Fan, Ziyue Cheng, Hai Jin |
| Power-aware Deep Learning Model Serving with μ-Serve | USENIX ATC '24 | Haoran Qiu, Weichao Mao, Archit Patke, Shengkun Cui, Saurabh Jha, Chen Wang, Hubertus Franke, Zbigniew Kalbarczyk, Tamer Başar, Ravishankar K. Iyer |
| Fast Inference for Probabilistic Graphical Models | USENIX ATC '24 | Jiantong Jiang, Zeyi Wen, Atif Mansoor, Ajmal Mian |
| Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttention | USENIX ATC '24 | Bin Gao, Zhuomin He, Puru Sharma, Qingxuan Kang, Djordje Jevdjic, Junbo Deng, Xingkun Yang, Zhou Yu, Pengfei Zuo |
| PUZZLE: Efficiently Aligning Large Language Models through Light-Weight Context Switch | USENIX ATC '24 | Kinman Lei, Yuyang Jin, Mingshu Zhai, Kezhao Huang, Haoxing Ye, Jidong Zhai |
| ScalaAFA: Constructing User-Space All-Flash Array Engine with Holistic Designs | USENIX ATC '24 | Shushu Yi, Xiurui Pan, Qiao Li, Qiang Li, Chenxi Wang, Bo Mao, Myoungsoo Jung, Jie Zhang |
| FastCommit: resource-efficient, performant and cost-effective file system journaling | USENIX ATC '24 | Harshad Shirwadkar, Saurabh Kadekodi, Theodore Tso |
| ZMS: Zone Abstraction for Mobile Flash Storage | USENIX ATC '24 | Joo-Young Hwang, Seokhwan Kim, Daejun Park, Yong-Gil Song, Junyoung Han, Seunghyun Choi, Sangyeun Cho, Youjip Won |
| Ethane: An Asymmetric File System for Disaggregated Persistent Memory | USENIX ATC '24 | Miao Cai, Junru Shen, Baoliu Ye |
| PeRF: Preemption-enabled RDMA Framework | USENIX ATC '24 | Sugi Lee, Mingyu Choi, Ikjun Yeom, Younghoon Kim |
| CyberStar: Simple, Elastic and Cost-Effective Network Functions Management in Cloud Network at Scale | USENIX ATC '24 | Tingting Xu, Bengbeng Xue, Yang Song, Xiaomin Wu, Xiaoxin Peng, Yilong Lyu, Xiaoliang Wang, Chen Tian, Baoliu Ye, Camtu Nguyen, Biao Lyu, Rong Wen, Zhigang Zong, Shunmin Zhu |
| OSMOSIS: Enabling Multi-Tenancy in Datacenter SmartNICs | USENIX ATC '24 | Mikhail Khalilov, Marcin Chrapek, Siyuan Shen, Alessandro Vezzu, Thomas Benz, Salvatore Di Girolamo, Timo Schneider, Daniele De Sensi, Luca Benini, Torsten Hoefler |
| More is Different: Prototyping and Analyzing a New Form of Edge Server with Massive Mobile SoCs | USENIX ATC '24 | Li Zhang, Zhe Fu, Boqing Shi, Xiang Li, Rujin Lai, Chenyang Yang, Ao Zhou, Xiao Ma, Shangguang Wang, Mengwei Xu |
| HiP4-UPF: Towards High-Performance Comprehensive 5G User Plane Function on P4 Programmable Switches | USENIX ATC '24 | Zhixin Wen, Guanhua Yan |