Conferences

Search results

    Title | Conference | Speaker(s)
    Identifying On-/Off-CPU Bottlenecks Together with Blocked Samples | OSDI '24 | Minwoo Ahn, Jeongmin Han, Youngjin Kwon, Jinkyu Jeong
    μSlope: High Compression and Fast Search on Semi-Structured Logs | OSDI '24 | Rui Wang, Devin Gibson, Kirk Rodrigues, Yu Luo, Yun Zhang, Kaibo Wang, Yupeng Fu, Ting Chen, Ding Yuan
    Scaling AI Sustainably: An Uncharted Territory | USENIX ATC '24 | Carole-Jean Wu
    dLoRA: Dynamically Orchestrating Requests and Adapters for LoRA LLM Serving | OSDI '24 | Bingyang Wu, Ruidong Zhu, Zili Zhang, Peng Sun, Xuanzhe Liu, Xin Jin
    Parrot: Efficient Serving of LLM-based Applications with Semantic Variable | OSDI '24 | Chaofan Lin, Zhenhua Han, Chengruidong Zhang, Yuqing Yang, Fan Yang, Chen Chen, Lili Qiu
    USHER: Holistic Interference Avoidance for Resource Optimized ML Inference | OSDI '24 | Sudipta Saha Shubha, Haiying Shen, Anand Iyer
    Fairness in Serving Large Language Models | OSDI '24 | Ying Sheng, Shiyi Cao, Dacheng Li, Banghua Zhu, Zhuohan Li, Danyang Zhuo, Joseph E. Gonzalez, Ion Stoica
    MonoNN: Enabling a New Monolithic Optimization Space for Neural Network Inference Tasks on Modern GPU-Centric Architectures | OSDI '24 | Donglin Zhuang, Zhen Zheng, Haojun Xia, Xiafei Qiu, Junjie Bai, Wei Lin, Shuaiwen Leon Song
    Harmonizing Efficiency and Practicability: Optimizing Resource Utilization in Serverless Computing with Jiagu | USENIX ATC '24 | Qingyuan Liu, Yanning Yang, Dong Du, Yubin Xia, Ping Zhang, Jia Feng, James R. Larus, Haibo Chen
    ALPS: An Adaptive Learning, Priority OS Scheduler for Serverless Functions | USENIX ATC '24 | Yuqi Fu, Ruizhe Shi, Haoliang Wang, Songqing Chen, Yue Cheng
    Starburst: A Cost-aware Scheduler for Hybrid Cloud | USENIX ATC '24 | Michael Luo, Siyuan Zhuang, Suryaprakash Vengadesan, Romil Bhardwaj, Justin Chang, Eric Friedman, Scott Shenker, Ion Stoica
    StreamBox: A Lightweight GPU SandBox for Serverless Inference Workflow | USENIX ATC '24 | Hao Wu, Yue Yu, Junxiao Deng, Shadi Ibrahim, Song Wu, Hao Fan, Ziyue Cheng, Hai Jin
    Power-aware Deep Learning Model Serving with μ-Serve | USENIX ATC '24 | Haoran Qiu, Weichao Mao, Archit Patke, Shengkun Cui, Saurabh Jha, Chen Wang, Hubertus Franke, Zbigniew Kalbarczyk, Tamer Başar, Ravishankar K. Iyer
    Fast Inference for Probabilistic Graphical Models | USENIX ATC '24 | Jiantong Jiang, Zeyi Wen, Atif Mansoor, Ajmal Mian
    Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttention | USENIX ATC '24 | Bin Gao, Zhuomin He, Puru Sharma, Qingxuan Kang, Djordje Jevdjic, Junbo Deng, Xingkun Yang, Zhou Yu, Pengfei Zuo
    PUZZLE: Efficiently Aligning Large Language Models through Light-Weight Context Switch | USENIX ATC '24 | Kinman Lei, Yuyang Jin, Mingshu Zhai, Kezhao Huang, Haoxing Ye, Jidong Zhai
    ScalaAFA: Constructing User-Space All-Flash Array Engine with Holistic Designs | USENIX ATC '24 | Shushu Yi, Xiurui Pan, Qiao Li, Qiang Li, Chenxi Wang, Bo Mao, Myoungsoo Jung, Jie Zhang
    FastCommit: resource-efficient, performant and cost-effective file system journaling | USENIX ATC '24 | Harshad Shirwadkar, Saurabh Kadekodi, Theodore Tso
    ZMS: Zone Abstraction for Mobile Flash Storage | USENIX ATC '24 | Joo-Young Hwang, Seokhwan Kim, Daejun Park, Yong-Gil Song, Junyoung Han, Seunghyun Choi, Sangyeun Cho, Youjip Won
    Ethane: An Asymmetric File System for Disaggregated Persistent Memory | USENIX ATC '24 | Miao Cai, Junru Shen, Baoliu Ye
    PeRF: Preemption-enabled RDMA Framework | USENIX ATC '24 | Sugi Lee, Mingyu Choi, Ikjun Yeom, Younghoon Kim
    CyberStar: Simple, Elastic and Cost-Effective Network Functions Management in Cloud Network at Scale | USENIX ATC '24 | Tingting Xu, Bengbeng Xue, Yang Song, Xiaomin Wu, Xiaoxin Peng, Yilong Lyu, Xiaoliang Wang, Chen Tian, Baoliu Ye, Camtu Nguyen, Biao Lyu, Rong Wen, Zhigang Zong, Shunmin Zhu
    OSMOSIS: Enabling Multi-Tenancy in Datacenter SmartNICs | USENIX ATC '24 | Mikhail Khalilov, Marcin Chrapek, Siyuan Shen, Alessandro Vezzu, Thomas Benz, Salvatore Di Girolamo, Timo Schneider, Daniele De Sensi, Luca Benini, Torsten Hoefler
    More is Different: Prototyping and Analyzing a New Form of Edge Server with Massive Mobile SoCs | USENIX ATC '24 | Li Zhang, Zhe Fu, Boqing Shi, Xiang Li, Rujin Lai, Chenyang Yang, Ao Zhou, Xiao Ma, Shangguang Wang, Mengwei Xu
    HiP4-UPF: Towards High-Performance Comprehensive 5G User Plane Function on P4 Programmable Switches | USENIX ATC '24 | Zhixin Wen, Guanhua Yan
