Conferences

Search results

    TitleConferenceSpeaker(s)
    Fairness in Serving Large Language ModelsOSDI '24Ying Sheng, Shiyi Cao, Dacheng Li, Banghua Zhu, Zhuohan Li, Danyang Zhuo, Joseph E. Gonzalez, Ion Stoica
    MonoNN: Enabling a New Monolithic Optimization Space for Neural Network Inference Tasks on Modern GPU-Centric ArchitecturesOSDI '24Donglin Zhuang, Zhen Zheng, Haojun Xia, Xiafei Qiu, Junjie Bai, Wei Lin, Shuaiwen Leon Song
    Harmonizing Efficiency and Practicability: Optimizing Resource Utilization in Serverless Computing with JiaguUSENIX ATC '24Qingyuan Liu, Yanning Yang, Dong Du, Yubin Xia, Ping Zhang, Jia Feng, James R. Larus, Haibo Chen
    ALPS: An Adaptive Learning, Priority OS Scheduler for Serverless FunctionsUSENIX ATC '24Yuqi Fu, Ruizhe Shi, Haoliang Wang, Songqing Chen, Yue Cheng
    Starburst: A Cost-aware Scheduler for Hybrid CloudUSENIX ATC '24Michael Luo, Siyuan Zhuang, Suryaprakash Vengadesan, Romil Bhardwaj, Justin Chang, Eric Friedman, Scott Shenker, Ion Stoica
    StreamBox: A Lightweight GPU SandBox for Serverless Inference WorkflowUSENIX ATC '24Hao Wu, Yue Yu, Junxiao Deng, Shadi Ibrahim, Song Wu, Hao Fan, Ziyue Cheng, Hai Jin
    Power-aware Deep Learning Model Serving with μ-ServeUSENIX ATC '24Haoran Qiu, Weichao Mao, Archit Patke, Shengkun Cui, Saurabh Jha, Chen Wang, Hubertus Franke, Zbigniew Kalbarczyk, Tamer Başar, Ravishankar K. Iyer
    Fast Inference for Probabilistic Graphical ModelsUSENIX ATC '24Jiantong Jiang, Zeyi Wen, Atif Mansoor, Ajmal Mian
    Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttentionUSENIX ATC '24Bin Gao, Zhuomin He, Puru Sharma, Qingxuan Kang, Djordje Jevdjic, Junbo Deng, Xingkun Yang, Zhou Yu, Pengfei Zuo
    PUZZLE: Efficiently Aligning Large Language Models through Light-Weight Context SwitchUSENIX ATC '24Kinman Lei, Yuyang Jin, Mingshu Zhai, Kezhao Huang, Haoxing Ye, Jidong Zhai
    ScalaAFA: Constructing User-Space All-Flash Array Engine with Holistic DesignsUSENIX ATC '24Shushu Yi, Xiurui Pan, Qiao Li, Qiang Li, Chenxi Wang, Bo Mao, Myoungsoo Jung, Jie Zhang
    FastCommit: resource-efficient, performant and cost-effective file system journalingUSENIX ATC '24Harshad Shirwadkar, Saurabh Kadekodi, Theodore Tso
    ZMS: Zone Abstraction for Mobile Flash StorageUSENIX ATC '24Joo-Young Hwang, Seokhwan Kim, Daejun Park, Yong-Gil Song, Junyoung Han, Seunghyun Choi, Sangyeun Cho, Youjip Won
    Ethane: An Asymmetric File System for Disaggregated Persistent MemoryUSENIX ATC '24Miao Cai, Junru Shen, Baoliu Ye
    PeRF: Preemption-enabled RDMA FrameworkUSENIX ATC '24Sugi Lee, Mingyu Choi, Ikjun Yeom, Younghoon Kim
    CyberStar: Simple, Elastic and Cost-Effective Network Functions Management in Cloud Network at ScaleUSENIX ATC '24Tingting Xu, Bengbeng Xue, Yang Song, Xiaomin Wu, Xiaoxin Peng, Yilong Lyu, Xiaoliang Wang, Chen Tian, Baoliu Ye, Camtu Nguyen, Biao Lyu, Rong Wen, Zhigang Zong, Shunmin Zhu
    OSMOSIS: Enabling Multi-Tenancy in Datacenter SmartNICsUSENIX ATC '24Mikhail Khalilov, Marcin Chrapek, Siyuan Shen, Alessandro Vezzu, Thomas Benz, Salvatore Di Girolamo, Timo Schneider, Daniele De Sensi, Luca Benini, Torsten Hoefler
    More is Different: Prototyping and Analyzing a New Form of Edge Server with Massive Mobile SoCsUSENIX ATC '24Li Zhang, Zhe Fu, Boqing Shi, Xiang Li, Rujin Lai, Chenyang Yang, Ao Zhou, Xiao Ma, Shangguang Wang, Mengwei Xu
    HiP4-UPF: Towards High-Performance Comprehensive 5G User Plane Function on P4 Programmable SwitchesUSENIX ATC '24Zhixin Wen, Guanhua Yan
    KEPC-Push: A Knowledge-Enhanced Proactive Content Push Strategy for Edge-Assisted Video Feed StreamingUSENIX ATC '24Ziwen Ye, Qing Li, Chunyu Qiao, Xiaoteng Ma, Yong Jiang, Qian Ma, Shengbin Meng, Zhenhui Yuan, Zili Meng
    High-density Mobile Cloud Gaming on Edge SoC ClustersUSENIX ATC '24Li Zhang, Shangguang Wang, Mengwei Xu
    Limitations and Opportunities of Modern Hardware Isolation MechanismsUSENIX ATC '24Xiangdong Chen, Zhaofeng Li, Tirth Jain, Vikram Narayanan, Anton Burtsev
    FetchBPF: Customizable Prefetching Policies in Linux with eBPFUSENIX ATC '24Xuechun Cao, Shaurya Patel, Soo Yee Lim, Xueyuan Han, Thomas Pasquier
    Fast (Trapless) Kernel Probes EverywhereUSENIX ATC '24Jinghao Jia, Michael V. Le, Salman Ahmed, Dan Williams, Hani Jamjoom, Tianyin Xu
    HydraRPC: RPC in the CXL EraUSENIX ATC '24Teng Ma, Zheng Liu, Chengkun Wei, Jialiang Huang, Youwei Zhuo, Haoyu Li, Ning Zhang, Yijin Guan, Dimin Niu, Mingxing Zhang, Tao Ma

Pages