Search results
Title | Conference | Speaker(s)
---|---|---
Sabre: Hardware-Accelerated Snapshot Compression for Serverless MicroVMs | OSDI '24 | Nikita Lazarev, Varun Gohil, James Tsai, Andy Anderson, Bhushan Chitlur, Zhiru Zhang, Christina Delimitrou
Nomad: Non-Exclusive Memory Tiering via Transactional Page Migration | OSDI '24 | Lingfeng Xiang, Zhen Lin, Weishu Deng, Hui Lu, Jia Rao, Yifan Yuan, Ren Wang
Managing Memory Tiers with CXL in Virtualized Environments | OSDI '24 | Yuhong Zhong, Daniel S. Berger, Carl Waldspurger, Ryan Wee, Ishwar Agarwal, Rajat Agarwal, Frank Hady, Karthik Kumar, Mark D. Hill, Mosharaf Chowdhury, Asaf Cidon
Harvesting Memory-bound CPU Stall Cycles in Software with MSH | OSDI '24 | Zhihong Luo, Sam Son, Sylvia Ratnasamy, Scott Shenker
DRust: Language-Guided Distributed Shared Memory with Fine Granularity, Full Transparency, and Ultra Efficiency | OSDI '24 | Haoran Ma, Yifan Qiao, Shi Liu, Shan Yu, Yuanjiang Ni, Qingda Lu, Jiesheng Wu, Yiying Zhang, Miryung Kim, Harry Xu
Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-Serve | OSDI '24 | Amey Agrawal, Nitin Kedia, Ashish Panwar, Jayashree Mohan, Nipun Kwatra, Bhargav Gulavani, Alexey Tumanov, Ramachandran Ramjee
ServerlessLLM: Low-Latency Serverless Inference for Large Language Models | OSDI '24 | Yao Fu, Leyang Xue, Yeqi Huang, Andrei-Octavian Brabete, Dmitrii Ustiugov, Yuvraj Patel, Luo Mai
InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management | OSDI '24 | Wonbeom Lee, Jungi Lee, Junghwan Seo, Jaewoong Sim
Llumnix: Dynamic Scheduling for Large Language Model Serving | OSDI '24 | Biao Sun, Ziming Huang, Hanyu Zhao, Wencong Xiao, Xinyi Zhang, Yong Li, Wei Lin
DistServe: Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving | OSDI '24 | Yinmin Zhong, Shengyu Liu, Junda Chen, Jianbo Hu, Yibo Zhu, Xuanzhe Liu, Xin Jin, Hao Zhang
ACCL+: an FPGA-Based Collective Engine for Distributed Applications | OSDI '24 | Zhenhao He, Dario Korolija, Yu Zhu, Benjamin Ramhorst, Tristan Laan, Lucian Petrica, Michaela Blott, Gustavo Alonso
Beaver: Practical Partial Snapshots for Distributed Cloud Services | OSDI '24 | Liangcheng Yu, Xiao Zhang, Haoran Zhang, John Sonchack, Dan Ports, Vincent Liu
Fast and Scalable In-network Lock Management Using Lock Fission | OSDI '24 | Hanze Zhang, Ke Cheng, Rong Chen, Haibo Chen
Chop Chop: Byzantine Atomic Broadcast to the Network Limit | OSDI '24 | Martina Camaioni, Rachid Guerraoui, Matteo Monti, Pierre-Louis Roman, Manuel Vidigueira, Gauthier Voron
Enabling Tensor Language Model to Assist in Generating High-Performance Tensor Programs for Deep Learning | OSDI '24 | Yi Zhai, Sijia Yang, Keyu Pan, Renwei Zhang, Shuo Liu, Chao Liu, Zichun Ye, Jianmin Ji, Jie Zhao, Yu Zhang, Yanyong Zhang
Ladder: Enabling Efficient Low-Precision Deep Learning Computing through Hardware-aware Tensor Transformation | OSDI '24 | Lei Wang, Lingxiao Ma, Shijie Cao, Quanlu Zhang, Jilong Xue, Yining Shi, Ningxin Zheng, Ziming Miao, Fan Yang, Ting Cao, Yuqing Yang, Mao Yang
Caravan: Practical Online Learning of In-Network ML Models with Labeling Agents | OSDI '24 | Qizheng Zhang, Ali Imran, Enkeleda Bardhi, Tushar Swamy, Nathan Zhang, Muhammad Shahbaz, Kunle Olukotun
nnScaler: Constraint-Guided Parallelization Plan Generation for Deep Learning Training | OSDI '24 | Zhiqi Lin, Youshan Miao, Quanlu Zhang, Fan Yang, Yi Zhu, Cheng Li, Saeed Maleki, Xu Cao, Ning Shang, Yilei Yang, Weijiang Xu, Mao Yang, Lintao Zhang, Lidong Zhou
ChameleonAPI: Automatic and Efficient Customization of Neural Networks for ML Applications | OSDI '24 | Yuhan Liu, Chengcheng Wan, Kuntai Du, Henry Hoffmann, Junchen Jiang, Shan Lu, Michael Maire
SquirrelFS: using the Rust compiler to check file-system crash consistency | OSDI '24 | Hayley LeBlanc, Nathan Taylor, James Bornholt, Vijay Chidambaram
High-throughput and Flexible Host Networking for Accelerated Computing | OSDI '24 | Athinagoras Skiadopoulos, Zhiqiang Xie, Mark Zhao, Qizhe Cai, Saksham Agarwal, Jacob Adelmann, David Ahern, Carlo Contavalli, Michael Goldflam, Vitaly Mayatskikh, Raghu Raja, Daniel Walton, Rachit Agarwal, Shrijeet Mukherjee, Christos Kozyrakis
IntOS: Persistent Embedded Operating System and Language Support for Multi-threaded Intermittent Computing | OSDI '24 | Yilun Wu, Byounguk Min, Mohannad Ismail, Wenjie Xiong, Changhee Jung, Dongyoon Lee
Data-flow Availability: Achieving Timing Assurance in Autonomous Systems | OSDI '24 | Ao Li, Ning Zhang
Microkernel Goes General: Performance and Compatibility in the HongMeng Production Microkernel | OSDI '24 | Haibo Chen, Xie Miao, Ning Jia, Nan Wang, Yu Li, Nian Liu, Yutao Liu, Fei Wang, Qiang Huang, Kun Li, Hongyang Yang, Hui Wang, Jie Yin, Yu Peng, Fengwei Xu
Optimizing Resource Allocation in Hyperscale Datacenters: Scalability, Usability, and Experiences | OSDI '24 | Neeraj Kumar, Pol Mauri Ruiz, Vijay Menon, Igor Kabiljo, Mayank Pundir, Andrew Newell, Daniel Lee, Liyuan Wang, Chunqiang Tang