| EINNET: Optimizing Tensor Programs with Derivation-Based Transformations | OSDI '23 | Liyan Zheng, Haojie Wang, Jidong Zhai, Muyan Hu, Zixuan Ma, Tuowei Wang, Shuhong Huang, Xupeng Miao, Shizhi Tang, Kezhao Huang, Zhihao Jia |
| MGG: Accelerating Graph Neural Networks with Fine-Grained Intra-Kernel Communication-Computation Pipelining on Multi-GPU Platforms | OSDI '23 | Yuke Wang, Boyuan Feng, Zheng Wang, Tong Geng, Kevin Barker, Ang Li, Yufei Ding |
| BWoS: Formally Verified Block-based Work Stealing for Parallel Processing | OSDI '23 | Jiawei Wang, Bohdan Trach, Ming Fu, Diogo Behrens, Jonathan Schwender, Yutao Liu, Jitang Lei, Viktor Vafeiadis, Hermann Härtig, Haibo Chen |
| ShRing: Networking with Shared Receive Rings | OSDI '23 | Boris Pismenny, Adam Morrison, Dan Tsafrir |
| ServiceRouter: Hyperscale and Minimal Cost Service Mesh at Meta | OSDI '23 | Harshit Saokar, Soteris Demetriou, Nick Magerko, Max Kontorovich, Josh Kirstein, Margot Leibold, Dimitrios Skarlatos, Hitesh Khandelwal, Chunqiang Tang |
| Characterizing Off-path SmartNIC for Accelerating Distributed Systems | OSDI '23 | Xingda Wei, Rongxin Cheng, Yuhan Yang, Rong Chen, Haibo Chen |
| Ensō: A Streaming Interface for NIC-Application Communication | OSDI '23 | Hugo Sadok, Nirav Atre, Zhipeng Zhao, Daniel S. Berger, James C. Hoe, Aurojit Panda, Justine Sherry, Ren Wang |
| zpoline: a system call hook mechanism based on binary rewriting | USENIX ATC '23 | Kenichi Yasukata, Hajime Tazaki, Pierre-Louis Aublin, Kenta Ishiguro |
| SmartMoE: Efficiently Training Sparsely-Activated Models through Combining Offline and Online Parallelization | USENIX ATC '23 | Mingshu Zhai, Jiaao He, Zixuan Ma, Zan Zong, Runqing Zhang, Jidong Zhai |
| TC-GNN: Bridging Sparse GNN Computation and Dense Tensor Cores on GPUs | USENIX ATC '23 | Yuke Wang, Boyuan Feng, Zheng Wang, Guyue Huang, Yufei Ding |
| Light-Dedup: A Light-weight Inline Deduplication Framework for Non-Volatile Memory File Systems | USENIX ATC '23 | Jiansheng Qiu, Yanqi Pan, Wen Xia, Xiaojia Huang, Wenjun Wu, Xiangyu Zou, Shiyi Li, Yu Hua |
| SingularFS: A Billion-Scale Distributed File System Using a Single Metadata Server | USENIX ATC '23 | Hao Guo, Youyou Lu, Wenhao Lv, Xiaojian Liao, Shaoxun Zeng, Jiwu Shu |
| AAsclepius: Monitoring, Diagnosing, and Detouring at the Internet Peering Edge | USENIX ATC '23 | Kaicheng Yang, Yuanpeng Li, Sheng Long, Tong Yang, Ruijie Miao, Yikai Zhao, Chaoyang Ji, Penghui Mi, Guodong Yang, Qiong Xie, Hao Wang, Yinhua Wang, Bo Deng, Zhiqiang Liao, Chengqiang Huang, Yongqiang Yang, Xiang Huang, Wei Sun, Xiaoping Zhu |
| Confidential Computing within an AI Accelerator | USENIX ATC '23 | Kapil Vaswani, Stavros Volos, Cédric Fournet, Antonio Nino Diaz, Ken Gordon, Balaji Vembu, Sam Webster, David Chisnall, Saurabh Kulkarni, Graham Cunningham, Richard Osborne, Daniel Wilkinson |
| APRON: Authenticated and Progressive System Image Renovation | USENIX ATC '23 | Sangho Lee |
| Accelerating Distributed MoE Training and Inference with Lina | USENIX ATC '23 | Jiamin Li, Yimin Jiang, Yibo Zhu, Cong Wang, Hong Xu |
| Distributed Transactions at Scale in Amazon DynamoDB | USENIX ATC '23 | Joseph Idziorek, Alex Keyes, Colin Lazier, Somu Perianayagam, Prithvi Ramanathan, James Christopher Sorenson III, Doug Terry, Akshat Vig |
| VectorVisor: A Binary Translation Scheme for Throughput-Oriented GPU Acceleration | USENIX ATC '23 | Samuel Ginzburg, Mohammad Shahrad, Michael J. Freedman |
| Revisiting Secondary Indexing in LSM-based Storage Systems with Persistent Memory | USENIX ATC '23 | Jing Wang, Youyou Lu, Qing Wang, Yuhao Zhang, Jiwu Shu |
| Prefix Siphoning: Exploiting LSM-Tree Range Filters For Information Disclosure | USENIX ATC '23 | Adi Kaufman, Moshik Hershcovitch, Adam Morrison |
| LoopDelta: Embedding Locality-aware Opportunistic Delta Compression in Inline Deduplication for Highly Efficient Data Reduction | USENIX ATC '23 | Yucheng Zhang, Hong Jiang, Dan Feng, Nan Jiang, Taorong Qiu, Wei Huang |
| Beware of Fragmentation: Scheduling GPU-Sharing Workloads with Fragmentation Gradient Descent | USENIX ATC '23 | Qizhen Weng, Lingyun Yang, Yinghao Yu, Wei Wang, Xiaochuan Tang, Guodong Yang, Liping Zhang |
| Legion: Automatically Pushing the Envelope of Multi-GPU System for Billion-Scale GNN Training | USENIX ATC '23 | Jie Sun, Li Su, Zuocheng Shi, Wenting Shen, Zeke Wang, Lei Wang, Jie Zhang, Yong Li, Wenyuan Yu, Jingren Zhou, Fei Wu |
| Tectonic-Shift: A Composite Storage Fabric for Large-Scale ML Training | USENIX ATC '23 | Mark Zhao, Satadru Pan, Niket Agarwal, Zhaoduo Wen, David Xu, Anand Natarajan, Pavan Kumar, Shiva Shankar P, Ritesh Tijoriwala, Karan Asher, Hao Wu, Aarti Basant, Daniel Ford, Delia David, Nezih Yigitbasi, Pratap Singh, Carole-Jean Wu, Christos Kozyrakis |
| GLogS: Interactive Graph Pattern Matching Query At Large Scale | USENIX ATC '23 | Longbin Lai, Yufan Yang, Zhibin Wang, Yuxuan Liu, Haotian Ma, Sijie Shen, Bingqing Lyu, Xiaoli Zhou, Wenyuan Yu, Zhengping Qian, Chen Tian, Sheng Zhong, Yeh-Ching Chung, Jingren Zhou |