9:00 am–10:00 am
USENIX ATC '24 and OSDI '24 Joint Keynote
Scaling AI Sustainably: An Uncharted Territory
Carole-Jean Wu, Meta
The past 50 years have seen a dramatic increase in the amount of compute per person, in particular the compute enabled by AI. Despite the positive societal benefits, AI technologies come with significant environmental implications. I will talk about the scaling trend and the operational carbon footprint of AI computing by examining the model development cycle, spanning data, algorithms, and system hardware. At the same time, we will consider the life cycle of system hardware from the perspective of hardware architectures and manufacturing technologies. I will highlight key efficiency optimization opportunities for cutting-edge AI technologies, from deep learning recommendation models to multi-modal generative AI tasks. To scale AI sustainably, we need to make AI and computing more broadly efficient and flexible. We must also go beyond efficiency and optimize across the life cycle of computing infrastructures, from hardware manufacturing to datacenter operation and end-of-life processing for the hardware. Based on industry experience and lessons learned, my talk will conclude with important development and research directions to advance the field of computing in an environmentally responsible and sustainable manner.
Carole-Jean Wu, Meta
Carole-Jean Wu is a Director at Meta. She is a founding member and a Vice President of MLCommons, a non-profit organization that aims to accelerate machine learning for the benefit of all. Dr. Wu also serves on the MLCommons Board as a Director, chaired the MLPerf Recommendation Benchmark Advisory Board, and co-chaired MLPerf Inference. Prior to Meta/Facebook, she was a tenured professor at ASU. She earned her M.A. and Ph.D. from Princeton and B.Sc. from Cornell.
Dr. Wu's expertise sits at the intersection of computer architecture and machine learning. Her work spans datacenter infrastructures and edge systems, such as developing energy- and memory-efficient systems and microarchitectures, optimizing systems for machine learning execution at scale, and designing learning-based approaches for system design and optimization. Dr. Wu's work has been recognized with several awards, including IEEE Micro Top Picks and ACM/IEEE Best Paper Awards. She was the Program Co-Chair of the Conference on Machine Learning and Systems (MLSys) in 2022, the Program Chair of the IEEE International Symposium on Workload Characterization (IISWC) in 2018, and the Editor for the IEEE MICRO Special Issue on Environmentally Sustainable Computing. She currently serves on the ACM SIGARCH/SIGMICRO CARES committee.
10:00 am–10:30 am
Break with Refreshments
10:30 am–10:45 am
Opening Remarks
Saurabh Bagchi, Purdue University; Yiying Zhang, University of California, San Diego
10:45 am–12:25 pm
Cloud Computing
Harmonizing Efficiency and Practicability: Optimizing Resource Utilization in Serverless Computing with ASTROLABE
Qingyuan Liu, Yanning Yang, Dong Du, and Yubin Xia, Shanghai Jiao Tong University; Ping Zhang and Jia Feng, Huawei Cloud; James Larus, EPFL; Haibo Chen, Shanghai Jiao Tong University
SEALS: A Self-Adaptive, Learned Scheduler for Serverless Functions
Yuqi Fu, University of Virginia; Ruizhe Shi, George Mason University; Haoliang Wang, Adobe Research; Songqing Chen, George Mason University; Yue Cheng, University of Virginia
Starburst: A Cost-aware Scheduler for Cloud Bursting
Michael Luo, Suryaprakash Vengadesan, Siyuan Zhuang, and Romil Bhardwaj, UC Berkeley; Justin Chang, UCSB; Eric Friedman, ICSI and UC Berkeley; Scott Shenker, ICSI and UC Berkeley; Ion Stoica, UC Berkeley
StreamBox: A Lightweight GPU SandBox for Serverless Inference Workflow
Hao Wu, Yue Yu, and Junxiao Deng, Huazhong University of Science and Technology; Shadi Ibrahim, Inria; Ziyue Cheng, Hao Fan, Song Wu, and Hai Jin, Huazhong University of Science and Technology
ML Inference
Power-aware Deep Learning Model Serving with μ-Serve
Haoran Qiu, Weichao Mao, Archit Patke, and Shengkun Cui, University of Illinois Urbana-Champaign; Saurabh Jha, Chen Wang, and Hubertus Franke, IBM Research; Zbigniew Kalbarczyk, Tamer Başar, and Ravishankar K. Iyer, University of Illinois Urbana-Champaign
Fast Inference for Probabilistic Graphical Models
Jiantong Jiang, The University of Western Australia; Zeyi Wen, The Hong Kong University of Science and Technology (Guangzhou); Atif Mansoor and Ajmal Mian, The University of Western Australia
Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttention
Bin Gao, National University of Singapore; Zhuomin He, Shanghai Jiao Tong University; Puru Sharma, Qingxuan Kang, and Djordje Jevdjic, National University of Singapore; Junbo Deng, Xingkun Yang, Zhou Yu, and Pengfei Zuo, Huawei Cloud
PUZZLE: Efficiently Aligning Large Language Models through Light-Weight Context Switch
Kinman Lei, Mingshu Zhai, Yuyang Jin, Kezhao Huang, Haoxing Ye, and Jidong Zhai, Tsinghua University
2:00 pm–3:40 pm
Storage 1
ScalaAFA: Constructing User-Space All-Flash Array Engine with Holistic Designs
Shushu Yi and Xiurui Pan, Peking University; Qiao Li, Xiamen University; Qiang Li, Alibaba; Chenxi Wang, Chinese Academy of Sciences; Bo Mao, Xiamen University; Myoungsoo Jung, KAIST and Panmnesia; Jie Zhang, Peking University
XCommit: resource-efficient, performant and cost-effective file system journaling
Harshad Shirwadkar, Saurabh Kadekodi, and Theodore Ts'o, Google
ZMS: Zone Abstraction for Mobile Flash Storage
Joo-Young Hwang, Seokhwan Kim, Daejun Park, Yong-Gil Song, Junyoung Han, Seunghyun Choi, and Sangyeun Cho, Samsung Electronics; Youjip Won, Korea Advanced Institute of Science and Technology (KAIST)
Ethane: An Asymmetric File System for Disaggregated Persistent Memory
Miao Cai and Junru Shen, College of Computer Science and Software Engineering, Hohai University; Baoliu Ye, State Key Laboratory for Novel Software Technology, Nanjing University, and College of Computer Science and Software Engineering, Hohai University
Networks 1
PeRF: Preemption-enabled RDMA Framework
Sugi Lee and Mingyu Choi, Acryl Inc.; Ikjun Yeom and Younghoon Kim, Sungkyunkwan University
CyberStar: Simple, Elastic and Cost-Effective Network Functions Management in Cloud Network at Scale
Tingting Xu, unaffiliated; Shunmin Zhu, Song Yang, Xiaomin Wu, Zhigang Zong, Xiaoxin Peng, Bengbeng Xue, Botao Yan, and Yilong Lv, Alibaba Group; Camtu Nguyen and Xiaoliang Wang, Nanjing University
OSMOSIS: Enabling Multi-Tenancy in Datacenter SmartNICs
Mikhail Khalilov, Marcin Andrzej Chrapek, Siyuan Shen, Thomas Benz, Alessandro Vezzu, Salvatore Di Girolamo, and Timo Schneider, ETH Zurich; Daniele De Sensi, Sapienza University of Rome; Luca Benini and Torsten Hoefler, ETH Zurich
ETC: An Elastic Transmission Control Using End-to-End Available Bandwidth Perception
Feixue Han, Tsinghua Shenzhen International Graduate School; Qing Li, Peng Cheng Laboratory; Peng Zhang, Tencent; Gareth Tyson, Hong Kong University of Science and Technology; Yong Jiang, Tsinghua Shenzhen International Graduate School; Mingwei Xu, Tsinghua University; Yulong Lan and ZhiCheng Li, Tencent
3:40 pm–4:10 pm
Break with Refreshments
4:10 pm–5:55 pm
Edge Computing
More is Different: Prototyping and Analyzing a New Form of Edge Server with Massive Mobile SoCs
Li Zhang, Beijing University of Posts and Telecommunications; Zhe Fu, Tsinghua University; Boqing Shi and Xiang Li, Beijing University of Posts and Telecommunications; Rujin Lai and Chenyang Yang, vclusters; Ao Zhou, Xiao Ma, Shangguang Wang, and Mengwei Xu, Beijing University of Posts and Telecommunications
HiP4-UPF: Towards High-Performance Comprehensive 5G User Plane Function on P4 Programmable Switches
Zhixin Wen and Guanhua Yan, Binghamton University, State University of New York
KEPC-Push: A Knowledge-Enhanced Proactive Content Push Strategy for Edge-Assisted Video Feed Streaming
Ziwen Ye, Tsinghua University; Qing Li, Peng Cheng Laboratory; Chunyu Qiao, ByteDance; Xiaoteng Ma, Tsinghua University; Yong Jiang, Tsinghua Shenzhen International Graduate School; Qian Ma and Shengbin Meng, ByteDance; Zhenhui Yuan, University of Warwick; Zili Meng, HKUST
High-density Mobile Cloud Gaming on Edge SoC Farms
Li Zhang, Shangguang Wang, and Mengwei Xu, Beijing University of Posts and Telecommunications
Operating Systems 1
Opportunities and Limitations of Modern Hardware Isolation Mechanisms
Xiangdong Chen and Zhaofeng Li, University of Utah; Tirth Jain, Birla Institute of Technology and Science, Pilani; Vikram Narayanan and Anton Burtsev, University of Utah
FetchBPF: Customizable Prefetching Policies in Linux with eBPF
Xuechun Cao, Shaurya Patel, and Soo Yee Lim, University of British Columbia; Xueyuan Han, Wake Forest University; Thomas Pasquier, University of British Columbia
Fast (Trapless) Kernel Probes Everywhere
Jinghao Jia, University of Illinois Urbana-Champaign; Michael Le, IBM Research; Salman Ahmed, IBM Research, Yorktown Heights; Dan Williams, Virginia Tech; Hani Jamjoom, IBM; Tianyin Xu, University of Illinois Urbana-Champaign
HydraRPC: RPC in the CXL Era
Teng Ma, Alibaba Group; Zheng Liu, Zhejiang University and Alibaba Group; Chengkun Wei, Zhejiang University; Jialiang Huang, Tsinghua University; Youwei Zhuo, Alibaba Group; Haoyu Li, Zhejiang University; Ning Zhang, Yijin Guan, and Dimin Niu, Alibaba Group; Mingxing Zhang, Tsinghua University; Tao Ma, Alibaba Group
Enabling Application-Aware Memory Page Placement Policies and Mechanisms With ExtMem
Sepehr Jalalian, Shaurya Patel, Milad Rezaei Hajidehi, and Margo Seltzer, University of British Columbia; Alexandra (Sasha) Fedorova, University of British Columbia and MongoDB
9:00 am–10:40 am
Operating Systems 2
TeleScale: Telemetry for Gargantuan Memory Footprint Applications
Alan Nair, The University of Edinburgh; Sandeep Kumar and Aravinda Prasad, Intel Labs; Ying Huang, Intel; Andy Rudoff, Intel Corporation; Sreenivas Subramoney, Intel
An Empirical Study of Rust-for-Linux: The Success, Dissatisfaction, and Compromise
Hongyu Li, Beijing University of Posts and Telecommunications; Liwei Guo, University of Electronic Science and Technology of China; Yexuan Yang, Shangguang Wang, and Mengwei Xu, Beijing University of Posts and Telecommunications
Scalable and Effective Page-table and TLB management on NUMA Systems
Bin Gao, Qingxuan Kang, and Hao-Wei Tee, National University of Singapore; Kyle Timothy Ng Chu, Horizon Quantum Computing; Alireza Sanaee, Queen Mary University of London; Djordje Jevdjic, National University of Singapore
UniMem: Redesigning Disaggregated Memory within A Unified Local-Remote Memory Hierarchy
Yijie Zhong, Minqiang Zhou, and Zhirong Shen, Xiamen University
Correctness
WingFuzz: Implementing Continuous Fuzzing for DBMSs
Jie Liang, Zhiyong Wu, and Jingzhou Fu, Tsinghua University; Yiyuan Bai and Qiang Zhang, Shuimu Yulin Technology Co., Ltd.; Yu Jiang, Tsinghua University
Balancing Analysis Time and Bug Detection: Daily Development-friendly Bug Detection in Linux
Keita Suzuki, Keio University; Kenta Ishiguro, Hosei University; Kenji Kono, Keio University
Koncord: Verifying Cluster Management Systems
Bingzhe Liu and Gangmuk Lim, UIUC; Ryan Beckett, Microsoft Research; Brighten Godfrey, UIUC and VMware
Monarch: A Fuzzing Framework for Distributed File Systems
Tao Lyu, EPFL; Liyi Zhang, University of Waterloo; Zhiyao Feng, Yueyang Pan, and Yujie Ren, EPFL; Meng Xu, University of Waterloo; Mathias Payer and Sanidhya Kashyap, EPFL
10:40 am–11:10 am
Break with Refreshments
11:10 am–12:25 pm
ML Training
Accelerating the Training of Large Language Models using Efficient Activation Rematerialization and Optimal Hybrid Parallelism
Tailing Yuan, Yuliang Liu, Xucheng Ye, Shenglong Zhang, Jianchao Tan, Bin Chen, Chengru Song, and Di Zhang, Kuaishou Technology
Metis: Fast Automatic Distributed Training on Heterogeneous GPUs
Taegeon Um, Byungsoo Oh, Minyoung Kang, Woo-Yeon Lee, Goeun Kim, Dongseob Kim, Youngtaek Kim, and Mohd Muzzammil, Samsung Research; Myeongjae Jeon, UNIST
FwdFL: Efficient Federated Finetuning of Language Models
Mengwei Xu, Dongqi Cai, Yaozong Wu, Xiang Li, and Shangguang Wang, Beijing University of Posts and Telecommunications
Security 1
A Secure, Fast, and Resource-Efficient Serverless Platform with Function REWIND
Jaehyun Song, Sungkyunkwan University; Bumsuk Kim, Samsung Electronics; Minwoo Kwak, Yonsei University; Byoungyoung Lee, Seoul National University; Euiseong Seo, Sungkyunkwan University; Jinkyu Jeong, Yonsei University
SimEnc: A High-Performance Similarity-Preserving Encryption Approach for Deduplication of Encrypted Docker Images
Tong Sun and Bowen Jiang, Zhejiang University; Borui Li, Southeast University; Jiamei Lv, Yi Gao, and Wei Dong, Zhejiang University
mmTLS: Scaling the Performance of Encrypted Network Traffic Inspection
Junghan Yoon, Seunghyun Do, and Duckwoo Kim, KAIST; Taejoong Chung, Virginia Tech; KyoungSoo Park, KAIST
2:00 pm–3:40 pm
ML-System Co-Design
Cost-Efficient Machine Learning Input Data Preprocessing
Oto Mraz, Dan-Ovidiu Graur, Muyu Li, and Sepehr Pourghannad, ETH Zurich; Chandramohan A. Thekkath, Google; Ana Klimovic, ETH Zurich
OPER: Optimality-Guided Embedding Table Parallelization for Large-scale Recommendation Model
Zheng Wang, University of California, San Diego; Yuke Wang, Boyuan Feng, and Guyue Huang, University of California, Santa Barbara; Dheevatsa Mudigere and Bharath Muthiah, Meta; Ang Li, Pacific Northwest National Laboratory; Yufei Ding, University of California, San Diego
DeepVisor: Effective Operator Graph Instantiation for Deep Learning by Execution State Monitoring
Chen Zhang, Rongchao Dong, Haojie Wang, Runxin Zhong, Jike Chen, and Jidong Zhai, Tsinghua University
Quant-LLM: Accelerating the Serving of Large Language Models via FP6-Centric Algorithm-System Co-Design on Modern GPUs
Haojun Xia, University of Sydney; Zhen Zheng and Xiaoxia Wu, Microsoft; Shiyang Chen, Rutgers University; Zhewei Yao, Stephen Youn, Arash Bakhtiari, and Michael Wyatt, Microsoft; Donglin Zhuang and Zhongzhu Zhou, University of Sydney; Olatunji Ruwase, Yuxiong He, and Shuaiwen Leon Song, Microsoft
Networks 2
QDSR: Accelerating Layer-7 Load Balancing by Direct Server Return with QUIC
Ziqi Wei, Tsinghua University; Zhiqiang Wang, Tencent; Qing Li, Peng Cheng Laboratory; Yuan Yang, Tsinghua University; Cheng Luo and Fuyu Wang, Tencent; Yong Jiang, Tsinghua Shenzhen International Graduate School; Sijie Yang, Tencent; Zhenhui Yuan, Northumbria University
Evaluating Chiplet-based Large-Scale Interconnection Networks via Cycle-Accurate Packet-Parallel Simulation
Yinxiao Feng and Yuchen Wei, Institute for Interdisciplinary Information Sciences, Tsinghua University; Dong Xiang, School of Software, Tsinghua University; Kaisheng Ma, Institute for Interdisciplinary Information Sciences, Tsinghua University
Config-Snob: Tuning for the Best Configurations of Networking Protocol Stack
Manaf Bin-Yahya, Yifei Zhao, Hossein Shafieirad, and Anthony Ho, Huawei Technologies Canada; Shijun Yin, Fanzhao Wang, and Geng Li, Huawei Technologies China
Conspirator: SmartNIC-Aided Control Plane for Distributed ML Workloads
Yunming Xiao, Northwestern University; Diman Zad Tootaghat, Aditya Dhakal, Lianjie Cao, and Puneet Sharma, Hewlett Packard Labs; Aleksandar Kuzmanovic, Northwestern University
3:40 pm–4:10 pm
Break with Refreshments
4:10 pm–5:25 pm
Memory
Making Memory Management Extensible With Filesystems
Bijan Tabatabai, James Sorenson, and Michael Swift, University of Wisconsin–Madison
Mangosteen: Fast Transparent Durability for Linearizable Applications using NVM
Sergey Egorov, Gregory Chockler, and Brijesh Dongol, University of Surrey; Dan O'Keeffe, Royal Holloway, University of London; Sadegh Keshavarzi, University of Surrey
FlexMem: Adaptive Page Profiling and Migration for Tiered Memory
Dong Xu, University of California, Merced; Junhee Ryu, Jinho Baek, and Kwangsik Shin, SK Hynix; Pengfei Su and Dong Li, University of California, Merced
Reliability
Ammit: Improving Cloud AI Infrastructure Reliability with Proactive Validation
Yifan Xiong, Yuting Jiang, Ziyue Yang, and Lei Qu, Microsoft Research; Guoshuai Zhao, Shuguang Liu, Dong Zhong, Boris Pinzur, Jie Zhang, Yang Wang, Jithin Jose, Hossein Pourreza, Jeff Baxter, Kushal Datta, Prabhat Ram, Luke Melton, and Joe Chau, Microsoft; Peng Cheng, Yongqiang Xiong, and Lidong Zhou, Microsoft Research
Removing Obstacles before Breaking Through the Memory Wall: A Close Look at HBM Errors in the Field
Ronglong Wu, Shuyue Zhou, Jiahao Lu, Zhirong Shen, Yiming Zhang, and Zikang Xu, Xiamen University; Kunlin Yang and Feilong Lin, Huawei Technologies Co., Ltd
MSFRD: Mutation Similarity based SSD Failure Rating and Diagnosis for Complex and Volatile Production Environments
Yuqi Zhang, Tianyi Zhang, Wenwen Hao, Shuyang Wang, Na Liu, and Xing He, Samsung R&D Institute China Xi'an, Samsung Electronics; Yang Zhang, Weixin Wang, Yongguang Cheng, Huan Wang, Jie Xu, Feng Wang, and Bo Jiang, ByteDance Inc.; Yongwong Gwon, Jongsung Na, Zoe Kim, and Geunrok Oh, Samsung Electronics
9:00 am–10:15 am
Deployed Systems
Diagnosing Application-network Anomalies for Millions of IPs in Production Clouds
Zhe Wang, Shanghai Jiao Tong University, China; Huanwu Hu, Alibaba Group, China; Linghe Kong, Shanghai Jiao Tong University, China; Xinlei Kang and Teng Ma, Alibaba Group, China; Qiao Xiang, Xiamen University, China; Jingxuan Li and Yang Lu, Alibaba Group, China; Zhuo Song, Alibaba Group and Shanghai Jiao Tong University, China; Peihao Yang, Alibaba Group, China; Jiejian Wu, Shanghai Jiao Tong University, China; Yong Yang and Tao Ma, Alibaba Group, China; Zheng Liu, Alibaba Group and Zhejiang University, China; Xianlong Zeng and Dennis Cai, Alibaba Group, China; Guihai Chen, Shanghai Jiao Tong University, China
Data Caching for Enterprise-Grade Petabyte-Scale OLAP
Chunxu Tang and Bin Fan, Alluxio; Jing Zhao and Chen Liang, Uber; Hope Wang and Beinan Wang, Alluxio; Ziyue Qiu, Carnegie Mellon University; Lu Qiu, Bowen Ding, Shouzhuo Sun, Saiguang Che, Jiaming Mai, Shouwei Chen, Yu Zhu, and Jianjian Xie, Alluxio; Yutian Sun, Meta; Yao Li and Yangjun Zhang, Uber; Ke Wang, Meta
Full Lifecycle Data Analysis on a Large-scale and Leadership Supercomputer: What Can We Learn from It?
Bin Yang, Tsinghua University, National Supercomputer Center in Wuxi; Hao Wei, Tsinghua University; Wenhao Zhu, Shandong University, National Supercomputer Center in Wuxi; Yuhao Zhang, Tsinghua University; Weiguo Liu, Shandong University; Wei Xue, Tsinghua University
Wide Area Network
Panorama: Optimizing Internet-scale Users’ Routes from End to End
Geng Li, Shuihai Hu, and Kun Tan, Huawei
Enhancing Resource Management of the World's Largest PCDN System for On-Demand Video Streaming
Rui-Xiao Zhang, University of Illinois Urbana-Champaign; Haiping Wang, Shu Shi, Xiaofei Pang, Yajie Peng, and Zhichen Xue, ByteDance; Jiangchuan Liu, Simon Fraser University
TileClipper: Lightweight Selection of Regions of Interest from Videos for Traffic Surveillance
Shubham Chaudhary and Aryan Taneja, IIIT Delhi, India; Anjali Singh, IGDTUW Delhi, India; Purbasha Roy, Sohum Sikdar, Mukulika Maity, and Arani Bhattacharya, IIIT Delhi, India
10:15 am–10:50 am
Break with Refreshments
10:50 am–12:05 pm
Virtualization
Expeditious High-Concurrency MicroVM SnapStart in Persistent Memory with an Augmented Hypervisor
Xingguo Pang, Yanze Zhang, Liu Liu, and Xiaobo Zhou, University of Macau; Dazhao Cheng, Wuhan University; Chengzhong Xu, University of Macau
Taming Hot Bloat Under Virtualization with HugeScope
Chuandong Li, Peking University; Sai Sha, Beijing Huawei Digital Technologies; Diyu Zhou, École Polytechnique Fédérale de Lausanne (EPFL); Yangqing Zeng, Xiran Yang, Yingwei Luo, and Xiaolin Wang, Peking University; Zhenlin Wang, Michigan Tech
CrossMapping: Harmonizing Memory Consistency in Cross-ISA Binary Translation
Chen Gao and Xiangwei Meng, Lanzhou University; Wei Li, Tsinghua University; Jinhui Lai, Lanzhou University; Yiran Zhang, Beijing University of Posts and Telecommunications; Fengyuan Ren, Lanzhou University and Tsinghua University
Security 2
Efficient Decentralized Federated Singular Vector Decomposition
Di Chai, Junxue Zhang, and Liu Yang, Hong Kong University of Science and Technology; Yilun Jin, The Hong Kong University of Science and Technology; Leye Wang, Peking University; Kai Chen and Qiang Yang, Hong Kong University of Science and Technology
Models on the Move: Towards Feasible Embedded AI for Intrusion Detection on Vehicular CAN Bus
He Xu, Di Wu, and Yufeng Lu, Hunan University; Haibo Zeng, Virginia Tech; Jiwu Lu, Hunan University
Flexible, Secure and Efficient CVM Maintenance with Confidential Procedure Calls
Jiahao Chen, Zeyu Mi, Yubin Xia, Haibing Guan, and Haibo Chen, Shanghai Jiao Tong University
12:05 pm–1:40 pm
Lunch (on your own)
1:40 pm–3:20 pm
Storage 2
RL-Watchdog: A Fast and Predictable SSD Liveness Watchdog on Storage Systems
Jinyong Ha, Seoul National University; Sangjin Lee, Chung-Ang University; Heon Young Yeom, Seoul National University; Yongseok Son, Chung-Ang University
Exploit both SMART Attributes and NAND Flash Wear Characteristics to Effectively Forecast SSD-based Storage Failures in Clusters
Yunfei Gu and Chentao Wu, Shanghai Jiao Tong University; Xubin He, Temple University
StreamCache: Revisiting Page Cache for File Scanning on Fast Storage Devices
Zhiyue Li and Guangyan Zhang, Tsinghua University
Scalable Billion-point Approximate Nearest Neighbor Search Using SmartSSDs
Bing Tian, Haikun Liu, Zhuohui Duan, Xiaofei Liao, and Hai Jin, Huazhong University of Science and Technology; Yu Zhang, Service Computing Technology and System Lab, Huazhong University of Science and Technology
Hardware
gVulkan: Scalable GPU Pooling for Pixel-Grained Rendering in Ray Tracing
Yicheng Gu, Yun Wang, Yunfan Sun, Yuxin Xiang, Yufan Jiang, Xuyan Hu, Zhengwei Qi, and Haibing Guan, Shanghai Jiao Tong University
vFPIO: A Virtual I/O Abstraction for FPGA-accelerated I/O Devices
Jiyang Chen, Harshavardhan Unnibhavi, Atsushi Koshiba, and Pramod Bhatotia, TU Munich
ScalaCache: Scalable User-Space Page Cache Management with Software-Hardware Coordination
Li Peng and Yuda An, Peking University; You Zhou, Huazhong University of Science and Technology; Chenxi Wang, Chinese Academy of Sciences; Qiao Li, Xiamen University; Cheng Chuanning, Huawei; Jie Zhang, Peking University
Centimani: Enabling Fast AI Accelerator Selection for DNN Training with a Novel Performance Predictor
Zhen Xie, Binghamton University; Murali Emani, Argonne National Laboratory; Xiaodong Yu, Stevens Institute of Technology; Dingwen Tao, Indiana University; Xin He, Guangzhou Institute of Technology, Xidian University; Pengfei Su, University of California, Merced; Keren Zhou, George Mason University; Venkatram Vishwanath, Argonne National Laboratory
3:20 pm–3:40 pm
Break with Refreshments
3:40 pm–5:10 pm
Potpourri
A Difference World: High-performance, NVM-invariant, Software-only Intermittent Computation
Harrison Williams, Saim Ahmad, and Matthew Hicks, Virginia Tech
Efficient Large Graph Processing with Chunk-Based Graph Representation Model
Rui Wang, Weixu Zong, Shuibing He, Xinyu Chen, Zhenxin Li, and Zheng Dang, Zhejiang University
SlimArchive: A Lightweight Architecture for Ethereum Archive Nodes
Hang Feng, Yufeng Hu, and Yinghan Kou, Zhejiang University; Runhuai Li and Jianfeng Zhu, BlockSec; Lei Wu and Yajin Zhou, Zhejiang University
Every Mapping Counts in Large Amounts: Folio Accounting
David Hildenbrand and Martin Schulz, Technical University of Munich; Nadav Amit, Technion, Israel Institute of Technology
5:10 pm–5:20 pm
Closing Remarks
Saurabh Bagchi, Purdue University; Yiying Zhang, University of California, San Diego