USENIX ATC '24 Technical Sessions

Display:

Wednesday, July 10

8:00 am–9:00 am

Continental Breakfast

9:00 am–10:00 am

USENIX ATC '24 and OSDI '24 Joint Keynote Address

Scaling AI Sustainably: An Uncharted Territory

Carole-Jean Wu, Meta

The past 50 years has seen a dramatic increase in the amount of compute per person, in particular, those enabled by AI. Despite the positive societal benefits, AI technologies come with significant environmental implications. I will talk about the scaling trend and the operational carbon footprint of AI computing by examining the model development cycle, spanning data, algorithms, and system hardware. At the same time, we will consider the life cycle of system hardware from the perspective of hardware architectures and manufacturing technologies. I will highlight key efficiency optimization opportunities for cutting-edge AI technologies, from deep learning recommendation models to multi-modal generative AI tasks. To scale AI sustainably, we need to make AI and computing more broadly efficient and flexible. We must also go beyond efficiency and optimize across the life cycle of computing infrastructures, from hardware manufacturing to datacenter operation and end-of-life processing for the hardware. Based on the industry experience and lessons learned, my talk will conclude with important development and research directions to advance the field of computing in an environmentally responsible and sustainable manner.

Carole-Jean Wu, Meta

Carole-Jean Wu is a Director at Meta. She is a founding member and a Vice President of MLCommons—a non-profit organization that aims to accelerate machine learning for the benefit of all. Dr. Wu also serves on the MLCommons Board as a Director, chaired the MLPerf Recommendation Benchmark Advisory Board, and co-chaired for MLPerf Inference. Prior to Meta/Facebook, She was a tenured professor at ASU. She earned her M.A. and Ph.D. from Princeton and B.Sc. from Cornell.

Dr. Wu's expertise sits at the intersection of computer architecture and machine learning. Her work spans across datacenter infrastructures and edge systems, such as developing energy- and memory-efficient systems and microarchitectures, optimizing systems for machine learning execution at-scale, and designing learning-based approaches for system design and optimization. Dr. Wu's work has been recognized with several awards, including IEEE Micro Top Picks and ACM/IEEE Best Paper Awards. She was the Program Co-Chair of the Conference on Machine Learning and Systems (MLSys) in 2022, the Program Chair of the IEEE International Symposium on Workload Characterization (IISWC) in 2018, and the Editor for the IEEE MICRO Special Issue on Environmentally Sustainable Computing. She currently serves on the ACM SIGARCH/SIGMICRO CARES committee.

10:00 am–10:30 am

Break with Refreshments

10:30 am–10:45 am

Opening Remarks, Awards, and Presentation of the 2024 USENIX Lifetime Achievement (Flame) Award

Program Co-Chairs: Saurabh Bagchi, Purdue University; Yiying Zhang, University of California, San Diego

10:45 am–12:25 pm

Track 1

Cloud Computing

Track 2

ML Inference

Power-aware Deep Learning Model Serving with μ-Serve

Haoran Qiu, Weichao Mao, Archit Patke, and Shengkun Cui, University of Illinois Urbana-Champaign; Saurabh Jha, Chen Wang, and Hubertus Franke, IBM Research; Zbigniew Kalbarczyk, Tamer Başar, and Ravishankar K. Iyer, University of Illinois Urbana-Champaign

12:25 pm–2:00 pm

Conference Luncheon

2:00 pm–3:40 pm

Track 1

Storage 1

ZMS: Zone Abstracton for Mobile Flash Storage

Joo-Young Hwang, Seokhwan Kim, Daejun Park, Yong-Gil Song, Junyoung Han, Seunghyun Choi, and Sangyeun Cho, Samsung Electronics; Youjip Won, Korea Advanced Institute of Science and Technology (KAIST)

Track 2

Networks 1

OSMOSIS: Enabling Multi-Tenancy in Datacenter SmartNICs

Mikhail Khalilov, Marcin Andrzej Chrapek, Siyuan Shen, Thomas Benz, Alessandro Vezzu, Salvatore Di Girolamo, and Timo Schneider, ETH Zurich Daniele De Sensi, Sapienza University of Rome; Luca Benini and Torsten Hoefler, ETH Zurich

3:40 pm–4:10 pm

Break with Refreshments

4:10 pm–5:55 pm

Track 1

Edge Computing

Track 2

Operating Systems 1

Fast (Trapless) Kernel Probes Everywhere

Jinghao Jia, University of Illinois Urbana-Champaign; Michael Le, IBM Research; Salman Ahmed, IBM Research, Yorktown Heights; Dan Williams, Virginia Tech; Hani Jamjoom, IBM; Tianyin Xu, University of Illinois at Urbana-Champaign

HydraRPC: RPC in the CXL Era

Teng Ma, Alibaba Group; Zheng Liu, Zhejiang University and Alibaba Group; Chengkun Wei, Zhejiang University; Jialiang Huang, Tsinghua University; Youwei Zhuo, Alibaba Group; Haoyu Li, Zhejiang University; Ning Zhang, Yijin Guan, and Dimin Niu, Alibaba Group; Mingxing Zhang, Tsinghua University; Tao Ma, Alibaba Group

6:00 pm–7:30 pm

OSDI '24 Poster Session and Reception

Thursday, July 11

8:00 am–9:00 am

Continental Breakfast

9:00 am–10:40 am

Track 1

Operating Systems 2

Track 2

Correctness

10:40 am–11:10 am

Break with Refreshments

11:10 am–12:25 pm

Track 1

ML Training

Track 2

Security 1

12:25 pm–2:00 pm

Conference Luncheon

2:00 pm–3:40 pm

Track 1

ML-System Co-Design

Track 2

Networks 2

3:40 pm–4:10 pm

Break with Refreshments

4:10 pm–5:25 pm

Track 1

Memory

Track 2

Reliability

Ammit: Improving Cloud AI Infrastructure Reliability with Proactive Validation

Yifan Xiong, Yuting Jiang, Ziyue Yang, and Lei Qu, Microsoft Research; Guoshuai Zhao, Shuguang Liu, Dong Zhong, Boris Pinzur, Jie Zhang, Yang Wang, Jithin Jose, Hossein Pourreza, Jeff Baxter, Kushal Datta, Prabhat Ram, Luke Melton, and Joe Chau, Microsoft; Peng Cheng, Yongqiang Xiong, and Lidong Zhou, Microsoft Research

6:00 pm–7:30 pm

USENIX ATC '24 Poster Session and Reception

Friday, July 12

8:00 am–9:00 am

Continental Breakfast

9:00 am–10:15 am

Track 1

Deployed Systems

Diagnosing Application-network Anomalies for Millions of IPs in Production Clouds

Zhe Wang, Shanghai Jiao Tong University, China; Huanwu Hu, Alibaba Group, China; Linghe Kong, Shanghai Jiao Tong University, China; Xinlei Kang and Teng Ma, Alibaba Group, China; Qiao Xiang, Xiamen University, China; Jingxuan Li and Yang Lu, Alibaba Group, China; Zhuo Song, Alibaba Group and Shanghai Jiao Tong University, China; Peihao Yang, Alibaba Group, China; Jiejian Wu, Shanghai Jiao Tong University, China; Yong Yang and Tao Ma, Alibaba Group, China; Zheng Liu, Alibaba Group and Zhejiang University, China; Xianlong Zeng and Dennis Cai, Alibaba Group, China; Guihai Chen, Shanghai Jiao Tong University, China

Data Caching for Enterprise-Grade Petabyte-Scale OLAP

Chunxu Tang and Bin Fan, Alluxio; Jing Zhao and Chen Liang, Uber; Hope Wang and Beinan Wang, Alluxio; Ziyue Qiu, Carnegie Mellon University; Lu Qiu, Bowen Ding, Shouzhuo Sun, Saiguang Che, Jiaming Mai, Shouwei Chen, Yu Zhu, and Jianjian Xie, Alluxio; Yutian Sun, Meta; Yao Li and Yangjun Zhang, Uber; Ke Wang, Meta

Track 2

Wide Area Network

10:15 am–10:50 am

Break with Refreshments

10:50 am–12:05 pm

Track 1

Virtualization

Taming Hot Bloat Under Virtualization with HugeScope

Chuandong Li, Peking University; Sai Sha, Beijing Huawei Digital Technologies; Diyu Zhou, École Polytechnique Fédérale de Lausanne (EPFL); Yangqing Zeng, Xiran Yang, Yingwei Luo, and Xiaolin Wang, Peking University; Zhenlin Wang, Michigan Tech

Track 2

Security 2

Efficient Decentralized Federated Singular Vector Decomposition

Di Chai, Junxue Zhang, and Liu Yang, Hong Kong University of Science and Technology; Yilun Jin, The Hong Kong University of Science and Technology; Leye Wang, Peking University; Kai Chen and Qiang Yang, Hong Kong University of Science and Technology

12:05 pm–1:40 pm

Lunch (on your own)

1:40 pm–3:20 pm

Track 1

Storage 2

Track 2

Hardware

3:20 pm–3:40 pm

Break with Refreshments

3:40 pm–5:10 pm

Potpourri

5:10 pm–5:20 pm

Closing Remarks

Program Co-Chairs: Saurabh Bagchi, Purdue University; Yiying Zhang, University of California, San Diego