Search results
| Title | Conference | Speaker(s) | |
|---|---|---|---|
| Safe Evaluation and Rollout of AI Models | SREcon25 Americas | Brendan Burns | |
| Insights Gained from Delivering Two Generations of AI Supercomputers and Storage Solutions in IBM Cloud | FAST '25 | Dr. Seetharami Seelam | |
| Measuring Availability the Player Focused Way: How Riot Games Changed Its Availability Culture | SREcon25 Americas | Maxfield Stewart | |
| Optimizing RLHF Training for Large Language Models with Stage Fusion | NSDI '25 | Yinmin Zhong, Zili Zhang, Bingyang Wu, Shengyu Liu, Yukun Chen, Changyi Wan, Hanpeng Hu, Lei Xia, Ranchen Ming, Yibo Zhu, Xin Jin | |
| ClubHeap: A High-Speed and Scalable Priority Queue for Programmable Packet Scheduling | NSDI '25 | Zhikang Chen, Haoyu Song, Zhiyu Zhang, Yang Xu, Bin Liu | |
| MeshTest: End-to-End Testing for Service Mesh Traffic Management | NSDI '25 | Naiqian Zheng, Tianshuo Qiao, Xuanzhe Liu, Xin Jin | |
| Self-Clocked Round-Robin Packet Scheduling | NSDI '25 | Erfan Sharafzadeh, Raymond Matson, Jean Tourrilhes, Puneet Sharma, Soudeh Ghorbani | |
| Pushing the Limits of In-Network Caching for Key-Value Stores | NSDI '25 | Gyuyeong Kim | |
| GREEN: Carbon-efficient Resource Scheduling for Machine Learning Clusters | NSDI '25 | Kaiqiang Xu, Decang Sun, Han Tian, Junxue Zhang, Kai Chen | |
| CATO: End-to-End Optimization of ML-Based Traffic Analysis Pipelines | NSDI '25 | Gerry Wan, Shinan Liu, Francesco Bronzino, Nick Feamster, Zakir Durumeric | |
| Rajomon: Decentralized and Coordinated Overload Control for Latency-Sensitive Microservices | NSDI '25 | Jiali Xing, Akis Giannoukos, Paul Loh, Shuyue Wang, Justin Qiu, Henri Maxime Demoulin, Konstantinos Kallas, Benjamin C. Lee | |
| Accelerating Design Space Exploration for LLM Training Systems with Multi-experiment Parallel Simulation | NSDI '25 | Fei Gui, Kaihui Gao, Li Chen, Dan Li, Vincent Liu, Ran Zhang, Hongbing Yang, Dian Xiong | |
| CellReplay: Towards accurate record-and-replay for cellular networks | NSDI '25 | William Sentosa, Balakrishnan Chandrasekaran, P. Brighten Godfrey, Haitham Hassanieh | |
| Beehive: A Scalable Disaggregated Memory Runtime Exploiting Asynchrony of Multithreaded Programs | NSDI '25 | Quanxi Li, Hong Huang, Ying Liu, Yanwen Xia, Jie Zhang, Mosong Zhou, Xiaobing Feng, Huimin Cui, Quan Chen, Yizhou Shan, Chenxi Wang | |
| AutoCCL: Automated Collective Communication Tuning for Accelerating Distributed and Parallel DNN Training | NSDI '25 | Guanbin Xu, Zhihao Le, Yinhe Chen, Zhiqi Lin, Zewen Jin, Youshan Miao, Cheng Li | |
| DISC: Backpressure Mitigation In Multi-tier Applications With Distributed Shared Connection | NSDI '25 | Brice Ekane, Djob Mvondo, Renaud Lachaize, Yérom-David Bromberg, Alain Tchana, Daniel Hagimont | |
| Making Serverless Pay-For-Use a Reality with Leopard | NSDI '25 | Tingjia Cao, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau, Tyler Caraza-Harter | |
| Mitigating Scalability Walls of RDMA-based Container Networks | NSDI '25 | Wei Liu, Kun Qian, Zhenhua Li, Feng Qian, Tianyin Xu, Yunhao Liu, Yu Guan, Shuhong Zhu, Hongfei Xu, Lanlan Xi, Chao Qin, Ennan Zhai | |
| Ladder: A Convergence-based Structured DAG Blockchain for High Throughput and Low Latency | NSDI '25 | Dengcheng Hu, Jianrong Wang, Xiulong Liu, Hao Xu, Xujing Wu, Muhammad Shahzad, Guyue Liu, Keqiu Li | |
| High-level Programming for Application Networks | NSDI '25 | Xiangfeng Zhu, Yuyao Wang, Banruo Liu, Yongtong Wu, Nikola Bojanic, Jingrong Chen, Gilbert Louis Bernstein, Arvind Krishnamurthy, Sam Kumar, Ratul Mahajan, Danyang Zhuo | |
| Holmes: Localizing Irregularities in LLM Training with Mega-scale GPU Clusters | NSDI '25 | Zhiyi Yao, Pengbo Hu, Congcong Miao, Xuya Jia, Zuning Liang, Yuedong Xu, Chunzhi He, Hao Lu, Mingzhuo Chen, Xiang Li, Zekun He, Yachen Wang, Xianneng Zou, Junchen Jiang | |
| Building Massive MIMO Baseband Processing on a Single-Node Supercomputer | NSDI '25 | Xincheng Xie, Wentao Hou, Zerui Guo, Ming Liu | |
| Learning Production-Optimized Congestion Control Selection for Alibaba Cloud CDN | NSDI '25 | Xuan Zeng, Haoran Xu, Chen Chen, Xumiao Zhang, Xiaoxi Zhang, Xu Chen, Guihai Chen, Yubing Qiu, Yiping Zhang, Chong Hao, Ennan Zhai | |
| Understanding and Profiling NVMe-over-TCP Using ntprof | NSDI '25 | Yuyuan Kang, Ming Liu | |
| Tooth: Toward Optimal Balance of Video QoE and Redundancy Cost by Fine-Grained FEC in Cloud Gaming Streaming | NSDI '25 | Congkai An, Huanhuan Zhang, Shibo Wang, Jingyang Kang, Anfu Zhou, Liang Liu, Huadong Ma, Zili Meng, Delei Ma, Yusheng Dong, Xiaogang Lei |