Skyline: A Cloud Centric Internet Monitoring Engine

Shixian Guo, ByteDance; Ziqian Liu, The University of Hong Kong; Yangyang Bai, Yuan Chen, Kefei Liu, Qi Zhang, Songlin Liu, Yang Lv, Jianwei Hu, Gen Li, Zhenyang Zhong, Sisi Wen, Yongbin Dong, Feng Luo, Anjian Chen, Rui Han, Jiale Feng, Lingpei Meng, Siwan Chen, Hang Li, Shuai Xu, Juntao Zhong, and Chaoran Hu, ByteDance; Yibo Huang, University of Michigan; Yiming Qiu, The University of Hong Kong

Cloud providers depend on the public Internet to connect tenants and their clients, yet Internet faults are a leading cause of cloud outages: in our organization, more than 60% of network incidents happen in the Internet and account for close to 80% of user-impacting events. Effectively monitoring the Internet is challenging for cloud providers because they lack direct control over, and visibility into, Internet internals. Our key insight is to treat coverage as a first-class goal and decompose the monitoring requirements into three coverage dimensions (traffic direction, incident lifecycle, and tenant granularity), then resolve each independently. We present Skyline, a cloud-centric Internet monitoring system that addresses all coverage dimensions at scale by combining purpose-built dataplane hooks with lightweight software control to minimize resource overhead, shorten reaction time, and preserve non-intrusiveness. Skyline has been deployed for more than two years. In 2025, it identified over 2,000 incidents with very high precision and recall over confirmed issues, thereby significantly improving the reliability of our cloud network.

Category: Operational Systems Paper

NSDI '26 Open Access Sponsored by King Abdullah University of Science and Technology (KAUST)

BibTeX
@inproceedings {316616,
author = {Shixian Guo and Ziqian Liu and Yangyang Bai and Yuan Chen and Kefei Liu and Qi Zhang and Songlin Liu and Yang Lv and Jianwei Hu and Gen Li and Zhenyang Zhong and Sisi Wen and Yongbin Dong and Feng Luo and Anjian Chen and Rui Han and Jiale Feng and Lingpei Meng and Siwan Chen and Hang Li and Shuai Xu and Juntao Zhong and Chaoran Hu and Yibo Huang and Yiming Qiu},
title = {Skyline: A Cloud Centric Internet Monitoring Engine},
booktitle = {23rd USENIX Symposium on Networked Systems Design and Implementation (NSDI 26)},
year = {2026},
isbn = {978-1-939133-54-0},
address = {Renton, WA},
pages = {685--699},
url = {https://www.usenix.org/conference/nsdi26/presentation/guo-shixian},
publisher = {USENIX Association},
month = may
}
