Biblio

Export 2 results:
Filters: Author is Jianxi Ye  [Clear All Filters]
2024
Jiang Z, Lin H, Zhong Y, Huang Q, Chen Y, Zhang Z, Peng Y, Li X, Xie C, Nong S et al..  2024.  MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs. 21st USENIX Symposium on Networked Systems Design and Implementation (NSDI 24). :745--760.
2022
Kong X, Zhu Y, Zhou H, Jiang Z, Ye J, Guo C, Zhuo D.  2022.  Collie: Finding Performance Anomalies in RDMA Subsystems. 19th USENIX Symposium on Networked Systems Design and Implementation (NSDI 22). :287--305.