Sadok H, Atre N, Zhao Z, Berger DS, Hoe JC, Panda A, Sherry J, Wang R. 2023. Ensō: A Streaming Interface for NIC-Application Communication. 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI 23). pp. 1005-1025.
Wei X, Cheng R, Yang Y, Chen R, Chen H. 2023. Characterizing Off-path SmartNIC for Accelerating Distributed Systems. 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI 23). pp. 987-1004.
Saokar H, Demetriou S, Magerko N, Kontorovich M, Kirstein J, Leibold M, Skarlatos D, Khandelwal H, Tang C. 2023. ServiceRouter: Hyperscale and Minimal Cost Service Mesh at Meta. 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI 23). pp. 969-985.
Pismenny B, Morrison A, Tsafrir D. 2023. ShRing: Networking with Shared Receive Rings. 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI 23). pp. 949-968.
Wang J, Trach B, Fu M, Behrens D, Schwender J, Liu Y, Lei J, Vafeiadis V, Härtig H, Chen H. 2023. BWoS: Formally Verified Block-based Work Stealing for Parallel Processing. 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI 23). pp. 833-850.
Wang Y, Feng B, Wang Z, Geng T, Barker K, Li A, Ding Y. 2023. MGG: Accelerating Graph Neural Networks with Fine-Grained Intra-Kernel Communication-Computation Pipelining on Multi-GPU Platforms. 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI 23). pp. 779-795.
Zheng L, Wang H, Zhai J, Hu M, Ma Z, Wang T, Huang S, Miao X, Tang S, Huang K et al. 2023. EINNET: Optimizing Tensor Programs with Derivation-Based Transformations. 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI 23). pp. 739-755.
Zhao J, Feng S, Dan X, Liu F, Wang C, Yuan S, Lv W, Xie Q. 2023. Effectively Scheduling Computational Graphs of Deep Neural Networks toward Their Domain-Specific Accelerators. 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI 23). pp. 719-737.
Shi Y, Yang Z, Xue J, Ma L, Xia Y, Miao Z, Guo Y, Yang F, Zhou L. 2023. Welder: Scheduling Deep Learning Memory Access via Tile-graph. 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI 23). pp. 701-718.
Zhang C, Ma L, Xue J, Shi Y, Miao Z, Yang F, Zhai J, Yang Z, Yang M. 2023. Cocktailer: Analyzing and Optimizing Dynamic Control Flow in Deep Learning. 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI 23). pp. 681-699.