USENIX | The Advanced Computing Systems Association

{Ensō}: A Streaming Interface for {NIC-Application} Communication

Sadok H, Atre N, Zhao Z, Berger DS, Hoe JC, Panda A, Sherry J, Wang R. 2023. {Ensō}: A Streaming Interface for {NIC-Application} Communication. 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI 23). :1005--1025.

Read more about {Ensō}: A Streaming Interface for {NIC-Application} Communication
DBLP
Log in to post comments
Google Scholar
BibTeX

Characterizing Off-path {SmartNIC} for Accelerating Distributed Systems

Wei X, Cheng R, Yang Y, Chen R, Chen H. 2023. Characterizing Off-path {SmartNIC} for Accelerating Distributed Systems. 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI 23). :987--1004.

Read more about Characterizing Off-path {SmartNIC} for Accelerating Distributed Systems
DBLP
Log in to post comments
Google Scholar
BibTeX

{ServiceRouter}: Hyperscale and Minimal Cost Service Mesh at Meta

Saokar H, Demetriou S, Magerko N, Kontorovich M, Kirstein J, Leibold M, Skarlatos D, Khandelwal H, Tang C. 2023. {ServiceRouter}: Hyperscale and Minimal Cost Service Mesh at Meta. 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI 23). :969--985.

Read more about {ServiceRouter}: Hyperscale and Minimal Cost Service Mesh at Meta
DBLP
Log in to post comments
Google Scholar
BibTeX

{ShRing}: Networking with Shared Receive Rings

Pismenny B, Morrison A, Tsafrir D. 2023. {ShRing}: Networking with Shared Receive Rings. 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI 23). :949--968.

Read more about {ShRing}: Networking with Shared Receive Rings
DBLP
Log in to post comments
Google Scholar
BibTeX

{BWoS}: Formally Verified Block-based Work Stealing for Parallel Processing

Wang J, Trach B, Fu M, Behrens D, Schwender J, Liu Y, Lei J, Vafeiadis V, Härtig H, Chen H. 2023. {BWoS}: Formally Verified Block-based Work Stealing for Parallel Processing. 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI 23). :833--850.

{MGG}: Accelerating Graph Neural Networks with {Fine-Grained} {Intra-Kernel} {Communication-Computation} Pipelining on {Multi-GPU} Platforms

Wang Y, Feng B, Wang Z, Geng T, Barker K, Li A, Ding Y. 2023. {MGG}: Accelerating Graph Neural Networks with {Fine-Grained} {Intra-Kernel} {Communication-Computation} Pipelining on {Multi-GPU} Platforms. 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI 23). :779--795.

{EINNET}: Optimizing Tensor Programs with {Derivation-Based} Transformations

Zheng L, Wang H, Zhai J, Hu M, Ma Z, Wang T, Huang S, Miao X, Tang S, Huang K et al.. 2023. {EINNET}: Optimizing Tensor Programs with {Derivation-Based} Transformations. 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI 23). :739--755.

Effectively Scheduling Computational Graphs of Deep Neural Networks toward Their {Domain-Specific} Accelerators

Zhao J, Feng S, Dan X, Liu F, Wang C, Yuan S, Lv W, Xie Q. 2023. Effectively Scheduling Computational Graphs of Deep Neural Networks toward Their {Domain-Specific} Accelerators. 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI 23). :719--737.

Welder: Scheduling Deep Learning Memory Access via Tile-graph

Shi Y, Yang Z, Xue J, Ma L, Xia Y, Miao Z, Guo Y, Yang F, Zhou L. 2023. Welder: Scheduling Deep Learning Memory Access via Tile-graph. 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI 23). :701--718.

Read more about Welder: Scheduling Deep Learning Memory Access via Tile-graph
DBLP
Log in to post comments
Google Scholar
BibTeX

Cocktailer: Analyzing and Optimizing Dynamic Control Flow in Deep Learning

Zhang C, Ma L, Xue J, Shi Y, Miao Z, Yang F, Zhai J, Yang Z, Yang M. 2023. Cocktailer: Analyzing and Optimizing Dynamic Control Flow in Deep Learning. 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI 23). :681--699.

Pages