Cocktailer: Analyzing and Optimizing Dynamic Control Flow in Deep Learning

Chen Zhang; Lingxiao Ma; Jilong Xue; Yining Shi; Ziming Miao; Fan Yang; Jidong Zhai; Zhi Yang; Mao Yang

Authors:

Chen Zhang, Tsinghua University; Lingxiao Ma and Jilong Xue, Microsoft Research; Yining Shi, Peking University & Microsoft Research; Ziming Miao and Fan Yang, Microsoft Research; Jidong Zhai, Tsinghua University; Zhi Yang, Peking University; Mao Yang, Microsoft Research

Abstract:

With the growing complexity of deep neural networks (DNNs), developing DNN programs with intricate control flow logic (e.g., loops, branches, and recursion) has become increasingly essential. However, executing such DNN programs efficiently on accelerators is challenging. Current DNN frameworks typically process control flow on the CPU, while offloading the remaining computations to accelerators like GPUs. This often introduces significant synchronization overhead between CPU and the accelerator, and prevents global optimization across control flow scopes.

To address this challenge, we propose Cocktailer, a new DNN compiler that co-optimizes the execution of control flow and data flow on hardware accelerators. Cocktailer provides the uTask abstraction to unify the representation of DNN models, including both control flow and data flow. This allows Cocktailer to expose a holistic scheduling space for rescheduling control flow to the lower-level hardware parallelism of accelerators. Cocktailer uses a heuristic policy to find efficient schedules and is able to automatically move control flow into kernels of accelerators, enabling optimization across control flow boundaries. Evaluations demonstrate that Cocktailer can accelerate DNN models with control flow by up to 8.2× over the fastest one of the state-of-the-art DNN frameworks and compilers.

Chen Zhang, Tsinghua University

Lingxiao Ma, Microsoft Research

Jilong Xue, Microsoft Research

Yining Shi, Peking University & Microsoft Research

Ziming Miao, Microsoft Research

Fan Yang, Microsoft Research

Jidong Zhai, Tsinghua University

Zhi Yang, Peking University

Mao Yang, Microsoft Research

Open Access Media

USENIX is committed to Open Access to the research presented at our events. Papers and proceedings are freely available to everyone once the event begins. Any video, audio, and/or slides that are posted after the event are also free and open to everyone. Support USENIX and our commitment to Open Access.

BibTeX

@inproceedings {288638,
author = {Chen Zhang and Lingxiao Ma and Jilong Xue and Yining Shi and Ziming Miao and Fan Yang and Jidong Zhai and Zhi Yang and Mao Yang},
title = {Cocktailer: Analyzing and Optimizing Dynamic Control Flow in Deep Learning},
booktitle = {17th USENIX Symposium on Operating Systems Design and Implementation (OSDI 23)},
year = {2023},
isbn = {978-1-939133-34-2},
address = {Boston, MA},
pages = {681--699},
url = {https://www.usenix.org/conference/osdi23/presentation/zhang-chen},
publisher = {USENIX Association},
month = jul
}

Download

Zhang PDF

Cocktailer: Analyzing and Optimizing Dynamic Control Flow in Deep Learning

Open Access Media

Presentation Video