Revisiting Pipeline Parallelism for LLM Serving

Soonjae Hwang and Jeongseob Ahn, Korea University