GroupMind: Accelerating Synchronous LLM Reinforcement Learning with Group-Aware Context Learning

Ruoyu Qin, Tsinghua University & Moonshot AI; Weiran He, Weixiao Huang, Yangkun Zhang, Yikai Zhao, Bo Pang, and Xinran Xu, Moonshot AI; Yingdi Shan, Yongwei Wu, and Mingxing Zhang, Tsinghua University