Arceus: Reducing Both Dynamic and Static Energy in Large Model Training

Ruofan Wu, Jae-Won Chung, and Mosharaf Chowdhury, University of Michigan