Breaking the Reward Barrier: Accelerating Tree-of-Thought Reasoning via Speculative Exploration

Shuzhang Zhong, Haochen Huang, and Shengxuan Qiu, Peking University; Pengfei Zuo, Huawei; Runsheng Wang and Meng Li, Peking University