Train Long, Think Short: Curriculum Learning for Efficient Reasoning

Aug 11, 2025·
Hassan Hammoud
,
Khalid AlHamoud
Abed Hammoud
Abed Hammoud
,
Marzyeh Ghassemi
,
Bernard Ghanem
· 0 min read
Abstract
We study curriculum learning strategies that train large reasoning models with long chains of thought but encourage short, efficient inference. We characterize when reducing test-time compute preserves accuracy, and propose a training schedule that closes the gap between long-train / short-test and long-train / long-test regimes across mathematical reasoning, multi-hop QA, and code generation benchmarks.
Type
Publication
Under review (arXiv:2508.08940)