Train Long, Think Short: Curriculum Learning for Efficient Reasoning

Aug 11, 2025

Hassan Hammoud

Khalid AlHamoud

Abed Hammoud

Marzyeh Ghassemi

Bernard Ghanem

Abstract

We study curriculum learning strategies that train large reasoning models with long chains of thought but encourage short, efficient inference. We characterize when reducing test-time compute preserves accuracy, and propose a training schedule that closes the gap between long-train / short-test and long-train / long-test regimes across mathematical reasoning, multi-hop QA, and code generation benchmarks.

Publication

Under review (arXiv:2508.08940)

Last updated on Aug 11, 2025

Authors

Abed Hammoud

Postdoctoral Research Associate
