arxiv:2505.24863

AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

Published on May 30

· Submitted by

RunpeiDong on Jun 2

#2 Paper of the day

Upvote

Authors:

Junyu Zhang ,

Runpei Dong ,

Abstract

AlphaOne dynamically modulates reasoning in large models by introducing $\alpha$ moment and Bernoulli process for slow thinking, improving efficiency and capability across diverse domains.

AI-generated summary

This paper presents AlphaOne (alpha1), a universal framework for modulating reasoning progress in large reasoning models (LRMs) at test time. alpha1 first introduces alpha moment, which represents the scaled thinking phase with a universal parameter alpha. Within this scaled pre-alpha moment phase, it dynamically schedules slow thinking transitions by modeling the insertion of reasoning transition tokens as a Bernoulli stochastic process. After the alpha moment, alpha1 deterministically terminates slow thinking with the end-of-thinking token, thereby fostering fast reasoning and efficient answer generation. This approach unifies and generalizes existing monotonic scaling methods by enabling flexible and dense slow-to-fast reasoning modulation. Extensive empirical studies on various challenging benchmarks across mathematical, coding, and scientific domains demonstrate alpha1's superior reasoning capability and efficiency. Project page: https://alphaone-project.github.io/