Skip to main navigation Skip to search Skip to main content

Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational Capabilities

  • Weixiang Zhao
  • , Xingyu Sui
  • , Jiahe Guo
  • , Yulin Hu
  • , Yang Deng
  • , Yanyan Zhao*
  • , Xuda Zhi
  • , Yongbo Huang
  • , Hao He
  • , Wanxiang Che
  • , Ting Liu
  • , Bing Qin
  • *Corresponding author for this work
  • Harbin Institute of Technology
  • Singapore Management University
  • SERES

Research output: Contribution to journalConference articlepeer-review

Abstract

Recent advancements in Large Reasoning Models (LRMs), such as OpenAI’s o1/o3 and DeepSeek-R1, have demonstrated remarkable performance in specialized reasoning tasks through human-like deliberative thinking and long chain-of-thought reasoning. However, our systematic evaluation across various model families (DeepSeek, Qwen, and LLaMA) and scales (7B to 32B) reveals that acquiring these deliberative reasoning capabilities significantly reduces the foundational capabilities of LRMs, including notable declines in helpfulness and harmlessness, alongside substantially increased inference costs. Importantly, we demonstrate that adaptive reasoning— employing modes like Zero-Thinking, Less-Thinking, and Summary-Thinking—can effectively alleviate these drawbacks. Our empirical insights underline the critical need for developing more versatile LRMs capable of dynamically allocating inference-time compute according to specific task characteristics.

Original languageEnglish
Pages (from-to)34976-34984
Number of pages9
JournalProceedings of the AAAI Conference on Artificial Intelligence
Volume40
Issue number41
DOIs
StatePublished - 2026
Event40th AAAI Conference on Artificial Intelligence, AAAI 2026 - Singapore, Singapore
Duration: 20 Jan 202627 Jan 2026

Fingerprint

Dive into the research topics of 'Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational Capabilities'. Together they form a unique fingerprint.

Cite this