Stepwise Internalization: From Explicit CoT to Implicit CoT: Learning to Internalize CoT Step by Step

January 2026

tl;dr: A curriculum-learning approach that iteratively absorbs CoT into the language model itself.

Overall impression

Stepwise Internalization is a method for achieving implicit chain-of-thought reasoning by gradually removing intermediate reasoning steps during training. The earliest CoT tokens are removed, and thus absorbed into the model, first.

This work inspired later, more influential work such as Coconut.

Key ideas

The primary difference between implicit CoT and No CoT lies in the use of intermediate reasoning steps as supervision during training.
There is a trade-off between the number of CoT tokens remaining and accuracy. Absorbing the CoT completely into the model sometimes does not work.
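
The gradual removal described above can be illustrated with a minimal sketch of how the training target might change from one curriculum stage to the next. The function name `build_target`, the `tokens_per_stage` parameter, and the placeholder tokens are all hypothetical, and dropping a fixed number of tokens per stage is an assumption made for illustration. The note itself only states that the earliest CoT tokens are removed first.

```python
from typing import List


def build_target(
    cot_tokens: List[str],
    answer_tokens: List[str],
    stage: int,
    tokens_per_stage: int,
) -> List[str]:
    """Return the supervision target for one curriculum stage.

    Assumption: each stage drops a fixed number of additional CoT tokens,
    always from the front of the chain of thought.
    """
    # Never remove more tokens than the CoT contains.
    n_removed = min(stage * tokens_per_stage, len(cot_tokens))
    # The remaining CoT suffix is still supervised, followed by the answer.
    return cot_tokens[n_removed:] + answer_tokens


# Placeholder tokens standing in for a real tokenized example.
cot = ["c1", "c2", "c3", "c4", "c5"]
answer = ["ans"]

for stage in range(4):
    print(stage, build_target(cot, answer, stage, tokens_per_stage=2))
```

Stage 0 corresponds to explicit CoT (the full chain is supervised), and the last stage leaves only the answer as the target, which is the implicit CoT setting. Stopping at an intermediate stage keeps some CoT tokens in the output, which is one way to sit at a chosen point on the trade-off between remaining CoT tokens and accuracy.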
