Chen explains: "The model is learning to think for itself, rather than trying to imitate the way humans would think. It’s the first time we’ve seen this level of self-reasoning in an LLM. The model sharpens its thinking and fine-tunes the strategies it uses to get to the answer."