The difference between a conventional model and a reasoning model is similar to the two types of thinking described by the Nobel Prize-winning economist Daniel Kahneman in his 2011 book Thinking, Fast and Slow: fast, instinctive System-1 thinking and slower, more deliberative System-2 thinking.
The type of model that made ChatGPT possible, known as a large language model or LLM, produces instant responses to a prompt by querying a large neural network. These outputs can be strikingly clever and coherent but may fail to answer questions that require step-by-step reasoning, including simple arithmetic.
An LLM can be forced to mimic deliberative reasoning if it is instructed to come up with a plan that it must then follow. This trick is not always reliable, however, and models generally struggle to solve problems that require extensive, careful planning. OpenAI, Google, and now Anthropic all use a machine learning method known as reinforcement learning to get their latest models to learn to generate reasoning that points toward correct answers. This requires gathering additional training data from humans on solving specific problems.
Penn says that Claude’s reasoning mode received additional data on business applications, including writing and fixing code, using computers, and answering complex legal questions. “The things that we made improvements on are … technical subjects or subjects that require long reasoning,” Penn says. “What we have from our customers is a lot of interest in deploying our models into their actual workloads.”
Anthropic says that Claude 3.7 is especially good at solving coding problems that require step-by-step reasoning, outscoring OpenAI’s o1 on certain benchmarks such as SWE-bench. The company is today releasing a new tool, called Claude Code, designed specifically for this type of AI-assisted coding.
“The model is already good at coding,” Penn says. But “additional thinking would be good for cases that might require very complex planning, say you’re looking at an extremely large code base for a company.”