You make an astute observation about the present day: suppressing the trained behavior of new models, which produce their own chain of thought and reason through steps to arrive at an answer, will degrade the quality of that answer.
That is the counterpoint to where this thread started 1.5 years ago, when the technique for getting a better answer was to tell an AI language model explicitly that it must follow a procedure of steps, so that observing its own growing context window would raise the quality of the final answer.
Likewise, expecting an AI model to simply follow such spelled-out steps while generating a response is very April 2023; with new models you are more likely to hit a wall of disobedience, with the procedural instructions ignored.
Newer training is also enhanced by the consistent use and understanding of LaTeX for generating formulae, the presence of which inspires closer attention to mathematical technique.
In a competition for bragging rights, the ability to produce a zero-shot answer to a middle-school problem is important.
gpt-4o-mini takes on the first question posed (in English):
To solve the expression $4 \times 3 + \frac{5}{3} \times 2$, we follow the order of operations (PEMDAS/BODMAS):
- Multiplication and Division from left to right:
  - $4 \times 3 = 12$
  - $\frac{5}{3} \times 2 = \frac{10}{3}$
- Addition:
  To add $12$ and $\frac{10}{3}$, convert $12$ to a fraction:
  $12 = \frac{36}{3}$
  Now we can add:
  $\frac{36}{3} + \frac{10}{3} = \frac{46}{3}$
Thus, the final answer is:
$\frac{46}{3} \approx 15.33$
The only magical leap is the division used to convert the fraction into a numeric answer.
It is 45/3 + 1/3, or 15 and a third, if you want to see the answer immediately.
Try to suppress the working it needs, along with its attempt to tutor you, and ChatGPT will reach for an available tool: the Python code interpreter, which serves as a calculator.
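For illustration only, here is roughly what that calculator step could look like. This is my own minimal sketch using the standard library's `fractions` module, not the code ChatGPT actually writes in its interpreter, and the variable name `result` is just a placeholder.

```python
from fractions import Fraction

# Evaluate 4 * 3 + (5/3) * 2 with exact rational arithmetic,
# so the intermediate value 10/3 is never rounded.
result = 4 * 3 + Fraction(5, 3) * 2

print(result)                   # exact form: 46/3
print(round(float(result), 2))  # decimal form: 15.33
```

Keeping the value as a `Fraction` until the last line mirrors the worked answer above: the exact 46/3 comes first, and the division to a decimal happens only at the end.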