Hey everyone! I’ve been doing some testing on a prototype I’m building with gpt-4o, and I’m a little puzzled. In some of the model’s responses, it’s doing some pretty impressive number crunching (for an LLM). E.g., it correctly calculated e^(-1/3) to five decimal places. I’d love an explanation for how it’s doing this. Here are my best hypotheses right now:
1. The model has somehow memorized a whole bunch of common expressions like the one I’ve shared,
2. more advanced arithmetic capabilities have somehow been “baked” into the latest version of gpt-4o (unclear how), or
3. the model is making a secret tool call, e.g. to an environment that can execute Python, to perform the calculation.
Does anyone have insight into which of these is the most likely—and more generally, how the model is doing this?
I’m asking partly because, if (3) is true, is it even necessary for developers to implement their own execute-Python tool anymore (the usual workaround for LLMs’ generally poor arithmetic abilities)?
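For context on what I mean by an execute-Python tool: a minimal sketch of the workaround, assuming you register a function via the `tools` parameter and evaluate the expression yourself on the client side. The tool name `evaluate_expression` and its schema are just illustrative, not anything official; and rather than a raw `eval()`, this uses a restricted AST walk so the model can only request arithmetic:

```python
import ast
import math
import operator

# Hypothetical tool schema a developer might pass in the "tools" array
# of a chat completions request (names here are illustrative).
CALCULATOR_TOOL = {
    "type": "function",
    "function": {
        "name": "evaluate_expression",
        "description": "Evaluate an arithmetic expression and return the result.",
        "parameters": {
            "type": "object",
            "properties": {
                "expression": {"type": "string", "description": "e.g. 'exp(-1/3)'"},
            },
            "required": ["expression"],
        },
    },
}

# Restricted evaluator: only these operators and math functions are allowed,
# so a tool call can't execute arbitrary code.
_OPS = {
    ast.Add: operator.add, ast.Sub: operator.sub,
    ast.Mult: operator.mul, ast.Div: operator.truediv,
    ast.Pow: operator.pow, ast.USub: operator.neg,
}
_FUNCS = {"exp": math.exp, "log": math.log, "sqrt": math.sqrt}

def evaluate_expression(expression: str) -> float:
    """Safely evaluate a whitelisted arithmetic expression string."""
    def _eval(node):
        if isinstance(node, ast.Expression):
            return _eval(node.body)
        if isinstance(node, ast.Constant) and isinstance(node.value, (int, float)):
            return node.value
        if isinstance(node, ast.BinOp) and type(node.op) in _OPS:
            return _OPS[type(node.op)](_eval(node.left), _eval(node.right))
        if isinstance(node, ast.UnaryOp) and type(node.op) in _OPS:
            return _OPS[type(node.op)](_eval(node.operand))
        if (isinstance(node, ast.Call) and isinstance(node.func, ast.Name)
                and node.func.id in _FUNCS):
            return _FUNCS[node.func.id](*[_eval(a) for a in node.args])
        raise ValueError(f"Disallowed syntax: {ast.dump(node)}")
    return _eval(ast.parse(expression, mode="eval"))

# The expression from my original post, formatted to five decimal places:
print(f"{evaluate_expression('exp(-1/3)'):.5f}")
```

When the model emits a `tool_calls` entry for this function, you’d run `evaluate_expression` on the argument and send the result back as a tool message. If (3) were true, all of this plumbing would be redundant.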
I recall when GPT-3 was first launched. Back then, I reported a few incorrect answers—mostly calculation errors—to OpenAI, and they responded impressively fast, within about 10 minutes, and fixed the issues. This makes me suspect there’s an efficient method to quickly update or hardcode corrections into the system’s knowledge base, possibly through something like a vector store.
The system also appears capable of leveraging a Python interpreter to handle calculations, which likely helps ensure accuracy in mathematical operations.
I’m definitely aware of this in the ChatGPT product! The Python interpreter appears to be just one of several tools it has, along with web search, image generation, etc. What I’m wondering is whether the Python interpreter has somehow also been slipped into the API (a quick note: I’m referring to the raw API here, not the Assistants API). That would be very surprising to me, because as an API consumer, I like to think I have full control over the tools parameter, the system message, etc., so it would be weird if OpenAI appended a new element to the tools array without telling me (and without charging for those tokens). And yet, the math abilities are hard to explain otherwise. Definitely curious for any other insights here!
Thinking some more, I feel like the answer has to be (1). The latest release of gpt-4o is probably just trained/fine-tuned on a healthy-sized dataset of math questions.
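One way to test hypothesis (1) vs. (2) is that memorized constants like e^(-1/3) shouldn’t generalize to arbitrary inputs. A quick probe, as a sketch: generate random expressions, compute reference answers locally, and compare them against the model’s replies by hand. No API call is made here; the prompts and reference values are just printed. The prompt wording and the 1.5–9.5 range are arbitrary choices of mine:

```python
import math
import random

random.seed(0)  # reproducible probe set

def make_probe() -> tuple[str, str]:
    """Return a (prompt, reference_answer) pair for a random expression."""
    a = round(random.uniform(1.5, 9.5), 3)
    b = round(random.uniform(1.5, 9.5), 3)
    prompt = f"Compute {a} * ln({b}) to five decimal places."
    reference = f"{a * math.log(b):.5f}"  # local ground truth via math.log
    return prompt, reference

# Print a few probes; paste the prompts into the model and compare its
# answers against the reference values.
for _ in range(3):
    prompt, reference = make_probe()
    print(prompt, "->", reference)
```

If the model nails random expressions like these to five decimal places, memorization alone seems unlikely and (2) or (3) looks more plausible; if it only gets “famous” constants right, that supports (1).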