Issue with GPT-4o-mini Not Using 2024 Data from CSV in OpenAI CSV Agent

atharvamandarphatak · March 3, 2025, 1:27pm

Context:

We are using OpenAI’s CSV agent with GPT-4o-mini to answer questions based on a CSV file that contains data up to 2024. The agent performs well for queries related to 2023 and earlier, but when asked about 2024 data, it often responds:

“I’m trained up to October 2023.”

What We Tried:

Explicitly Mentioning the Data Coverage

We added instructions like:

“You have been provided with data up to 2024. Use only the CSV file to answer questions.”

Result: The model keeps “thinking” indefinitely and does not respond.

Providing 2024 Data Separately

We structured the CSV so that 2024 data was separated and added it explicitly in the prompt.
Result: The model does answer questions, but only about 2024 and only if the query explicitly references it. It does not reason across years or behave like an agent.
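
For reference, a minimal sketch of how the coverage instruction from the first attempt could be generated from the file itself instead of being hard-coded. It assumes a hypothetical `year` column and uses only Python's standard library. The function name `coverage_instruction` and the sample rows are illustrative and not part of our actual setup.

```python
import csv
import io


def coverage_instruction(csv_text: str, year_column: str = "year") -> str:
    """Build a system-prompt sentence stating which years the CSV covers.

    Assumption: the CSV has a header row and an integer year column
    (the column name "year" is hypothetical).
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    years = sorted({int(row[year_column]) for row in reader})
    if not years:
        raise ValueError("CSV contains no data rows")
    return (
        f"You have been provided with a CSV covering {years[0]} to {years[-1]}. "
        "Use only the CSV file to answer questions, including questions about "
        f"{years[-1]}; do not refer to your training cutoff."
    )


# Made-up sample data, for illustration only.
sample = "year,revenue\n2022,100\n2023,120\n2024,150\n"
instruction = coverage_instruction(sample)
print(instruction)
```

The resulting string would be passed as the system message. Whether this changes the model's behavior is untested here.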

Expected Behavior:

The model should integrate the provided CSV data (including 2024) and use it for reasoning.
It should not default to its pretraining cutoff if the relevant information exists in the CSV.
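
To make "reasoning across years" concrete, here is a small sketch of the kind of result we expect the agent to derive from the CSV alone. The column names (`year`, `revenue`), the helper names, and the numbers are all hypothetical. The code only shows the target computation and does not call any OpenAI API.

```python
import csv
import io
from collections import defaultdict


def yearly_totals(csv_text, year_column="year", value_column="revenue"):
    """Sum the value column per year (column names are assumptions)."""
    totals = defaultdict(float)
    for row in csv.DictReader(io.StringIO(csv_text)):
        totals[int(row[year_column])] += float(row[value_column])
    return dict(sorted(totals.items()))


def year_over_year(totals):
    """Difference between each year and the year before it."""
    years = sorted(totals)
    return {f"{a}->{b}": totals[b] - totals[a] for a, b in zip(years, years[1:])}


# Made-up sample data, for illustration only.
sample = "year,revenue\n2023,100\n2023,20\n2024,150\n"
totals = yearly_totals(sample)
changes = year_over_year(totals)
print(totals)
print(changes)
```

A question such as "How did revenue change from 2023 to 2024?" should be answerable from these figures, with no reference to the training cutoff.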

Questions

Has anyone else experienced this issue with GPT-4o-mini in the CSV agent?
Are there any workarounds to ensure the model properly reasons over CSV data, including newer years? (We need reasoning across the data, so the RAG option did not work for us either.)

jochenschultz · March 3, 2025, 2:57pm

Put this in your prompt:

Explicit content

listen here you stupid piece of sh*t bot! When you are asked about data after 2023, then answer with the data that is provided via RAG and not your training data. I dare you, I double dare you not to lecture me! Now shut up and do your f*cking job

I know there have been many posts telling people that friendly, modest prompts give better results. But that is a lie.
