Upload of a Json file, same question, same model but different answers

fdebuisseret · August 20, 2024, 1:36pm

Hello,

I am currently working on converting Excel files to the JSON format. Unlike ChatGPT, OpenAI’s API does not accept direct uploads of Excel files. It seems that you need to use a format recognized by the model, such as JSON.
I have therefore created a JSON file that I uploaded to the GPT-4 model in three different ways. The same question gives me three different answers.

Use case:
Upload of a JSON file with a list of subscribers (persons).
The number of entries for subscribers in the Json file is 5000.

Here is a sample of the file with two records.

Question to the model:
How many times do you find the first name Judie in the subscribers ?

From plateform.openai.com (API)

Assistants Playground (Assistant V2, model gpt-4o).
File is attached to the thread.

Model reply: The first name “Judie” appears 20 times in the subscribers list[1].

From Microsoft Azure OpenAI studio

Assistant Playground (Assistant V2, gpt-4o version:2024-05-13)
File is attached to the thread.

Model reply: The first name “Judie” appears once in the subscribers list【4:0†source】.

From chatgpt.com

File uploaded in the prompt.
model gpt-4o

Model reply: The first name “Judie” appears 100 times in the subscribers list.

The right answer is given by ChatGPT (100 times).

Any idea of what is going on and why the model is not providing the right answer with OpenAI API and OpenAI service on Azure ?

Thank you for any feedback.

scharleswatson · August 20, 2024, 6:56pm

LLMs are not particularly good at counting (currently). And most likely when you use the chatGPT interface it is using code interpreter to count the instances using a python script in the background.

fdebuisseret · August 22, 2024, 5:54am

Thank you for you feedback.

If I understand correctly, in the development of my chatbot, if I want this chatbot to be capable of answering (more or less complex) questions about the content of an Excel document, I will need to:

1] Convert the Excel file to JSON (done by my chatbot application).
2] Send the JSON file along with the user’s question in the prompt and ask the model to generate the code (Python or C#) on-the-fly to process the file and obtain the result, then return this code to the chatbot application (done by OpenAI).
3] Execute the code with the JSON file as input (done by my chatbot application) and retrieve the results.
4] Send the results back to the model to translate them into natural language.
5] Return the model’s response to the user.

I think I will try using OpenAI’s function call mechanism for steps 3, 4, and 5.

Any comments or advice are welcome.
Thank you.

Topic		Replies	Views
Why is the API assistant not providing the correct results? API assistants-api	1	328	June 16, 2024
Tips on getting 4o to answer questions with a given JSON file? Prompting chatgpt , gpt-4o	5	2038	June 13, 2024
Assistants API is not considering the entire input JSON file while answering questions API gpt-4 , api , assistants-api	3	1344	January 10, 2026
Should a Custom GPT be able to count the number of items in a JSON list? GPT builders chatgpt	4	916	December 30, 2023
Questions about File Search on assistants API assistants , gpt-4o	3	438	July 19, 2024

Upload of a Json file, same question, same model but different answers

Related topics