I have loaded a dataset of about 50K rows of survey data. There are 8 columns, two of which are free-text. When I ask it to pull quotes that demonstrate a given topic, it consistently hallucinates the quotes: each one starts off matching my source data, then quickly drifts into nonsense. What am I getting wrong here? Is there a way to force ChatGPT to return my source data exactly as it appears?
Here’s an example of a prompt: “Show me some examples of respondents developing as leaders and mentors, skills like delivering feedback, coaching, mentoring, delegating work, etc. Include the full quote from the time card.”
GPT-4 can only handle up to ~8,000 tokens of context, so feeding it 50K rows of data can't work; it will effectively only keep the last ~8,000 tokens of whatever you paste. And if that wasn't bad enough, there are more problems on top of that.
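If you want to see how little actually fits, you can count tokens locally with OpenAI's tiktoken library before pasting anything. A minimal sketch, assuming a CSV export; "survey.csv" and the row-to-text formatting are placeholders for your own data:

```python
# Minimal sketch: count how many tokens the survey actually is,
# and how many rows fit in a ~8,000-token context window.
# "survey.csv" is a placeholder for your own export.
import pandas as pd
import tiktoken

df = pd.read_csv("survey.csv")
enc = tiktoken.encoding_for_model("gpt-4")

# Render each row as plain text and count its tokens
row_text = df.astype(str).agg(" | ".join, axis=1)
row_tokens = row_text.map(lambda s: len(enc.encode(s)))

print(f"total tokens: {row_tokens.sum():,}")
print(f"rows that fit in 8,000 tokens: {(row_tokens.cumsum() <= 8000).sum()}")
```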
Thanks for the heads up! I tried again with only 100 rows of data (8,100 tokens) and it still only paraphrases the quotes, all while insisting they are exact. Still, if the tool can only handle 100 rows at a time, I would rather just look at the data manually, or script the lookup as in the sketch below…
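For what it's worth, pulling verbatim quotes by topic doesn't really need an LLM at all; a plain keyword filter returns the rows untouched. A rough sketch, where the file name and the comment_1/comment_2 column names are stand-ins for your own:

```python
import pandas as pd

df = pd.read_csv("survey.csv")  # placeholder path

# Keywords for the leadership/mentoring topic from the prompt above
pattern = r"feedback|coach|mentor|delegat"

# Match either free-text column, case-insensitively (column names are assumptions)
mask = (
    df["comment_1"].str.contains(pattern, case=False, na=False)
    | df["comment_2"].str.contains(pattern, case=False, na=False)
)

# Every quote printed here is copied straight from the source data
# (only the first free-text column is shown, for brevity)
for quote in df.loc[mask, "comment_1"].head(10):
    print(f'- "{quote}"')
```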
I also find it difficult to use ChatGPT to analyze tables & numbers. In case it helps, I tried to develop a custom GPT that self-reports hallucinations; feel free to test it out in your research here, and good luck!
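If you'd rather not rely on the model policing itself, an exact-substring check against the source text flags paraphrased quotes directly. A minimal sketch; the quote list and column names are stand-ins:

```python
import pandas as pd

df = pd.read_csv("survey.csv")  # placeholder path

# Concatenate both free-text columns into one searchable corpus
corpus = (
    " ".join(df["comment_1"].dropna().astype(str))
    + " "
    + " ".join(df["comment_2"].dropna().astype(str))
)

# Quotes the model claimed were verbatim (stand-in values)
returned_quotes = [
    "I learned to give constructive feedback to my team.",
]

for q in returned_quotes:
    verdict = "verbatim" if q in corpus else "paraphrased or hallucinated"
    print(f"[{verdict}] {q}")
```

In practice you'd probably want to normalize whitespace and curly quotes on both sides before comparing, since the model often changes those even when the words match.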