So I’m helping a friend clean up audio transcriptions for a school project he’s working on. There’s LOTS of text (in this case, about 6.6k tokens), and I’m sending a bunch of it to GPT-4o to clean it up: rephrasing some sentences, correcting wrongly transcribed words, all that jazz.
I did it a first time through the API, and it worked perfectly well. I then did exactly the same thing through ChatGPT and got some absolutely nonsensical answers that I had never gotten before. It looked like I was getting other people’s generations instead of my own?
Note that my prompt and the text were both in French… then ChatGPT replied with a completely irrelevant answer in English:
(Note: the original text or prompt did not contain any mention of crime in England or Wales. It did not have ANYTHING to do with this whatsoever.)
Confused, I asked ChatGPT why it suddenly brought this up, and it then changed topic again, entirely, answering a question about… The Witcher series?
(The model is basically telling me about a fire that destroyed The Witcher costumes and delayed the show’s production. I can’t find anything about this online, looks like it never happened.)
I kept asking it about this, and it replied to a question about PDFs that I never asked.
(Telling me how to reduce PDF filesize.)
On top of all this, the conversation title that got generated for this one is just…
Any idea why this is happening? Is it just completely thrown off by the long text?
EDIT: I’ve started more conversations with the exact same prompt and the same bug happened every time. I can’t tell if the model is just hallucinating or if I’m somehow getting other people’s generated responses instead of mine. So far I’ve had the AI ignore my prompt and instead:
- Perform a web search about “The 2024 Canadian Census Test”
- Generate a graph about “College Admissions by Ethnicity and Status”
- Talk about the November 5, 2024 US election
- Talk about the Michigan GOP primary of August 2, 2022
- Analyse a “covid_countries_cases.csv” file I never uploaded
- Explain how to solve a quadratic equation, try to use the code interpreter, then say “Sorry, I don’t have information about the results of that election.”