ChatGPT forgets information from earlier in the thread

So what I’ve been doing with ChatGPT is pasting in summaries of classes I’ve taught and then querying it. The issue is that once I’ve pasted in about 5 pages of summaries (roughly 2,400 words), it starts failing to retrieve information that was pasted in earlier in the thread. In effect, it can only recall the most recent 2,400 words or so of the thread.

Does anyone know if the pro version of ChatGPT will be more powerful? This memory limitation makes it a no-go for a lot of use cases (e.g. semantic search through archives of meeting minutes, or checking a novel draft for plot holes).


I’ve noticed similar limitations. Have you tried giving a name to each of the uploads? “Part 1”, “Part 2”, etc. – then ask it to incorporate Part 1 and Part 2… I’ve had some success that way.


I labelled each upload with a date, then pasted them into ChatGPT in chronological order (oldest first). ChatGPT misses entire years.

For example, here you can see there is a lot of information from 2019:

[screenshot of the pasted 2019 summaries]

But ChatGPT can’t remember receiving that information:

[screenshot of ChatGPT saying it has no such information]

There are definitely limitations. Not sure it will work, but have you tried naming each time period?

In other words, start by dropping in just the 2019 dates and explaining to ChatGPT: “These are events that relate to 2019. Let’s name them ‘2019 events’. Do you understand that these events relate to the year 2019? Do you understand that I will share events that relate to 2020 and 2021 next?”

Once it understands, then add the 2020 events – name them and make sure it understands again… then repeat for 2021, naming and making sure it understands.

Once you do that, try asking the question… it just might work.


That wouldn’t work, because it completely forgets the info from earlier in the thread. For example, the 2019 data described a trip to watch Japanese kabuki. But even when I ask ChatGPT directly whether there is any information about kabuki, it says there is nothing about kabuki. I tried querying it a bunch of ways and got the same result.

This limit of around 2,400 words that you’re assuming – does it count only your prompts, or also the answers from ChatGPT?

OpenAI models such as davinci have a limit of 4,000 tokens, which lines up pretty closely with the ~2,400-word limit you saw.
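To sanity-check pasted text against that 4,000-token window before uploading, a rough word-based heuristic works: the common rule of thumb is roughly 0.75 words per token (so ~4/3 tokens per word). This is only an approximation of my own choosing – the function names and the 500-token reply margin below are assumptions, and an actual tokenizer (e.g. OpenAI's tiktoken library) gives exact counts:

```python
# Rough token estimate for checking pasted text against a model's
# context window. The ~4/3 tokens-per-word heuristic and the reply
# margin are assumptions; use a real tokenizer for exact counts.

def estimate_tokens(text: str) -> int:
    """Approximate token count from word count (~4/3 tokens per word)."""
    return round(len(text.split()) * 4 / 3)

def fits_context(text: str, context_limit: int = 4000) -> bool:
    """True if the text, plus room for the model's reply, fits the window."""
    reply_margin = 500  # leave space for the answer
    return estimate_tokens(text) + reply_margin <= context_limit

summary = "word " * 2400  # ~2,400 words, like the class summaries above
print(estimate_tokens(summary))  # 3200
print(fits_context(summary))     # True -- but barely, and replies count too
```

Note that the model's own answers also consume the window, which is why a thread can hit the limit well before your pasted text alone reaches 4,000 tokens.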

ChatGPT probably summarizes things internally to stretch this “limit”, but once the context holds a lot of condensed information, it gets lost quickly.
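One plausible mechanism behind the behaviour described in this thread is a rolling window: keep only the most recent messages that fit the token budget and silently drop the oldest. This is a guess at the mechanism, not ChatGPT's actual implementation, and all names below are hypothetical:

```python
# Sketch of rolling-window truncation a chat frontend might apply:
# keep the newest messages whose combined size fits the context budget,
# dropping the oldest first. A guess, not ChatGPT's real implementation.

def truncate_history(messages: list[str], budget_tokens: int) -> list[str]:
    """Keep the newest messages that fit within budget_tokens.

    Token counts are approximated as ~4/3 tokens per word.
    """
    kept: list[str] = []
    used = 0
    for msg in reversed(messages):  # walk from newest to oldest
        cost = round(len(msg.split()) * 4 / 3)
        if used + cost > budget_tokens:
            break  # this message and everything older is dropped
        kept.append(msg)
        used += cost
    return list(reversed(kept))  # restore chronological order

history = ["2019 events: kabuki trip ...", "2020 events ...",
           "2021 events ...", "Is there anything about kabuki?"]
print(truncate_history(history, budget_tokens=20))
# with a tight budget, the 2019 message no longer fits and is dropped
```

Under this model, the 2019 kabuki trip isn't "forgotten" so much as it never reaches the model at all once newer messages have filled the window.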


That sucks, and rules out a huge number of potential use cases. I hope they introduce a premium version with, say, a 400,000-token limit.


Absolutely awful. Today it remembered yesterday’s script and rewrote it with modifications. Then an app mistakenly reported missing fonts, and ChatGPT suggested installing fonts on the system. I said there were plenty, so ChatGPT recommended a command-line query to list the fonts installed on my system. I pasted the list, ChatGPT said something generic about it, and no matter how I prompted, it couldn’t bring the list back into context.


I totally agree about increasing the tokens. It’s pretty worthless otherwise. You can’t plot a movie if it forgets the title of the movie. ChatGPT can’t even recall the name under which the conversation is being saved.


I’m experiencing a similar problem. I share AlexTully’s viewpoint and was already considering the same solution. It would be beneficial if a premium version with higher token limits were introduced. Personally, I would be interested in purchasing an unlimited version.


If you’ve got the $$$, they offer …

Dedicated instances

We are also now offering dedicated instances for users who want deeper control over the specific model version and system performance. By default, requests are run on compute infrastructure shared with other users, who pay per request. Our API runs on Azure, and with dedicated instances, developers will pay by time period for an allocation of compute infrastructure that’s reserved for serving their requests.

Developers get full control over the instance’s load (higher load improves throughput but makes each request slower), the option to enable features such as longer context limits, and the ability to pin the model snapshot.

Dedicated instances can make economic sense for developers running beyond ~450M tokens per day. Additionally, it enables directly optimizing a developer’s workload against hardware performance, which can dramatically reduce costs relative to shared infrastructure. For dedicated instance inquiries, contact us.


The memory capacity is not exactly something they can just increase arbitrarily. It’s a limitation of the way they trained the model. The older, smaller models had even less capacity (about half). To increase the capacity significantly would require upgrading the model and retraining it from scratch, pretty much. To my understanding anyway.

So how much would it cost to subscribe to a version of ChatGPT with the token limit increased by 100x–1,000x?

Assuming the cost of turbo, they say 450M tokens per day is the break-even point, which works out to $900 a day.
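The $900/day figure checks out if we assume the gpt-3.5-turbo list price of $0.002 per 1,000 tokens (my assumption of the rate being used here):

```python
# Back-of-the-envelope check of the break-even figure above, assuming
# the gpt-3.5-turbo list price of $0.002 per 1,000 tokens.

PRICE_PER_1K_TOKENS = 0.002   # USD, assumed rate
tokens_per_day = 450_000_000  # OpenAI's stated break-even point

daily_cost = tokens_per_day / 1000 * PRICE_PER_1K_TOKENS
print(f"${daily_cost:,.0f} per day")  # -> $900 per day
```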

Oh, and no idea whether this is a 10x or 100x increase in the window size. My guess is no more than 10x.


Wow, thanks for letting me know. I think I’ll hold off until OpenAI (or a competitor) releases a cheaper model that accepts longer inputs.


When I first started using it, I used it to write an entire 200-page e-book. I spent days and days chatting with it and developing the book, and it was amazing and worked great (I should mention that when using ChatGPT to write a book, you’re still doing about 90% of the work).

Then the news broke and it became the hot thing, and the developers must have changed something about how it handles resources and how much it retains per conversation, because it has slowly deteriorated to the point that, a few months later, it’s become almost useless: it can only hold the context of your most recent input in a conversation. You can’t even compose an email, because you’ll ask two or three questions and it forgets everything from the first two questions and only responds to the last one. Then you have to build up this huge context to paste in each time, and you start approaching the character limit. Subscribing to premium doesn’t change this. Very disappointing tool.


ChatGPT has started forgetting what you said to it in the same conversation. For example, I send the first part and then the second part, and it forgets the first part after I send the second. A month or so ago this problem didn’t exist. Many of us are wondering whether it is being improved or made weaker.