Memory for Chat Completions

yournowto · July 29, 2024, 8:50am

I have a question about Chat Completions. I have been using the Chat Completions API for a couple of months, and I have run into something new. Within a single chat (via API calls), if I continue on the same topic with new requests (normally this was not possible), my chat response ID appears to stay the same. It used to change on every request. Does this mean that the API has memory now?

I tried sending a new request without the previous messages and saw that the memory feature was not working. I believe memory is only available with Assistants, but this recent change (the same chat response ID) made me start questioning that.

Any ideas?

merefield · July 29, 2024, 10:33am

No, I don’t believe the Chat Completions API has memory yet or we would have seen a big announcement and changes to docs.

In any case, they already have a product that fills that niche (Assistants).

Very easy to test this.

State a few facts about yourself, such as your favourite colour, and then follow up by asking what your favourite colour is.

My chatbot will get it right every time because it has local memory; without it, it would fail most of the time.
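To make the test concrete, here is a minimal sketch of what "local memory" could look like on the client side. It is not taken from any real chatbot: `LocalMemoryChat` and `fake_send` are hypothetical names, and `fake_send` is a stand-in for the model that can only "know" what is in the messages it receives, so the example runs without an API key.

```python
from typing import Callable, Dict, List

Message = Dict[str, str]


class LocalMemoryChat:
    """Keeps the conversation client-side and resends it on every call."""

    def __init__(self, send: Callable[[List[Message]], str], use_memory: bool = True):
        self.send = send              # hypothetical: would wrap the real API call
        self.use_memory = use_memory
        self.history: List[Message] = []

    def ask(self, text: str) -> str:
        user_msg = {"role": "user", "content": text}
        # With memory: full history plus the new message. Without: new message only.
        payload = self.history + [user_msg] if self.use_memory else [user_msg]
        reply = self.send(payload)
        self.history += [user_msg, {"role": "assistant", "content": reply}]
        return reply


def fake_send(messages: List[Message]) -> str:
    """Stand-in for the model: it only sees what is in `messages`."""
    seen = " ".join(m["content"] for m in messages if m["role"] == "user")
    return "blue" if "favourite colour is blue" in seen else "I don't know"


with_memory = LocalMemoryChat(fake_send, use_memory=True)
with_memory.ask("My favourite colour is blue.")
print(with_memory.ask("What is my favourite colour?"))

without_memory = LocalMemoryChat(fake_send, use_memory=False)
without_memory.ask("My favourite colour is blue.")
print(without_memory.ask("What is my favourite colour?"))
```

With a real model in place of `fake_send`, the second case is the one that would fail most of the time, since the earlier statement never reaches the model.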

sps · July 29, 2024, 10:37am

Welcome @yournowto

Here’s the structure of a chat completion response:

{
  "id": "chatcmpl-123",
  "object": "chat.completion",
  "created": 1677652288,
  "model": "gpt-4o-mini",
  "system_fingerprint": "fp_44709d6fcb",
  "choices": [{
    "index": 0,
    "message": {
      "role": "assistant",
      "content": "\n\nHello there, how may I assist you today?"
    },
    "logprobs": null,
    "finish_reason": "stop"
  }],
  "usage": {
    "prompt_tokens": 9,
    "completion_tokens": 12,
    "total_tokens": 21
  }
}

The chat completion id in my tests has always varied with every single request.

ETA: I can confirm that it’s still different for every API call @yournowto .
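If you want to check this on your own traffic, one option is to log the `id` field of each response and look for repeats. The sketch below assumes the responses are available as plain dicts shaped like the structure above; `repeated_ids` is a hypothetical helper, and the sample ids other than `chatcmpl-123` are made up for illustration.

```python
from typing import Dict, Iterable, List


def repeated_ids(responses: Iterable[Dict]) -> List[str]:
    """Return every response id that occurs more than once, in first-repeat order."""
    seen = set()
    repeats: List[str] = []
    for response in responses:
        rid = response["id"]
        if rid in seen and rid not in repeats:
            repeats.append(rid)
        seen.add(rid)
    return repeats


# Illustrative data only, not real API output.
logged = [
    {"id": "chatcmpl-123"},
    {"id": "chatcmpl-456"},
    {"id": "chatcmpl-123"},
]
print(repeated_ids(logged))
```

An empty list means every logged call returned a distinct id. A repeat is worth investigating on the client side first, for example a cached response object being reused.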

yournowto · July 29, 2024, 11:05am

My solution also has memory, but in the end you still need to send the previous conversation to the API, which drives up token usage. (I am talking about Chat Completions.)

For Assistants, things are totally different.
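A rough way to see why resending the conversation gets expensive: each request's prompt contains every earlier message plus the new one, so prompt tokens accumulate across turns. The sketch below uses made-up token counts (50 per user message, 100 per reply) purely as an assumption; `prompt_tokens_per_request` is a hypothetical helper, not part of any API.

```python
from typing import List, Tuple


def prompt_tokens_per_request(turns: List[Tuple[int, int]]) -> List[int]:
    """For each turn (user_tokens, reply_tokens), return the prompt size
    when the full prior conversation is resent with the new user message."""
    prompts = []
    history = 0
    for user_tokens, reply_tokens in turns:
        prompts.append(history + user_tokens)
        # Both the user message and the reply become history for the next turn.
        history += user_tokens + reply_tokens
    return prompts


# Assumed sizes for illustration: 4 turns of 50-token questions and 100-token answers.
per_request = prompt_tokens_per_request([(50, 100)] * 4)
print(per_request)       # [50, 200, 350, 500]
print(sum(per_request))  # 1100
```

In this example the four user messages total only 200 tokens, yet 1100 prompt tokens are billed, because the history is paid for again on every request.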

yournowto · July 29, 2024, 11:06am

Yes, previously I was also getting varied IDs. But interestingly, in recent versions it has started returning the same ID.

merefield · July 29, 2024, 11:21am

You get what you pay for

It’s not like using the Assistant API will save you money, as I suspect the token usage will be even greater.

yournowto · July 29, 2024, 11:38am

Interesting.

Then what is your suggestion?

merefield · July 29, 2024, 11:53am

Reduce your context window to the minimum practical, use RAG, and use cheaper but capable models like mini.
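As a sketch of the first suggestion, one simple approach is to keep any system message and then only as many of the most recent messages as fit a token budget. Everything here is an assumption for illustration: `trim_history` and `rough_tokens` are hypothetical helpers, and the whitespace word count stands in for a real tokenizer.

```python
from typing import Callable, Dict, List

Message = Dict[str, str]


def rough_tokens(message: Message) -> int:
    # Assumption: a whitespace word count stands in for a real tokenizer.
    return len(message["content"].split())


def trim_history(
    messages: List[Message],
    budget: int,
    count: Callable[[Message], int] = rough_tokens,
) -> List[Message]:
    """Keep system messages, then the newest other messages that fit the budget."""
    system = [m for m in messages if m["role"] == "system"]
    rest = [m for m in messages if m["role"] != "system"]
    remaining = budget - sum(count(m) for m in system)
    kept: List[Message] = []
    for message in reversed(rest):  # walk from newest to oldest
        cost = count(message)
        if cost > remaining:
            break
        kept.append(message)
        remaining -= cost
    return system + kept[::-1]      # restore chronological order


conversation = [
    {"role": "system", "content": "Be brief."},
    {"role": "user", "content": "My favourite colour is blue."},
    {"role": "assistant", "content": "Noted."},
    {"role": "user", "content": "What is my favourite colour?"},
]
for m in trim_history(conversation, budget=8):
    print(m["role"], "|", m["content"])
```

Note that with this budget the message stating the colour is dropped, so the model could no longer answer the question. That trade-off is where retrieval (RAG) comes in: older facts are looked up and reinserted only when relevant, rather than resent every time.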

Also: monetize accordingly!
