Max number of tokens a Thread can use equal the Context Length of the used model?

_building · December 1, 2023, 10:37am

I was under the impression that each Message in thread is given its own Context Length. Am I wrong? So all messages in a thread will consume tokens from a single Context?

_j · December 1, 2023, 11:40am

You are correct.

Every question to an AI model that acts as a chatbot with memory must be accompanied by some past chat so the AI can understand what you were talking about.

“Thread” sounds like you are using “assistants” (which I would recommend against), which has no limit of the conversation length, and will fill the AI context window length with as much past conversation as will fit, and no way to budget it or even start again with a shorter conversation.

_building · December 1, 2023, 12:16pm

Why would you recommend against assistants API? Are there any pros to not using it?

_j · December 1, 2023, 12:20pm

The assistants feature is new, with the main feature that you cannot control or even see how much you are being billed for use. There are too many cons to using it.

Instead, one would just use the traditional chat completion API, which is actually straightforward, responsive, configurable, accountable.

Topic		Replies	Views
Context tokens in Assistant API API assistants-api	2	2128	February 20, 2024
Thread length = more context tokens? API assistants-api	3	216	July 14, 2024
Token consumption: Prompt tokens exponentially increase when using Threads (Assistants) API assistants-api	8	584	September 5, 2024
How many tokens is the size of the context window in Open AI Assistant? API	5	3268	April 8, 2024
Why are my context tokens used so quickly? API api	3	2856	January 5, 2024

Max number of tokens a Thread can use equal the Context Length of the used model?

Related topics