Limit the context size consumed by the model when using Threads

robin.pham · November 9, 2023, 1:50am

Threads don’t have a size limit. You can pass as many Messages as you want to a Thread. The API will ensure that requests to the model fit within the maximum context window, using relevant optimization techniques such as truncation.

I’d really love it if we could customize this ‘maximum context window’ that it considers and still have the API handle the truncation for us. i.e. I want to just use a 10k context window or a 30k context window rather than always using the full 100k. Always consuming the maximum window isn’t always desired due to costs. But I know this isn’t super urgent as it’s possible to do this truncation manually.

Topic		Replies	Views
Thread Truncation With New Assistants API API threads	0	1087	November 8, 2023
Can we manually control the thread lenght? API gpt-4 , api , assistants-api	2	514	December 13, 2023
Longer context limits. Shall we expect it at some point in the future? API	0	670	March 2, 2023
Request: make it possible to specify the upper limit of history Feedback assistants-api	11	1050	July 10, 2024
Thread length = more context tokens? API assistants-api	3	194	July 14, 2024

Limit the context size consumed by the model when using Threads

Related topics