Hi fellow enthusiasts. What is the recommended approach if wishing to using large context files e.g. 30k tokens being passed into a prompt on the API? Best, Shaun

Today’s update with GPT-4 Turbo has solved my query! The only thing now is to watch the cost of my queries as these will certainly rack up if I’m sending large token sizes through regularly.

How to handle large context token limits?

API

jwatte November 3, 2023, 9:46pm 2

You’ll have to compress the prompt by sending smaller pieces and asking the model to summarize each piece, before you send the total of the summarized pieces in for final inference.

Topic		Replies	Views
4096 response limit vs 128 000 context window API	11	11577	February 6, 2025
CLOSED Separate ChatCompletion API calls for 'system' and 'user' API	19	3514	September 20, 2023
Token limits on prompting Prompting plugin-development	4	2372	June 16, 2023
Chained Prompt to complete text larger than 4000 tokens? API	14	6042	December 25, 2023
Seem to be unable to reach context limit in my API request API gpt-4	9	610	June 17, 2024

How to handle large context token limits?

Related topics