16k Input vs Output: Edit and token strategies for long input texts

neon-green · August 27, 2023, 10:49am

My first post, I am so excited!

I am using turbo 3.5 16k to process “long” texts, mainly for summaries.

Recently I realized that the answers I receive are cut off (or weird in other ways). I assume this is because prompt and response must and will not exceed 16k tokens. And in some cases the prompt almost completely consumes the token limit.

What would be a good strategy to always reserve “enough” tokens for the answer? I am OK with reasonably shortening the original input text. So: Let it be a fixed number, or a certain percentage?

Thank you for your time. Best, J

_j · August 27, 2023, 12:22pm

If you are using a parameter max_tokens with your API call, this sets a limit of the size of the response you can receive. The AI might want to write more, but it will be cut off. Make the setting to big, and you’ll get errors back about your input because it reserves space.

You can remove this max_tokens specification completely, and there will be no artificial output limit, nor will you need to do any special calculations in order to use all of the remaining context length after your input for forming an answer.

Then it is simply up to you to not send too much.

The AI models are recently even more trained not to give large answers. It is as if OpenAI gave tons of fine-tuning just for ChatGPT’s 1500 token output limit, and didn’t alter this behavior for any special API or 16k response models.

Thus it will be challenging to compose language or rewrites that are long and take advantage of the output capabilities of the large context length model. Tricks in telling the AI to follow multiple individual instructions, or telling it that it has a 100000 word output limit and 20000 word target may help.

Topic		Replies	Views
Long Prompt with Large Text Data Prompting gpt-35-turbo , chatgpt , api	3	13463	July 14, 2023
Struggling with max_tokens and getting responses within a given limit, please help! API chatgpt	5	19433	October 28, 2023
How to print the output over 10,000 tokens? API gpt-4o-mini	4	735	September 9, 2024
Setting max tokens for output issues API gpt-4 , api	4	3701	January 26, 2024
Impossible to generate texts of more than 600 words API	5	3304	December 18, 2023

16k Input vs Output: Edit and token strategies for long input texts

Related topics