Is there also a situation where GPT stops generating, so we need to make it continue manually?

There are a few reasons why you could be getting truncated output (see the sketch after the list for how to tell the cases apart):

  1. You are setting a `max_tokens` value with your API call, and the generated output has exceeded that limit;
  2. You are sending a very large input, so an adaptive or unset `max_tokens` value doesn’t leave enough context length after the input to produce the desired response;
  3. Your streaming generation is taking too long: either your platform times out after a short period (like 60 seconds of open connection), or you successfully made the AI produce output so long that it hits the server’s own five-minute timeout (around 10,000 tokens of a -16k model’s generation at “normal speed” starts to approach the five-minute range);
  4. The AI was done writing and generated a stop token (or rather, a stop token was selected from the sampling of likely output tokens).

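To distinguish these cases in code, the `finish_reason` field on each choice tells you whether the output was cut off by `max_tokens` (`"length"`, cases 1–2) or ended on a stop token (`"stop"`, case 4), and you can resume automatically by feeding the partial answer back. A minimal sketch, assuming the OpenAI Python SDK (v1.x) and the Chat Completions endpoint; the model name, prompt, limits, and the “continue” wording are all placeholders:

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

messages = [{"role": "user", "content": "Write a long essay about tokenizers."}]
reply = ""

for _ in range(5):  # cap the number of continuation rounds
    response = client.chat.completions.create(
        model="gpt-3.5-turbo-16k",
        messages=messages,
        max_tokens=1024,
    )
    choice = response.choices[0]
    reply += choice.message.content

    # finish_reason tells you which case you hit:
    #   "length" -> truncated by max_tokens (case 1 or 2)
    #   "stop"   -> the model emitted a stop token on its own (case 4)
    if choice.finish_reason != "length":
        break

    # Feed the partial answer back and ask the model to keep going.
    messages.append({"role": "assistant", "content": choice.message.content})
    messages.append({"role": "user", "content": "Continue exactly where you left off."})

print(reply)
```
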
Most of the above have obvious solutions once you do the logging and troubleshooting. #4 would require lowering the temperature or top_p, so that fewer unlikely tokens are selected and the output you see follows the AI’s production intent.
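
For #4, a sketch of what tightening the sampling parameters could look like; the exact values here are illustrative, not recommendations:

```python
from openai import OpenAI

client = OpenAI()

# Tighter sampling makes an unlikely early stop token less likely to be
# picked from the distribution. Values are placeholders for experimentation.
response = client.chat.completions.create(
    model="gpt-3.5-turbo-16k",
    messages=[{"role": "user", "content": "Write a long essay about tokenizers."}],
    max_tokens=1024,
    temperature=0.5,  # below the default 1.0: sharpens the token distribution
    top_p=0.9,        # trims the low-probability tail before sampling
)
print(response.choices[0].finish_reason)  # expect "stop" when it finishes naturally
```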
