Is GPT3.5 API designed to generate responses to fit within the remaining token space?

akramsam · April 30, 2023, 3:19pm

I can’t get gpt3.5-turbo api to generate long articles as in the ChatGPT interface (https://chat.openai.com/). When I input the same instructions as I do in the API, the website produces long responses that get cut off due to the token limit. I then prompt it with “continue” to generate the rest of the article. I have tried this multiple times and consistently receive long articles that require me to use “continue.”. This is exactly what I want, and I want to replicate this behavior when making API calls to gpt-3.5-turbo. However, with the API calls, it seems that gpt-3.5-turbo attempts to condense the entire article into the remaining space, resulting in a loss of information as it tries to summarize the content to fit the constraints. I have encountered this issue numerous times, as the model always tries to fit the entire article within the available space.

How can I replicate the long article generation behavior I experience on the ChatGPT website when using API calls to gpt-3.5-turbo, so that I can generate extended articles without losing information due to summarization?

Topic		Replies	Views
How to complete Long API responses? API gpt-35-turbo , chatgpt	6	4819	December 19, 2023
GPT fails to deliver a condensed version of a text API	4	570	December 24, 2023
Is it possible to have the response fit inside the max token limit? API gpt-35-turbo	2	2704	December 19, 2023
Impossible to generate texts of more than 600 words API	5	3243	December 18, 2023
GPT-4 8k token API response size limit API	1	1403	December 16, 2023

Is GPT3.5 API designed to generate responses to fit within the remaining token space?

Related topics