Is GPT3.5 API designed to generate responses to fit within the remaining token space?

akramsam · April 30, 2023, 2:49pm

I am experiencing difficulties in generating long articles using GPT within the ChatGPT interface (https://chat.openai.com/). When I input the same instructions as I do in the API, the website produces long responses that get cut off due to the token limit. I then prompt it with “continue” to generate the rest of the article. I have tried this multiple times and consistently receive long articles that require me to use “continue”. This exactly what I want, and I want to replicate this behavior when making API calls to gpt-3.5-turbo. However, with the API calls, it seems that gpt-3.5-turbo attempts to condense the entire article into the remaining space, resulting in a loss of information as it tries to summarize the content to fit the constraints. I have encountered this issue numerous times, as the model always tries to fit the entire article within the available space.

How can I replicate the long article generation behavior I experience on the ChatGPT website when using API calls to gpt-3.5-turbo, so that I can generate extended articles without losing information due to summarization?

Topic		Replies	Views
Is GPT3.5 API designed to generate responses to fit within the remaining token space? API	1	390	December 24, 2023
How to complete Long API responses? API gpt-35-turbo , chatgpt	6	2352	December 19, 2023
GPT fails to deliver a condensed version of a text API	4	290	December 24, 2023
Impossible to generate texts of more than 600 words API	5	1817	December 18, 2023
How to force to continue a truncated completion? API	2	3486	December 24, 2023

Is GPT3.5 API designed to generate responses to fit within the remaining token space?

Related Topics