Completion cut off and unknown characters

mahaeg · April 10, 2022, 8:10pm

Hi everyone!

I am relatively new to GPT-3, so my apologies if my questions seem quite trivial. I have tried looking into existing questions, but even though some questions are quite similar but none of them directly answered my queries.

I have 2 questions:

How can I prevent the system from generating incomplete completions (cut off ) independent of the max token numbers. If in my app I am using GPT-3 with Java. Will it be a better and possible option to post process the generated completion to truncate it and remove extra words after a “.” before displaying the generated completion to the user?
My fine tuned model occasionally generates text containing unknown characters to me (maybe similar to Chinese language- I don’t speak Chinese) even though my dataset was only provided in English language. Does anyone know what is the reason for this and how can I solve it?

Thanks in advance!

lmccallum · April 10, 2022, 8:34pm

Hi, regarding your first question, I have only experienced the problem of “incomplete completions” when bumping up against the maximum tokens specified. So make sure you are being generous enough with your maximum. To ensure that I got a “complete completion” within my specified maximum tokens, I found it helpful to include explicit instructions to that effect, something like this: “summarize details to provide an answer in 2-3 succinct sentences.” Sorry I have no idea about your second question.

mahaeg · April 16, 2022, 3:47pm

My fine-tuned model is behaving in a way that it keeps generating text until the maximum token specified. However, aligned with the purpose I want it for, one generated sentence should be enough. Therefore, I use a “.” as stop sequence. It is working fine for now. Thank you for your help!

Topic		Replies	Views
Finetune model completion cut off too short Prompting	7	3938	January 17, 2023
Is it possible to stop a completion at the Nth occurrence of the stop sequence? API	9	1659	December 18, 2023
Output seems to stop abruptly--why is that? API	6	3759	May 22, 2023
How to force to continue a truncated completion? API	2	5525	December 24, 2023
Issues with Truncated Responses API	3	2677	April 22, 2024

Completion cut off and unknown characters

Related topics