Hi everyone!
I am relatively new to GPT-3, so my apologies if my questions seem trivial. I have looked through existing questions, and although some are quite similar, none of them directly answered mine.
I have 2 questions:
- How can I prevent the model from generating incomplete (cut-off) completions, regardless of the max tokens setting? My app uses GPT-3 from Java. Would it be possible, and a better option, to post-process the generated completion and remove any trailing words after the last “.” before displaying it to the user?
- My fine-tuned model occasionally generates text containing characters I don't recognize (possibly Chinese; I don't speak Chinese), even though my dataset was entirely in English. Does anyone know the reason for this and how I can solve it?
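
To make the first question concrete, here is a minimal sketch of the post-processing I have in mind. The class and method names (`CompletionTrimmer`, `trimToLastSentence`) are hypothetical and not part of any OpenAI library. I also assume that cutting at the last ".", "!" or "?" is acceptable, which is naive: it can cut in the wrong place after abbreviations or decimals such as "3.5".

```java
class CompletionTrimmer {

    // Hypothetical helper: keeps the text up to and including the last
    // sentence-ending character and drops any trailing fragment after it.
    static String trimToLastSentence(String completion) {
        if (completion == null) {
            return "";
        }
        String text = completion.trim();
        int lastEnd = -1;
        // Scan backwards for the last '.', '!' or '?'.
        for (int i = text.length() - 1; i >= 0; i--) {
            char c = text.charAt(i);
            if (c == '.' || c == '!' || c == '?') {
                lastEnd = i;
                break;
            }
        }
        // Assumption: if no sentence end is found, return the trimmed text
        // unchanged instead of discarding the whole completion.
        return lastEnd < 0 ? text : text.substring(0, lastEnd + 1);
    }

    public static void main(String[] args) {
        String raw = "The model answered. Then it started to";
        System.out.println(trimToLastSentence(raw)); // prints "The model answered."
    }
}
```

The idea would be to call this on the completion text before showing it to the user, but I am not sure whether this is better than handling it on the API side.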
Thanks in advance!