I’ve been using the GPT-4 API, and at first I thought the problem came from my own application. A while later I became a member of the Voila browser extension, which also offers the GPT-4 API. The problem with Arabic outputs in Voila’s responses is the same one that happened to me before: the output turns into weird rubbish data in Arabic that damages the whole response.
It doesn’t happen just sometimes; it happens ALWAYS when there’s a long response, such as articles or web results.
Here’s an example of where the rubbish data starts; I have to turn it off immediately because it causes some visual problems:
If using the chat completions endpoint, you should use the top_p and temperature sampling parameters to reduce unexpected tokens in world languages with less AI training data.
temperature = 0.4 and top_p = 0.4 is a good starting point for both parameters.
A single “wrong” token can turn the AI output into an ongoing production of nonsense.
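Here’s a minimal sketch of how those two parameters could be passed, assuming the openai Python SDK v1 and an API key set in the environment; the Arabic user prompt is only a placeholder, not something from either application:

```python
# Minimal sketch: chat completions call with reduced sampling randomness.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

response = client.chat.completions.create(
    model="gpt-4-0613",   # dated snapshot model mentioned below
    temperature=0.4,      # lower randomness when picking each token
    top_p=0.4,            # restrict sampling to the top 40% of probability mass
    messages=[
        {"role": "system", "content": "You are a helpful assistant that answers in Arabic."},
        {"role": "user", "content": "اكتب مقالة قصيرة عن الذكاء الاصطناعي."},  # placeholder prompt
    ],
)

print(response.choices[0].message.content)
```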
gpt-4-0613 will be higher quality than the (price and everything else reduced) turbo preview model, which also has problems producing correct language characters in function calls.
Especially don’t attempt “assistants” with “retrieval” at this time, which requires the preview model, because you have no sampling parameter control, flawed function output, and uncontrolled looping on errors within the agent.
I am the one that offered the last advice in this topic four months ago.
Even today, I use a prompt along the lines of “review this code you wrote, because despite your high comprehension, the tokens that were printed are generated by a random sampling process that may not have been ideal, introducing errors”.
I am guessing that you are not simply taking a satisfaction survey, but have your own issues that are distinct from what has previously been answered.
You will get better forum answers if you tell us what on-topic concerns you are facing yourself.