API does not support UTF-8 encoding

Greetings to all,

I am currently working on a chatbot application in Flutter and I have encountered an issue with the encoding of responses received from the OpenAI API. When I make HTTP requests, some characters (‘é’, ‘à’, ‘où’, …) are displayed incorrectly. I have attempted to resolve this issue by adding the ‘charset=UTF-8’ parameter in the API configuration, but the result remains the same.
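
Here is a minimal sketch of the kind of request I am making (the API key, model, and prompt are placeholders; error handling omitted):

    import 'dart:convert';
    import 'package:http/http.dart' as http;

    Future<void> main() async {
      final response = await http.post(
        Uri.parse('https://api.openai.com/v1/chat/completions'),
        headers: {
          'Authorization': 'Bearer YOUR_API_KEY', // placeholder
          'Content-Type': 'application/json; charset=UTF-8',
        },
        body: jsonEncode({
          'model': 'gpt-3.5-turbo',
          'messages': [
            {'role': 'user', 'content': 'Qui est une célébrité française ?'}
          ],
        }),
      );
      // The accented characters in response.body come out garbled here.
      print(json.decode(response.body));
    }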

Hi @darrelx

What exactly do you mean above?

Displayed where? On an HTML page? In the terminal?

🙂

For a French response, instead of “célébrité” or “répondre à”, I get “cà © lèbrèbres” or “rà © pondre Ô”.
Displayed in my terminal. I’m using the latest version of PowerShell, so I don’t think the terminal lacks support for these characters. And please excuse me in advance if my English or my wording is not clear.

I’m having the same kinds of problems. It looks like there are often incorrectly encoded characters in the data sent back from the API, perhaps due to mixing together bits of text from different sources with slightly different or inconsistent encodings?

Thank you all for helping me. I solved my problem. The solution was to use “response.bodyBytes” instead of “response.body”. bodyBytes holds the raw UTF-8 encoded bytes, so just decode them and access the message content.
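
In code, that looks roughly like this (field names from the chat completions response; error handling omitted):

    // Requires: import 'dart:convert';
    // response is the http.Response from the API call.
    final data = json.decode(utf8.decode(response.bodyBytes));
    final content = data['choices'][0]['message']['content'];
    print(content); // accented characters now display correctly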

In Dart you can also resolve this with utf8.decode(<the text>.codeUnits);
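
For example, this repairs a string that was already decoded with the wrong charset (a self-contained sketch; the garbled literal is what UTF-8 bytes look like when read as Latin-1):

    import 'dart:convert';

    void main() {
      // 'célébrité' mis-decoded as Latin-1 shows up like this:
      final garbled = 'cÃ©lÃ©britÃ©';
      // codeUnits returns the original byte values; decode them as UTF-8.
      print(utf8.decode(garbled.codeUnits)); // célébrité
    }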

But I agree that it used to work all the time for me with the raw response from the OpenAI API directly encoded in UTF-8…

Thanks, this solved my problem.

I searched Google and the OpenAI API (Python) but could not find response.bodyBytes. Could you share the details of how to fix this problem?

    // Dart (package:http): bodyBytes holds the raw response bytes;
    // decode them as UTF-8 before parsing the JSON.
    var temp = json.decode(utf8.decode(response.bodyBytes));
    print(temp);

Similar to @badpaybad’s approach, for Python developers, I fixed it with a simple re-encode/decode (treat the mis-decoded text as Latin-1 and decode the resulting bytes as UTF-8):

response.choices[0].message.content.encode("latin-1").decode("utf-8")

I’m using the OpenAI Node.js API but still have text-encoding problems, e.g. “K%C3%A4ngor” instead of “Kängor”, but also “f\u00f6r” instead of “för”. So there are different kinds of wrong encoding: the first looks like percent-encoded UTF-8 and the second like a JSON Unicode escape (is the LLM generating these on purpose?).

Is going “raw” and using response.bodyBytes the only solution?

Switching to gpt-4-turbo-preview (from gpt-4-1106-preview) solved it for me.