Hi,
I’m working with the gpt-4o-realtime API and am facing a content-filter issue that I can reproduce on demand.
I narrowed it down to the API being unable to generate French text and audio. Here’s how I came to this conclusion:
- Instruction in French, asking to generate in English → works fine.
- Instruction in English, asking to generate in English → works fine.
- Instruction in English, asking to generate in French → response incomplete, content_filter.
- Instruction in French, asking to generate in French → response incomplete, content_filter.
- Instruction in any language, asking to generate in any language but French → works fine.
In every test, I used the exact same instruction:
"Say this sentence translated in ${lang}, without any other comment : ${text}"
where lang is the target language (variable) and text is a constant paragraph in French, in which a man asks about parking assistance. Here it is:
“Je me gare des dizaines de fois par jour dans des rues souvent encombrées et avec peu de visibilité. J’ai parfois peur de rayer mon véhicule ou d’avoir un accrochage en manœuvre. Quels équipements peuvent m’aider à me garer plus sereinement et à éviter les petits chocs au quotidien ? Et sont-ils faciles à activer ? C’est que je ne suis pas un pro de l’informatique…”
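For anyone who wants to reproduce this, here is a minimal sketch of how the test matrix can be built. Only the instruction template comes from my tests. The helper names (buildInstruction, buildResponseCreate) are hypothetical, the language list is just an example, the text is shortened to the first sentence of the paragraph above, and sending the instruction through a response.create event is an assumption about the setup rather than a requirement.

```javascript
// Sketch only: helper names and the language list are illustrative assumptions.
// TEXT is the first sentence of the paragraph above, shortened for readability.
const TEXT =
  "Je me gare des dizaines de fois par jour dans des rues souvent encombrées et avec peu de visibilité.";

// Builds the exact instruction string used in every test.
function buildInstruction(lang, text) {
  return `Say this sentence translated in ${lang}, without any other comment : ${text}`;
}

// Wraps the instruction in a response.create client event
// (assumed here to be how the instruction reaches the model).
function buildResponseCreate(lang, text) {
  return {
    type: "response.create",
    response: {
      modalities: ["text", "audio"],
      instructions: buildInstruction(lang, text),
    },
  };
}

// One event per target language; only the French one is expected to fail.
const languages = ["English", "French", "Spanish", "German"];
const events = languages.map((lang) => buildResponseCreate(lang, TEXT));

for (const event of events) {
  console.log(JSON.stringify(event));
}
```

Each event would then be sent over the already-open realtime connection, one at a time, waiting for the matching response.done before sending the next.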
I tried other French sentences and every one of them ends with content_filter. For instance:
{
  type: 'response.done',
  event_id: 'event_B7gTEwp07n8S9oZekbpUV',
  response: {
    object: 'realtime.response',
    id: 'resp_B7gTA7Wx91S8ZtcIJixJ1',
    status: 'incomplete',
    status_details: { type: 'incomplete', reason: 'content_filter' },
    output: [ [Object] ],
    conversation_id: 'conv_B7gTATBbMOe9FDzGBivHh',
    modalities: [ 'text', 'audio' ],
    voice: 'echo',
    custom_voice_id: null,
    output_audio_format: 'pcm16',
    temperature: 0.6,
    max_output_tokens: 'inf',
    usage: {
      total_tokens: 358,
      input_tokens: 98,
      output_tokens: 260,
      input_token_details: [Object],
      output_token_details: [Object]
    },
    metadata: null
  }
}
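The dump above hides the nested output and token details as [Object]. Below is a small sketch of how the filtered responses could be detected and printed in full. The function name isContentFiltered is hypothetical, and the sample event is a trimmed copy of the one above with a placeholder in output, not real data.

```javascript
// Sketch only: isContentFiltered is a hypothetical helper name.
// Returns true when a response.done event was cut short by the content filter.
function isContentFiltered(event) {
  return (
    event?.type === "response.done" &&
    event.response?.status === "incomplete" &&
    event.response?.status_details?.reason === "content_filter"
  );
}

// Trimmed copy of the event above; the output item is a placeholder.
const sample = {
  type: "response.done",
  response: {
    status: "incomplete",
    status_details: { type: "incomplete", reason: "content_filter" },
    output: [{ placeholder: "nested item that console.log shows as [Object]" }],
  },
};

if (isContentFiltered(sample)) {
  // JSON.stringify expands every nesting level, unlike the default console.log depth.
  console.log(JSON.stringify(sample, null, 2));
}
```

Printing the full event this way should show how much text or audio transcript was generated before the filter cut the response off.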
Can anybody help, please? Feel free to ask for any further information.