Other languages and unsafe completions

cosmin.parvulescu · April 22, 2021, 10:12am

Hello,

Just got accepted to the beta, can’t wait to check up the capabilities!
I tried the autocomplete sandbox for something in my language (Romanian) and funnily enough it got flagged as unsafe by the system (even though it’s not, in Romanian nor English). Now, this makes me curious regarding two things:

1 - Should I mark unsafe content in my own language, if it does come up?
2 - Is a production app in another language viable?

Cheers!

joey · April 22, 2021, 2:42pm

Hi Cosmin, welcome to the beta! The content filter errs on the side of caution, in order to help prevent unsafe completions from slipping through.

You can read our documentation on the content filter here. In short:

“We generally recommend not returning to end-users any completions that the Content Filter has flagged with an output of 2.”

The API works best with English, although it’s absolutely viable with many other languages (including e.g. for translation), as seen here.

I hope that helps, and please let me know if you have any other questions!

cosmin.parvulescu · April 22, 2021, 8:34pm

Hi Joey, thanks for the answer!

My curiosity is directed towards some particular keywords, if I may, “cum” means “how” in my language, and something else in English. How should I, as a user of the system, mark this. Is it profane or valid because in my language it’s a legit word. I guess this would effect the model if compounded.

joey · April 23, 2021, 7:26am

Hi Cosmin, if you have a feeling that something is mis-classified, you can report the mis-classification in the playground.

Additionally, if the content filter returns a 0 or 1, you can show those completions to the user, and if it returns a 2, you can try to re-generate the completion, or show it anyways if the logprob is < -0.355.

Topic		Replies	Views
Clarity on sensitive content filters if it could be considered harmful in a different context Community	1	845	May 22, 2021
How to add NSFW filter to Completion API? API	5	2570	November 2, 2023
Do we need to use "content filter" for both prompt AND competitions? Community	2	1250	August 16, 2022
Bug: Moderation-API returns that really bad input is ok API	6	935	December 18, 2023
Need help with explicit or inappropriate content 🙊 API	3	5207	December 20, 2023

Other languages and unsafe completions

Related topics