'Prompt contains high-risk words' in playground but moderation API doesn't flag it

metalframe · May 11, 2023, 7:21am

Background: In the process of testing the API I took a completion generated by the 3.5turbo API (it was not unsafe content or an unsafe topic) and pasted it in the Playground as an assistant message. On pressing submit I was blocked by a ‘Prompt contains high-risk words’ message, and I followed its instructions to check the text I used with the moderation API.

Strangely, it didn’t flag it for anything (I tried both stable and latest endpoints, and received JSON responses for both) so I can’t even know what the problem is exactly.

metalframe · May 11, 2023, 8:50am

Oh I see, that explains a lot! I’ve narrowed it down to a single word that trips the filter in the Playground - but even when using this single word in isolation the moderation endpoints don’t flag it, funnily enough.

Topic		Replies	Views
Input in playground refused Community	5	719	January 3, 2024
O1 (mini & preview) API - getting 'prompt violating usage policy' on innocent prompts API	7	1359	December 19, 2024
Prompt contains high-risk words Prompting	2	1216	November 16, 2023
Getting different result when using playground vs API with gpt3.5-turbo API api	5	737	December 21, 2023
ImageNet class names contain high-risk words? API gpt-4 , chatgpt , api	8	696	July 8, 2023

'Prompt contains high-risk words' in playground but moderation API doesn't flag it

Related topics