What's the point of Output Moderation?

noorkhalifa · November 17, 2023, 6:32pm

As mentioned in the Moderation API guide, the API is used for:

monitoring the inputs and outputs of OpenAI APIs

I get the benefit of monitoring the inputs and how this prevents violations of the ToS, but how would monitoring the output benefit us as developers when the violating output was already given and might cause our account to be terminated?

Why do we need to monitor the outputs of GPTs? Don’t they come with moderation built in already?

I’m just trying to wrap my head around this moderation API, as my other account was terminated last week for no apparent reason on my side.

wclayf · November 17, 2023, 7:06pm

I don’t think you need to run the output through any moderation at all. The API is sending that back to you and doesn’t even know what you’re doing with it after that, right?

noorkhalifa · November 17, 2023, 7:26pm

Yeah, that’s what I figured, but I thought that we were required to do that and that the API’s output might cause an account termination.

_j · November 17, 2023, 7:31pm

The AI’s output has already been recorded by the time it has been generated.

The only thing you can do with moderations on the output is discourage repeating the action that produced the output.

You’d see that in ChatGPT where the output is flagged and turns orange. OpenAI doesn’t ban themselves for their model making the output, though.

Foxalabs · November 17, 2023, 8:01pm

Moderation checking on the input and output is best practice. I would use it unless I had an application where a few hundred milliseconds was critical, even on streaming. I will send the tokens to the renderer as they arrive, but every 15-20 tokens, at a space or punctuation mark, I test that block in another thread.

If I get a positive hit for moderation, then I pull those tokens from the renderer. It might not be a requirement now that there is a moderation stop reason, but I prefer belt and braces; an API account is a valuable thing.
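
For anyone who wants to try the streaming approach described above, here is a minimal sketch in Python. It is not Foxalabs's actual code. The function name `render_with_moderation`, the `is_flagged` callback, the `min_block` parameter, and the list standing in for the renderer are all hypothetical. The check also runs inline for simplicity, whereas the post runs it in another thread. In a real app, `is_flagged` would presumably wrap a call to the Moderation endpoint (with the `openai` Python package, something like `client.moderations.create(input=text).results[0].flagged`, but verify that against the current SDK docs).

```python
from typing import Callable, Iterable, List

# Characters treated as a safe place to cut a block (assumption).
BOUNDARY_CHARS = tuple(" .,;:!?\n")


def render_with_moderation(
    tokens: Iterable[str],
    is_flagged: Callable[[str], bool],
    min_block: int = 15,
) -> List[str]:
    """Return the tokens left on screen after block-by-block moderation."""
    rendered: List[str] = []  # stands in for the UI renderer
    block_start = 0           # index of the first token not yet checked
    stopped = False

    for token in tokens:
        rendered.append(token)  # show the token as soon as it arrives
        pending = len(rendered) - block_start
        # Only check once enough tokens are pending and we sit on a boundary.
        if pending >= min_block and token.endswith(BOUNDARY_CHARS):
            if is_flagged("".join(rendered[block_start:])):
                del rendered[block_start:]  # pull the flagged block
                stopped = True
                break                       # stop consuming the stream
            block_start = len(rendered)

    # Check whatever is left over when the stream ends normally.
    if not stopped and block_start < len(rendered):
        if is_flagged("".join(rendered[block_start:])):
            del rendered[block_start:]

    return rendered


if __name__ == "__main__":
    # Toy checker for demonstration only; a real one would call the API.
    demo = ["Hello ", "world. ", "bad ", "stuff. ", "more "]
    print(render_with_moderation(demo, lambda text: "bad" in text, min_block=2))
```

Checking inline adds the moderation latency to every block. Moving the `is_flagged` call to a worker thread, as in the post, keeps rendering smooth at the cost of flagged text being visible briefly before it is pulled.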
