Custom Moderation GPT Model | Fine Tuning

anon22939549 · July 25, 2024, 1:59am

To me, topics like this fall under the big umbrella of “don’t fight the model.”

The model “wants” to be helpful, so trying to give it a bunch of directives to not be helpful only really downgrades the response.

The solution I always propose for this is to filter-in and filter-out.

Just send them user’s message to a super cheap model and ask if it’s on topic, and pass the response through asking if if it’s on topic.

The out-pass wrecks streaming, but you can choose to only do it for questionable outputs or when you’ve already determined a particular user is trying to make the model talk about things you don’t want it to.

Topic		Replies	Views
Conditoning GPT4 API on my specific use case API	4	1289	March 28, 2023
Building a chatbot using gpt-3.5 turbo: Is there a way to ensure that chatbot strictly adheres to the specified domain? API api	7	1535	October 6, 2023
Avoid certain responses and prompts and generate responses as per my input API gpt-4 , gpt-35-turbo , chatgpt , fine-tuning	9	2283	March 6, 2024
Fine-Tuned Models to Strictly Follow Instructions API fine-tuning , fine-tuning-problems	6	310	June 10, 2024
Avoid overfitting during the fine-tuning of gpt-3.5 turbo API gpt-35-turbo , fine-tuning , fine-tuning-problems	4	2894	December 21, 2023

Custom Moderation GPT Model | Fine Tuning

Related topics