I'm working on implementing Reinforcement Learning from Human Feedback (RLHF) with the OpenAI APIs and want to confirm whether my feedback structure is suitable for this use case. My goal is to serve model responses, collect user feedback on them, and use a reward signal for fine-tuning in an RLHF-like setup. This is the structure I'm using for each feedback record (a sketch of how I assemble them follows the JSON):
{
  "messages": [
    {
      "role": "system",
      "content": "You are a virtual assistant that responds to customer email requests with an action and a polite message."
    },
    {
      "role": "user",
      "content": {
        "feedback": {
          "prompt": "",
          "given_response": {
            "subject": "",
            "message": "",
            "action": ""
          },
          "correct_response": {
            "subject": "",
            "message": "",
            "action": ""
          },
          "error_message": "Model Selected Wrong Subject",
          "reward": 1
        }
      }
    }
  ]
}
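For context, here is a minimal sketch of how I assemble and store these records on my side. The `EmailResponse` dataclass, the `build_feedback_record` helper, and the `feedback_records.jsonl` file name are my own placeholders, not part of any OpenAI API:

```python
import json
from dataclasses import dataclass, asdict

@dataclass
class EmailResponse:
    subject: str
    message: str
    action: str

def build_feedback_record(prompt: str,
                          given: EmailResponse,
                          correct: EmailResponse,
                          error_message: str,
                          reward: int) -> dict:
    """Wrap one piece of human feedback in the chat-style structure shown above."""
    return {
        "messages": [
            {
                "role": "system",
                "content": ("You are a virtual assistant that responds to customer "
                            "email requests with an action and a polite message."),
            },
            {
                "role": "user",
                "content": {
                    "feedback": {
                        "prompt": prompt,
                        "given_response": asdict(given),
                        "correct_response": asdict(correct),
                        "error_message": error_message,
                        "reward": reward,
                    }
                },
            },
        ]
    }

# Append each record as one line of JSONL (all values here are placeholders).
record = build_feedback_record(
    prompt="...",
    given=EmailResponse(subject="...", message="...", action="..."),
    correct=EmailResponse(subject="...", message="...", action="..."),
    error_message="Model Selected Wrong Subject",
    reward=1,
)
with open("feedback_records.jsonl", "a", encoding="utf-8") as f:
    f.write(json.dumps(record) + "\n")
```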
My Key Questions:
- Can this structure be used to simulate RLHF, i.e., to provide feedback on a given response?
- What is the best reward range to use for RLHF fine-tuning (e.g., [-1, 1] vs. [-5, 5])? A sketch of the rescaling I have in mind follows this list.
- Does OpenAI’s API currently support RLHF directly, or would this approach only work for collecting data for future fine-tuning?
- Should I use RLHF or RAG (Retrieval-Augmented Generation) for this purpose?
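To make the reward-range question concrete: as far as I understand, [-1, 1] and [-5, 5] only differ by a linear rescaling, so my real question is whether the scale itself matters for the fine-tuning pipeline. A small illustration (the function name is just mine):

```python
def rescale_reward(reward: float,
                   src: tuple[float, float] = (-1.0, 1.0),
                   dst: tuple[float, float] = (-5.0, 5.0)) -> float:
    """Linearly map a reward from the source range to the destination range."""
    src_lo, src_hi = src
    dst_lo, dst_hi = dst
    return dst_lo + (reward - src_lo) * (dst_hi - dst_lo) / (src_hi - src_lo)

# Example: a reward of 1 on [-1, 1] corresponds to 5.0 on [-5, 5].
print(rescale_reward(1.0))   # 5.0
print(rescale_reward(-0.5))  # -2.5
```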