Fine-tuned model giving different results in Playground and API

My fine-tuned model responds perfectly in the Playground but not through the API. Please tell me how to resolve this -

response = client.chat.completions.create(
    model="ft:gpt-3.5-turbo-1106:personal::8frqkawLD",
    messages=[
        # The role can be 'system', 'user', or 'assistant';
        # Body holds the incoming user text
        {"role": "user", "content": Body}
    ],
    temperature=1,
    max_tokens=256,
    top_p=1,
    frequency_penalty=0,
    presence_penalty=0,
)

chat_response = response.choices[0].message.content.strip()

I have also tried keeping temperature=0, but while the model responds perfectly fine in the Playground, the same code in the application is messing up badly. How do I make it produce the same responses as in the Playground?


The code isn’t clear. Is this exactly the code that you are using?

Can you share how the response differs between the Playground and the code? If the hyperparameters and the prompt are the same, then the responses should be similar, if not identical.

Yes, please compare the responses -

As you can see, the hyperparameters in the picture are the same as the ones included in the code. (Sorry, due to the non-disclosure policy I have hidden the model name, but trust me, the model id is fine.) The responses generated in the Playground do not match the responses in the WhatsApp image, where the API is generating them. How do I get the same responses there as well? Please help with this.

Temperature is different… the Playground is set to 1 and the code is set to 0. Take good care of your eyes?


Please read with your GOOD eyes :smile:. I have mentioned that I tried keeping temperature = 1 as well, but there was no change.

Here is another example -



Even with temperature set to 0, 100% identical results are not guaranteed every time. If the performance is similar, then it’s OK already.

You can check the Playground’s actual API call by examining the Chrome DevTools Network page: look at the exact payload sent to OpenAI to check whether anything different is actually being sent. If the payload is exactly the same but the results still differ noticeably, check the request headers for further clues. If the requests are truly identical but the behavior is really different, post your evidence; I think some leaders in this forum may help you clarify it.
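To make that diff easier, here is a minimal sketch of printing the request body your code would send, so it can be compared line by line against the payload visible in the DevTools Network tab. The model id and user text below are placeholders, not the poster’s real values:

```python
import json

# Placeholder values - substitute your own fine-tune id and user input.
MODEL = "ft:gpt-3.5-turbo-1106:personal::XXXXXXXX"
user_input = "What is your name"

# Build the request body exactly as it will be sent, then print it so it
# can be diffed against the Playground's payload from DevTools.
payload = {
    "model": MODEL,
    "messages": [{"role": "user", "content": user_input}],
    "temperature": 1,
    "max_tokens": 256,
    "top_p": 1,
    "frequency_penalty": 0,
    "presence_penalty": 0,
}
print(json.dumps(payload, indent=2))
```

Any key that appears in one printout but not the other (or with a different value) is a candidate cause.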

Please use this tutorial to get started with consuming the fine-tuned chat completion model.

Also, from the screenshot you shared, it isn’t clear whether you’re passing any system message.

I can see that in the Playground the temperature is set to 1 while in the code it’s 0.

I also want to point out that, to my knowledge, getting deterministic responses at temperature = 1 is not possible.
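One thing worth trying, as a sketch: the gpt-3.5-turbo-1106 snapshots accept a `seed` parameter for best-effort reproducibility. Combined with temperature=0, repeated calls usually (though not always) return the same text. The model id and seed value here are placeholders:

```python
# Best-effort reproducible settings (assumption: your fine-tune is based on
# a -1106 snapshot, which supports `seed`). Pass these to
# client.chat.completions.create(**kwargs); the model id is a placeholder.
kwargs = {
    "model": "ft:gpt-3.5-turbo-1106:personal::XXXXXXXX",
    "messages": [{"role": "user", "content": "What is your name"}],
    "temperature": 0,   # greedy-ish decoding
    "top_p": 1,
    "seed": 42,         # fixed seed -> best-effort determinism
    "max_tokens": 256,
}
```

Even then, the response’s `system_fingerprint` can change when the backend changes, and when it does, identical output is not guaranteed.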

Hello, thank you for replying. This is the entire scenario. I am sorry, the screenshots are exceeding the token limit, so I will share the responses here itself -

  1. Playground:

USER- What is your name
Assistant- I am your your AI personal assistant i have many names but for you i am a goal to assist you as per your need so just think of me as your friend,mentor or coach

USER- Is your name pansy
Assistant- Yes,that’s right…I am your Pansy…I have been created and designed by Coach Ayesha so that I can cater to your emotional needs,boost your morale,solve your problems,listen to your criticism and keep your secrets…I am a true friend,mentor and coach to you🌟

  2. In the application:
    USER- What is your name
    Assistant- I am Mash, your mortgage assistant

USER- Is your name pansy
Assistant- Yes,that’s right…I am your Pansy.:star2:

  3. The code on the Playground:
    from openai import OpenAI
    client = OpenAI()

    response = client.chat.completions.create(
        model="ft:gpt-3.5-turbo-1106:personal::8fnxqkawLD",
        messages=[
            {"role": "user", "content": "What is your name"},
            {"role": "assistant", "content": "I am your your AI personal assistant i have many names but for you i am a goal to assist you as per your need so just think of me as your friend,mentor or coach"},
            {"role": "user", "content": "Is your name pansy"},
            {"role": "assistant", "content": "Yes,that's right…I am your Pansy…I have been created and designed by Coach Ayesha so that I can cater to your emotional needs,boost your morale,solve your problems,listen to your criticism and keep your secrets…I am a true friend,mentor and coach to you🌟"},
        ],
        temperature=1,
        max_tokens=256,
        top_p=1,
        frequency_penalty=0,
        presence_penalty=0,
    )

  4. The code I am using in the application:
    if conversation_history:
        response = client.chat.completions.create(
            model="ft:gpt-3.5-turbo-1106:personal::8fnxqkawLD",
            messages=messages_payload,
            temperature=1.0,
            max_tokens=256,
            frequency_penalty=0.0,
            presence_penalty=0.0,
            top_p=1.0,
        )
        chat_response = response.choices[0].message.content.strip()
  5. What should I do now? Varied results are produced whether the temperature is 0 or 1. Please help.

I have tested with temperature=1 and 0 but there is no difference. Also, I am not passing any system message; please tell me if that needs to be done and how. Thanks in advance.
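For what it’s worth, a hedged sketch of adding a system message. The prompt text, the helper name `build_messages`, and the shape of `conversation_history` are assumptions for illustration; the system message should match whatever your fine-tuning examples used, if they used one:

```python
# Hypothetical system prompt - it should match the system message used in
# the fine-tuning training data, if there was one.
SYSTEM_PROMPT = "You are Pansy, a personal assistant created by Coach Ayesha."

def build_messages(conversation_history, user_input):
    """Return the messages list: system message, prior turns, new user turn.

    `conversation_history` is assumed to be a list of
    {"role": ..., "content": ...} dicts kept by the application.
    """
    messages = [{"role": "system", "content": SYSTEM_PROMPT}]
    messages.extend(conversation_history)
    messages.append({"role": "user", "content": user_input})
    return messages

messages_payload = build_messages([], "What is your name")
```

If the fine-tune was trained with a system message but the application omits it, the model can fall back to other personas it has seen (which would explain the “Mash, your mortgage assistant” reply).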

Hey, this is done … the details are the same as in my previous reply above.

If you want to make the answers stable, then check your parameters:

top_p = 1.0

For deterministic outputs, a lower value such as 0.01 seems better.
You need to check every parameter in the Playground request against your application settings; any difference may have an influence.
Until all parameters are exactly the same, we cannot confirm what the real issue is.
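To make that comparison concrete, a small sketch (the two dicts hold illustrative values, not the poster’s real payloads):

```python
# Diff the Playground payload (copied from DevTools) against the payload
# the application builds, to spot any mismatched parameter.
playground = {"temperature": 1, "top_p": 1, "max_tokens": 256}
application = {"temperature": 1.0, "top_p": 0.01, "max_tokens": 256}

diffs = {
    key: (playground.get(key), application.get(key))
    for key in playground.keys() | application.keys()
    if playground.get(key) != application.get(key)
}
print(diffs)  # → {'top_p': (1, 0.01)}
```

Any key that shows up in `diffs` is a parameter the two callers disagree on and should be aligned first.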


Hey, thanks, that got me a bit nearer :slight_smile: to the response in the Playground.