Hello, I am using a fine-tuned davinci model for learning purposes. The problem I'm facing is that while the playground version of my model answers with long responses close to my dataset, the Telegram version that works through the API shortens its answers a lot. For example, the playground version answered with about 200 words, while the Telegram one keeps it close to 20-30. I ensured they both have the same settings. The only difference might be that the Telegram version takes the previous dialogue as a prompt; however, it's still an issue even if it's fully restarted without any old dialogue in the prompt.
Welcome to the community.
It’ll be very helpful if you share the API request.
As @sps kindly requested, we need your completion params to accurately assist, especially the temperature(s).
In addition, please share your fine-tune params, including the n_epochs param, and a few example lines from your JSONL training file.
With that info we can help you for sure! Otherwise it is just “wild guessing”.
Of course. Here are the params:
n_epochs is not specified and should be the default, which I believe is 4?
temp = 0.8
max tokens = 500 (was 256, but it didn’t change much)
stop = END (which is a stop sequence for my dataset)
frequency penalty = 0.5
presence penalty = 0.5
With the same params, the playground version is still able to answer with long and well-rounded text. This one usually goes for short and uninformative answers. Here is my API request:
import requests

def openAI(prompt):
    # Send a completion request to the fine-tuned model
    response = requests.post(
        'https://api.openai.com/v1/completions',
        headers={'Authorization': f'Bearer {API_KEY}'},
        json={'model': MODEL, 'prompt': prompt, 'stop': STOP_SYMBOLS,
              'temperature': 0.8, 'max_tokens': 500,
              'frequency_penalty': 0.5, 'presence_penalty': 0.5},
    )
    print(response)
    result = response.json()
    print(result)
    # Join the text of all returned choices into a single string
    final_result = ''.join(choice['text'] for choice in result['choices'])
    return final_result
...
if '@RektGPT_BOT' in result['message']['text']:
    prompt = result['message']['text'].replace("@RektGPT_BOT", "")
    print(prompt)
    # Calling OpenAI API
    bot_response = openAI(f"{prompt}")
    # Sending back response to telegram group
    print(telegram_bot_sendtext(bot_response, chat_id, msg_id))
Did you forget this?
Thanks
Forget what exactly, sorry? Here are a few examples:
{"prompt":"How do you respond to criticism? ->","completion":" It really depends on intentions of why I was critisized in the first place. If it's just a troll trying to touch me I either troll back or ignore depending on my mood. If it's a professional criticism that comes from a man that have his brains set right, I will take it proudly and improve on it. That's how you grow, not putting your head in the sand. If it's a questionable piece of criticism, yet still real and have it's points, I will argue to find the truth, which probably lies in between. END"},
{"prompt":"What personality traits does a leader need? ->","completion":" True leader is more or less combination of different valuable personality traits. If we talk about pure high quality top notch leader, it will require so much that people of this caliber being born once in a century. If you talk about leader in general, I think this depends more on what he is about to lead. END"}
Thanks.
With the default 4 n_epochs (an underfitted model) and a relatively high temperature of 0.8, it is not surprising you will see a lot of completion variation and randomness.
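If you retrain, you can set n_epochs explicitly. A sketch against the legacy fine-tunes endpoint (the file ID and epoch count are illustrative, not values from this thread):

import openai

openai.api_key = API_KEY
fine_tune = openai.FineTune.create(
    training_file="file-abc123",  # hypothetical uploaded JSONL file ID
    model="davinci",
    n_epochs=8,                   # raise above the default of 4
)
print(fine_tune["id"])

Lowering the temperature (say, to 0.2-0.4) would also cut down the randomness.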
Hope this helps.
Thanks for providing the code @rektalizerofficial
Looking at the line where you build the json payload: are the variables MODEL, STOP_SYMBOLS, etc. global variables? Or have you removed them when sharing the code?
I also cannot find where you’re appending the -> separator to the prompt when calling the API.
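If it is missing, appending it is a one-liner; a minimal sketch against your code above (SEPARATOR is an assumed name, and both values are read off your JSONL examples, not confirmed):

SEPARATOR = " ->"        # must match the separator ending each JSONL prompt
STOP_SYMBOLS = [" END"]  # completion terminator, including the leading space

# Mirror the training format at inference time before calling the API
bot_response = openAI(prompt.strip() + SEPARATOR)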
They are global variables that I define earlier; the model is, of course, my fine-tuned model, and the stop sequence is the word END. I did try both appending and not appending the separator to the prompt; I just sent you the latter version. Appending it does help the model not continue my message, but it still answered short and sometimes tried to drift away from the topic.
So if I increase the n_epochs value when I train it again, will it improve?
The main issue for me is that the playground version of the same model differs drastically from what I get through the API completion. While the playground gives a full and rich answer, the API one seems “lazy”.
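One more thing I can try is dumping the exact payload the bot sends and comparing it with what the playground sends for the same prompt (a sketch mirroring the request code above):

import json

payload = {'model': MODEL, 'prompt': prompt, 'stop': STOP_SYMBOLS,
           'temperature': 0.8, 'max_tokens': 500,
           'frequency_penalty': 0.5, 'presence_penalty': 0.5}
# Compare this printout with the request the playground shows for the same prompt
print(json.dumps(payload, indent=2))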