Overfitting when giving samples in prompts

I’m trying to use few-shot learning to get GPT to generate answers to questions in a specific manner.

However, the generation for a question completely unrelated to the sample questions is still being influenced by the sample answers.

Has anyone encountered this and found a prompt tweak that might help sort it out?
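One tweak that sometimes helps is telling the model explicitly that the samples demonstrate the answer *format* only, not the content. Here's a minimal sketch of that prompt layout; the questions, answers, and function name are hypothetical placeholders, not from this thread.

```python
# Hypothetical sketch: a few-shot prompt with an explicit
# "format only" instruction so sample content bleeds less
# into answers for unrelated questions.

def build_prompt(samples, new_question):
    """Assemble a few-shot prompt from (question, answer) pairs."""
    header = (
        "The examples below show the required ANSWER FORMAT only. "
        "Do not reuse their content when answering the new question.\n\n"
    )
    shots = "\n\n".join(f"Q: {q}\nA: {a}" for q, a in samples)
    return f"{header}{shots}\n\nQ: {new_question}\nA:"

samples = [
    ("What is the capital of France?", "Answer: Paris. Confidence: high."),
    ("What is 2 + 2?", "Answer: 4. Confidence: high."),
]
prompt = build_prompt(samples, "Who wrote Hamlet?")
print(prompt)
```

The header instruction goes first so it frames how the model reads the examples that follow.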



Are you talking about fine-tuning a particular base model @udm17 ?

Not fine-tuning. Using samples along with a prompt to generate the answer for a question. Sadly I can’t share more info, as it’s for proprietary software.

What settings are you using? Temperature, frequency_penalty, etc.?

Temperature - 0
Freq Penalty - 0
Top P - 1
Presence Penalty - 0

In my experience, the lower you go with temperature, the more likely it is to overfit/repeat… If you can’t raise the temperature, try moving frequency_penalty up a bit… but slowly… 0.05 at a time maybe?
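That stepping advice can be sketched as a small helper that builds the request settings, nudging `frequency_penalty` up 0.05 per step while keeping everything else from the thread fixed. This is a sketch only: it builds the kwargs dict and makes no API call, and the model name is an assumption.

```python
# Hedged sketch: raise frequency_penalty 0.05 at a time, per the
# advice above, while keeping temperature at 0. No API call is made;
# the model name is an assumption.

def request_kwargs(step, model="gpt-3.5-turbo"):
    """Return completion settings for penalty step 0, 1, 2, ..."""
    return {
        "model": model,
        "temperature": 0,       # deterministic, per the thread
        "top_p": 1,
        "presence_penalty": 0,
        # frequency_penalty is capped at 2.0 by the API
        "frequency_penalty": round(min(step * 0.05, 2.0), 2),
    }

for step in range(4):
    print(request_kwargs(step)["frequency_penalty"])
# → 0.0, 0.05, 0.1, 0.15
```

Re-running the same prompt at each step and stopping at the first setting that breaks the repetition keeps the output as close to deterministic as possible.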

Good luck!

Cheers Paul.

This is something I have been tinkering with a bit; hopefully I can find the sweet spot soon. I want the answer to be deterministic to a certain extent, so I have been using a low temperature, and that has definitely led to overfitting.

Hi @udm17

Which model are you talking about?

It’s a secret?


The base model I’m currently using is text-davinci-003, and I’ve recently started using gpt-3.5-turbo. I’m not sure whether a fine-tuned davinci would solve my problem.

Fine-tuning is the best OpenAI tool available for better model fitting.
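For the davinci-era base models mentioned above, fine-tuning data took a prompt/completion JSONL shape. A minimal sketch of one training line, with placeholder Q/A content (the separator and stop conventions shown are assumptions you'd tune to your own format):

```python
import json

# Sketch of the legacy prompt/completion JSONL format used for
# fine-tuning davinci-era base models. Content is a placeholder.

examples = [
    {
        "prompt": "Q: Who wrote Hamlet?\n\nA:",
        "completion": " Answer: William Shakespeare. Confidence: high.\n",
    },
]

# One JSON object per line, as the fine-tuning endpoint expected.
jsonl = "\n".join(json.dumps(e) for e in examples)
print(jsonl)
```

With enough such pairs, the target answer style is baked into the weights, so no samples need to ride along in the prompt and can't bleed into unrelated questions.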


