Gap between fine-tuning result and inference

carollhwrd · June 25, 2023, 3:45am

I fine-tuned the "davinci" model.
According to the training results file, almost all (prompt, completion) pairs in my training data are matched well by the fine-tuned model.
That is, nearly every prompt in my dataset shows training_token_accuracy=1.0 and training_sequence_accuracy=1.0.
But when I actually sent these same prompts to the fine-tuned model, the results were awful.
res = openai.Completion.create(
    model="davinci:*****",
    prompt="one of dataset prompt",
    temperature=0,
    stop=None,
)
Why? I changed max_tokens several times, but I still don't understand how max_tokens relates to the inference result.
How can I resolve this issue?

Foxalabs · June 25, 2023, 8:03am

Hi Carollhwrd,

Can you share some example prompts and replies, and explain how they differ from what you expected?

Also, you can wrap your code and data in triple backticks so they are formatted as code, like this:
```this is code```, which makes it more readable. Alternatively, you can use the </> control in the text box to do the same thing with the code section highlighted.
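
In the meantime, one thing that may be worth checking is whether the inference request uses the same prompt format as the training file. With legacy completion fine-tunes, training prompts commonly end with a fixed separator and completions end with a stop sequence. A request that omits the separator, or passes stop=None, can produce very different output even when training accuracy is 1.0. Note also that max_tokens only caps how long the completion can be; it does not change which tokens the model picks. Below is a minimal sketch under those assumptions. The separator and stop values and the helper name build_completion_request are hypothetical, so substitute whatever your training data actually used.

```python
# Hypothetical values: replace with the separator and stop sequence
# that actually appear in your training file.
PROMPT_SEPARATOR = "\n\n###\n\n"
STOP_SEQUENCE = " END"


def build_completion_request(model, raw_prompt, max_tokens=64):
    """Build keyword arguments for openai.Completion.create (legacy SDK).

    The prompt is given the same trailing separator as the training
    prompts, and the stop sequence ends generation where the training
    completions ended.
    """
    prompt = raw_prompt
    if not prompt.endswith(PROMPT_SEPARATOR):
        prompt = prompt + PROMPT_SEPARATOR
    return {
        "model": model,
        "prompt": prompt,
        "temperature": 0,
        "max_tokens": max_tokens,  # upper bound on completion length only
        "stop": [STOP_SEQUENCE],
    }


request = build_completion_request("davinci:*****", "one of dataset prompt")
# res = openai.Completion.create(**request)  # requires the legacy openai package and an API key
```

If the output still differs from the training completions with a matching format, posting one training example next to the actual reply would make the problem much easier to diagnose.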
