What values should I use for epochs, batch_size, and learning rate when training a model on question-and-answer pairs? And how many samples are sufficient?
What API are you using?
I am using the fine-tuning mechanism described in the documentation.
Update:
I am using Davinci 003 for fine-tuning.
I think he meant which model, i.e., Davinci, etc.
As far as I know, epoch, batch_size and learning rate should be automatically set for best performance. I believe OpenAI recommends at least 200 examples, though the more the better. If you fine-tune with 1000+ examples and still aren’t getting good results, that might be the time to tinker with batch_size, epochs, etc.
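Before tinkering with hyperparameters, it's worth making sure the training data itself is formatted correctly, since that matters at least as much. A minimal sketch of writing question-answer pairs into the prompt/completion JSONL format the fine-tuning endpoint expects (the `###` separator and `END` stop token here are conventions from OpenAI's data-preparation guidelines, not hard requirements; adjust them for your setup):

```python
import json

def to_finetune_jsonl(pairs, path):
    """Write (question, answer) pairs as prompt/completion JSONL
    for fine-tuning. Separator and stop-token choices are conventions,
    not requirements."""
    with open(path, "w") as f:
        for question, answer in pairs:
            record = {
                # A fixed separator marks where the prompt ends
                "prompt": f"{question}\n\n###\n\n",
                # Leading space aids tokenization; "END" serves as a stop sequence
                "completion": f" {answer} END",
            }
            f.write(json.dumps(record) + "\n")

pairs = [
    ("What is the capital of France?", "Paris"),
    ("Who wrote Hamlet?", "William Shakespeare"),
]
to_finetune_jsonl(pairs, "train.jsonl")
```

Using a consistent separator and stop sequence across all examples lets the fine-tuned model learn where prompts end and completions stop, which tends to matter more for Q&A quality than small changes to epochs or batch size.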
From my GPT-2 experience, epochs matter because too much fine-tuning can result in overfitting, which means the language model repeats verbatim from the dataset rather than generating new content.
Hope this helps!