Best practice around fine tuning?

excelformulabot.com · August 2, 2022, 3:31am

Hello! I own the site www.excelformulabot.com, which has received ~350K GPT-3 API requests in just a couple of weeks since launch.

The website provides the recommended Excel formula for a given problem someone is trying to solve.

There are some common themes in what davinci-002 consistently gets wrong, such as not being able to distinguish “contains” from “equals”.

I successfully uploaded a sample jsonl file, but I’m running into issues where the response is repeating the prompt: “Create the Microsoft Excel formula for the following problem: [dynamic entry]”.

Additionally, I’m curious… roughly 20% of the formula requests are deemed “incorrect” by the user (and by me, after auditing). Is it best practice to upload only the records that davinci-002 got wrong and that were then manually corrected? Or should I also include the ones that davinci-002 got correct?

Thank you in advance! If this is asking for too much, I understand. I’d be more than happy to lean on someone for consulting.

Thanks, David

daveshapautomator · August 2, 2022, 11:40am

Since you’re getting user feedback, all you need to do is accumulate correct responses and you’re golden.

For the repetition, you need to add a STOP token.
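To make the stop-token suggestion concrete, here is a minimal sketch of how the training file could be built, assuming the prompt/completion JSONL format of the fine-tuning endpoint. The separator and stop strings, and the helper names to_record, to_jsonl and strip_stop, are illustrative assumptions and do not come from this thread.

```python
import json

# Assumed conventions (not from the thread): any fixed strings work as long as
# they never appear inside a real problem description or formula.
SEPARATOR = "\n\n###\n\n"  # marks where the prompt ends
STOP = " END"              # marks where the completion ends


def to_record(problem: str, formula: str) -> dict:
    """Build one prompt/completion training record."""
    return {
        "prompt": problem.strip() + SEPARATOR,
        # Leading space before the formula, stop sequence after it.
        "completion": " " + formula.strip() + STOP,
    }


def to_jsonl(pairs) -> str:
    """Serialize (problem, formula) pairs, one JSON object per line."""
    return "\n".join(json.dumps(to_record(p, f)) for p, f in pairs)


def strip_stop(text: str) -> str:
    """Cut generated text at the stop sequence, in case the API did not."""
    return text.split(STOP, 1)[0].strip()
```

At request time the prompt would end with the same separator, and the same stop string would be passed as the stop parameter of the completion request, so that generation ends after the formula instead of running on.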

jhsmith12345 · August 2, 2022, 1:07pm

You only want high-quality, correct examples. Their source (synthetic or not) doesn’t matter.
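One way to read the advice in this thread is sketched below: keep every example whose formula is known to be correct, whether the model produced it or a human fixed it, and drop the rest. The log field names (problem, model_formula, marked_correct, corrected_formula) are hypothetical and would need to match whatever the site actually stores.

```python
from typing import Iterable, List, Tuple


def select_examples(log: Iterable[dict]) -> List[Tuple[str, str]]:
    """Return (problem, formula) pairs that are safe to train on."""
    examples = []
    for row in log:
        if row.get("corrected_formula"):
            # The model was wrong, but a manual correction exists: use the fix.
            examples.append((row["problem"], row["corrected_formula"]))
        elif row.get("marked_correct"):
            # The model's own answer was confirmed correct: keep it as-is.
            examples.append((row["problem"], row["model_formula"]))
        # Incorrect answers without a correction are skipped entirely.
    return examples
```

Under this reading, both the originally correct responses and the manually corrected ones end up in the training file; only unverified or uncorrected answers are left out.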
