Public repo of my finetuning data

daveshapautomator · April 8, 2022, 12:27pm

Since finetuning is a hot topic I figured I’d share my data. Most of this has been experiments leading up to further my research or side projects I’ve worked on. I figured I’d share to help the community better understand how to perform finetuning. The JSONL files are included so you can just grab them and go. This repo is under MIT license, so do whatever you want with it.

jon.oakes · April 13, 2022, 3:24pm

Thanks for this! I have just started down this path for our own chatbot (in a library) and this is a great shove in the right direction!

daveshapautomator · April 13, 2022, 8:19pm

Is this a public library setting? If so, I know someone you probably will want to talk to.

jon.oakes · April 13, 2022, 8:44pm

University Library, but, if successful, I’m sure it’d be useful for Public Libraries as well!

daveshapautomator · April 25, 2022, 8:16pm

@adallara is working on a related thesis right now.

jaronwilhelmsen · April 25, 2022, 9:20pm

Thanks for sharing your data like this! That is very helpful. I see that you’re including the explicit instructions for each tuning output, but would you ever include your instructions as one of the fine-tuning prompt files themselves? I’m trying to figure out how to incorporate explicit instructions I have for a game into a fine-tuning model format.

daveshapautomator · April 25, 2022, 9:42pm

I’m not quite sure what you mean. Can you show an example?

jaronwilhelmsen · April 26, 2022, 12:32am

Yeah, like for example you’ve got syn_prompt2 which you use in every prompt to direct the response for your tuning data. I’d call these the “instructions”. I was wondering if you’d ever include just those instructions in a prompt line.

For example, teaching the engine to play “Mastermind” I’d first give it a paragraph of instructions on how the game is played before moving on to show actual examples of playing in subsequent prompts. If I wanted a fine-tune for this, how would I incorporate the “instructions” paragraph. I tried once including it in as a prompt with a response of blank, but didn’t find it worked very well. Is that maybe not a use case for tuning data? Is there a better place to put explicit instructions?

daveshapautomator · April 26, 2022, 12:55am

If I understand you, then you don’t need the entire set of instructions. The model will implicitly learn the instructions - even if it’s something like chess or mastermind. Remember that GPT3 has already read a lot and knows the rules, speech patterns, etc. Fine-tuning just ensures that it will consistently use the embeddings it’s already got. You’re not teaching it anything new, you’re just making it practice using a skill it’s already got.

jaronwilhelmsen · April 26, 2022, 5:23pm

Ah ok, that makes sense. Thank you for your insight!

Topic		Replies	Views
Fine tune fine tuned models API	18	3798	January 30, 2024
Adding prompt info to fine-tuning API	14	3074	December 25, 2023
GPT3 Fine Tune Data API	18	2170	December 15, 2023
Data Distillation: Generate custom instructions for ChatGPT using your own data Prompting gpt-4 , chatgpt , project , prompt , prompt-engineering	1	4177	December 15, 2023
Five rules for finetuning from my experience, observations, and consulting Documentation	10	5164	September 5, 2023

Public repo of my finetuning data

Related topics