Davinci Fine-Tuning?

jxl38 · August 28, 2021, 7:56pm

I’ve been working on a songwriting assistant for a while with some really cool results. However, I feel like it could really be augmented by fine-tuning with Davinci. My Few-Shot results have been great, however, sometimes it becomes a bit leftfield when I create more specific prompts and restraints. I did a curie fine-tune that is producing okay, but repetitive results.

So, I’ve applied for access to the Davinci fine-tune. Is there any way to check on the status of my application?

daveshapautomator · August 28, 2021, 9:49pm

I found that a well fine-tuned CURIE can outperform DAVINCI few-shot examples. You might want to take a second look at your data and hyperparameters. How many samples are you using for the finetune? What temp, top_p, and penalties are you using?

jxl38 · August 29, 2021, 5:42pm

So, I’m using over 200 examples, and the temperature is 0.7 & top_p is 1. Penalties don’t seem to change results much.

boris · August 29, 2021, 11:17pm

You may want to change the fine-tuning structure into a conditional generation, which is less likely to be repetitive.

Also 200 is a small amount of data - if you have any way of increasing this, that would greatly improve the performance.

boris · August 31, 2021, 9:36am

Thanks that’s very interesting! You could use more data - eventually it’ll learn this.

You could also create a simple discriminator based on a deterministic check if a particular completion ends with a desired word. Then you generate multiple completions and pick the one which ends with the appropriate word

jxl38 · August 31, 2021, 8:42pm

Absolutely,

I actually currently have that type of logic in the method that calls the API in my Davinci implementation as a failsafe for the few times Davinci doesn’t get it. However, Curie almost never gives me the word at the end of the sentence, so this would cause it to timeout my request counter pretty much every time, and cause big latency issues.

Thanks for the tip, I’ll increase my data! let’s try 10,000 lines!

jxl38 · September 1, 2021, 4:11am

I built a parser that will load up text into the jsonl format. I’m a writer with lots of raw text laying around. So I just load my text into the parser and it slices it up into usable jsonl for the fine-tune job.

jxl38 · September 1, 2021, 4:20am

That’s a great idea. When I get desired results, I like to feed it back in to the prompt. It definitely wouldn’t hurt to have a few thousand good completions ready to go.

chimpsarehungry · September 2, 2021, 7:27pm

Awesome! Did you open source that parser? Also a writer.

craig.thomler · September 3, 2021, 5:01am

We did that some months ago and had groups coming together to sing the songs…

The link is to AI’s Got Talent’ which we ran back in Feb

clarence.hu · September 3, 2021, 3:50pm

thanks, boris. what do you recommend for a minimum amount of data for fine-tuning? assuming there’s no fixed threshold, could you offer any rules of thumb or at least order of magnitude guidance?

jxl38 · September 3, 2021, 5:16pm

Very cool results. The Java song was great!

jxl38 · September 4, 2021, 7:45am

Hi @chimpsarehungry,

No, the parser isn’t open source. It’s specifically tailored to my use case. However, it’s pretty straightforward to build one, all you need to do is to slice up your text, place it into the jsonl format, and then write it to a new file for uploading to the fine-tunes endpoint.

chimpsarehungry · September 6, 2021, 4:59pm

Thanks. I am confused about the need for prompt + completion in this format. I was used to fine-tuning GPT-2 by just providing lines of text so it becomes more similar to the writing in the training set. If I don’t have a prompt + completion format for this application, what is possible? Maybe just break every sentence in half?

Cheers,
Shane

clarence.hu · September 9, 2021, 5:19pm

hi @boris i hope your week is going well. pinging again on this message, if you don’t mind sharing your thoughts on the minimum amount of data required to fine-tune.

boris · September 9, 2021, 5:43pm

This guide will hopefully answer your question in more detail. OpenAI API Normally a few hundred examples is a good start, and then you’ll see a linear increase in performance roughly for every doubling of the dataset.

jxl38 · September 10, 2021, 8:35pm

Hi! So, what I did that seemed to work very well was to parse sentences two by two and put the first sentence in the prompt, and the proceeding sentence in the completion. However, I’ll bet that splitting the sentence in half would work too!

chimpsarehungry · September 10, 2021, 10:30pm

Ok great! Maybe a combo of 1/4th sentence + 3/4th. 1/2 + 1/2. 3/4 + 1/4th. 2 by 2 like you did. And other combos.

jxl38 · September 11, 2021, 9:04am

I like to try things and log the results!

magicpixie · September 12, 2021, 2:08pm

Are you using fine-tuning with Curie? I know there are a lot of lyrics websites out there, you could probably scape those to get a large enough dataset. Even GPT-2 when fine-tuned becomes pretty powerful, that might be sufficient without Davinci access

Topic		Replies	Views
Use "private" dataset as basis for AI responses Prompting	29	3001	December 16, 2023
Adding prompt info to fine-tuning API	14	3190	December 25, 2023
Fine tuning completation API	9	2423	December 25, 2023
Should prompts be unique for fine-tuning? Prompting	9	1779	December 25, 2023
Fine tuning is beautiful API	7	1914	December 25, 2023

Davinci Fine-Tuning?

Related topics