Any fine-tuning projects you’re working on?
Feel free to share with the community!
Any fine-tuning projects you’re working on?
Feel free to share with the community!
I’m going to fine-tune using my novels again, I think. I wonder if @eawestwrites has any recent tips?!
I might also use my 100,000+ character backstories to fine-tune.
What about you? Anything you’re working on fine-tuning?
Problems you’ve run into? Tips for us newbies?
I have all my emails from work that I would love to use as a base for a fine tuned model. They currently exist as .msg files. Anyone with an idea on how to approach this best?
As an input for the fine-tuning you need to create a data set composed of input-output pairs, i.e. pairs of user messages and assistant messages, as shown here in the example training data format, along with a system message (optional):
In your case you could construct this as follows:
In the system message you could provide some additional instructions regarding the task and some overarching stylistic guidelines. However, given you provide actual examples, you don’t have to overly detailed.
For additional guidance, you can take a look at the following fine-tuning guide.
Hello. Would you mind describing your process for fine-tuning models on novels? I had some success doing this using prompt/completion pairs with davinci, but now I’m trying to adapt for gpt-4o format. Thanks!
Hi @terekita - welcome back to the Forum!
Have you had a chance to look at this fine-tuning guide yet?
Let us know if you have a specific question and hopefully we’ll be able to point you in the right direction.
Thanks for your message. It’s more just a general question about what approaches people find useful for fine-tuning in the style of a novel. For my prior fine-tuning on davinci, I would take 3 sentences from the text and make that the prompt, and then the next three sentences for the completion.
For gpt-o mini I’m essentially doing the same thing for the user and assistant messages, and adding in a system message. This seems to work well also, but I’m wondering what other strategies people use for this task.
Thanks!
Here are a few links to posts you might find helpful: