How to correctly fine tune my own model?

polterguy · January 21, 2023, 9:22am

I have created an application that will scrape websites, generating fine-tuning data, by chopping the HTML up into Hx tags and paragraphs, in addition to crawl links it finds. Scraping two of my own related websites, it produces roughly 1,500 training “snippets”. However, when I submit this to training, and I use my fine tuned model, the model is only 50% accurate.

Can somebody explain (like I’m 5) how I need to apply settings during training to improve accuracy please?

And also, I’ve got this feature allowing the models to be “reinforced” by logging chat requests, making it easy to edit requests, and generating new training data based upon my edited responses. The idea is to allow for starting out with any website, generating a trained model based upon that website, modify it manually by human supervision, and “retrain” it again. How should my settings for the retraining be applied …?

You can see the product, and try it out, at AISTA (dot) com …

It’s also open source if you search for it. I can’t post links here unfortunately, but you can follow the bread crumbs provided above to find it if you’re really interested …

i-technology · January 21, 2023, 10:36am

Did you generate prompt/optimal response snippets for fine tuning, or did you generate embeddings from those snippets? Just curious

polterguy · January 21, 2023, 12:04pm

It creates prompt/completions for fine-tuning. But really, I’ve got no idea of the right process here. Been reading the docs, but they don’t really give away much …

i-technology · January 21, 2023, 12:18pm

This might help, but just starting to figure this stuff out myself…

Topic		Replies	Views
Fine tuning - how exactly does it work? API	6	2380	December 23, 2023
Fine tuning completation API	9	2337	December 25, 2023
Trying to fine tune in python? API	4	1380	April 28, 2023
What's better for the type of chatbot I am building? Fine tune or embedding? Community chatgpt , api	10	2092	August 20, 2023
Fine-Tuning with Non-Prompt/Completion Data: Seeking Advice for Direct Text-Based Training? API gpt-4 , chatgpt , fine-tuning , api	3	223	August 23, 2024

How to correctly fine tune my own model?

Related topics