I was excited about using fine-tuning to achieve what is essentially an unlimited context size.
However, according to the docs (OpenAI API), only these base models are available for fine-tuning:
davinci, curie, babbage, and ada
I tried these base models in the Playground and they seem very incoherent. Can you really build a production-level app by fine-tuning these models?
Yi
Welcome to the community. Fine-tuning does have its limitations. It’s still useful in some cases, but a lot of people are switching to embedding + completion… If you search around the forum, you should find some useful posts.
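For context, fine-tuning those base models means uploading a JSONL file of prompt/completion pairs and starting a job against the legacy fine-tunes endpoint. A minimal sketch with the 0.x Python library; the file name and its contents are placeholders:

```python
import openai

openai.api_key = "sk-..."  # placeholder key

# train.jsonl holds one JSON object per line in the legacy format, e.g.:
# {"prompt": "Ticket: printer won't connect ->", "completion": " hardware"}
upload = openai.File.create(file=open("train.jsonl", "rb"), purpose="fine-tune")

# Only the base models (davinci, curie, babbage, ada) are accepted here
job = openai.FineTune.create(training_file=upload["id"], model="davinci")
print(job["id"], job["status"])
```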
Thanks, I’ll search around and see what I can find.
Also, is my understanding correct that fine-tuning can only be used on base models such as davinci and curie, and that since they aren’t instruction-tuned, it’s hard to get good results with them? By contrast, with the embedding + completion approach I get to use models like text-davinci-003, which is instruction-tuned and lets me build something closer in quality to GPT-3.5 more quickly.
Yes, correct.
It depends on the use case, the dataset, and the fine-tuning performed, but generally, yes, it’s better to stick with the newer models and embeddings.
You can also get access to gpt-3.5-turbo (which is really cheap now at $0.002/1000 tokens…), and GPT-4 is in beta with a waitlist.
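Note that gpt-3.5-turbo uses the chat completions endpoint rather than the plain completions one. A minimal sketch; the key and messages are placeholders:

```python
import openai

openai.api_key = "sk-..."  # placeholder key

# gpt-3.5-turbo is called through the chat completions endpoint
response = openai.ChatCompletion.create(
    model="gpt-3.5-turbo",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Explain embeddings in one sentence."},
    ],
)
print(response["choices"][0]["message"]["content"])
```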
But yeah, most would agree, I think, that the embedding + completion approach is the way to go moving forward.
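The basic pattern: embed your documents once, embed each question, pick the most similar document, and stuff it into the prompt for a completion model. A minimal sketch, assuming text-embedding-ada-002 and text-davinci-003; the documents are made up:

```python
import numpy as np
import openai

openai.api_key = "sk-..."  # placeholder key

# Made-up knowledge base; replace with your own documents
docs = [
    "Fine-tuning adjusts a model's weights using prompt/completion pairs.",
    "Embeddings turn text into vectors you can search by similarity.",
    "text-davinci-003 follows natural-language instructions.",
]

def embed(text):
    resp = openai.Embedding.create(model="text-embedding-ada-002", input=text)
    return np.array(resp["data"][0]["embedding"])

doc_vectors = [embed(d) for d in docs]

question = "How do I search my own data?"
q_vec = embed(question)

# Cosine similarity picks the most relevant document
sims = [np.dot(q_vec, v) / (np.linalg.norm(q_vec) * np.linalg.norm(v)) for v in doc_vectors]
context = docs[int(np.argmax(sims))]

# Feed the retrieved context to the completion endpoint
prompt = f"Answer using only the context below.\n\nContext: {context}\n\nQuestion: {question}\nAnswer:"
completion = openai.Completion.create(model="text-davinci-003", prompt=prompt, max_tokens=100)
print(completion["choices"][0]["text"].strip())
```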
Hope you stick around our growing community here! We’ve got a great bunch of really smart people - and me!
Paul,
Great, thanks for your input. Yes, it’s quite exciting to be a part of this community. I look forward to participating with you all.
Hi Paul, how can embeddings help with more specific financial categorisation based on a business entity’s historical data?
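For illustration only, one common pattern: embed the entity’s labeled historical transactions and assign each new transaction the category of its nearest neighbour. A hedged sketch; the transactions and categories below are made up:

```python
import numpy as np
import openai

openai.api_key = "sk-..."  # placeholder key

# Made-up labeled history; replace with the entity's real transactions
history = [
    ("AWS invoice March", "Cloud hosting"),
    ("Staples order 1123", "Office supplies"),
    ("Delta flight JFK-SFO", "Travel"),
]

def embed(text):
    resp = openai.Embedding.create(model="text-embedding-ada-002", input=text)
    return np.array(resp["data"][0]["embedding"])

labeled = [(embed(desc), category) for desc, category in history]

def categorise(description):
    # Assign the category of the most similar historical transaction
    vec = embed(description)
    sims = [np.dot(vec, v) / (np.linalg.norm(vec) * np.linalg.norm(v)) for v, _ in labeled]
    return labeled[int(np.argmax(sims))][1]

print(categorise("United flight ORD-LAX"))  # expected: Travel
```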