The models should be fine-tuned for the latest API features

I’m starting to do a lot of code gen that asks o1-mini to write code for calling OpenAI models directly. I’ve noticed that it tends to default to gpt-4 or even gpt-3.5 when it calls the Chat Completions API. That’s presumably because that’s what it’s seen prior to its knowledge cutoff. The model also doesn’t know about newer features like Structured Outputs or even JSON mode.

Given that OpenAI controls the fine-tuning of these models, they should fine-tune all of them to know about their latest model names, API endpoints, and features. I mean, shouldn’t o1-mini naturally prefer to call itself?
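For anyone who hasn’t used it, Structured Outputs is requested through the `response_format` parameter on Chat Completions — exactly the kind of payload shape an older model won’t produce on its own. Here’s a minimal sketch of what that request looks like; the schema contents and the prompt are just placeholders I made up:

```python
# Sketch of a Chat Completions request body that uses Structured Outputs.
# The schema name/fields and the prompt are illustrative placeholders;
# the payload shape follows the documented "json_schema" response_format.

def build_structured_request(model: str, user_prompt: str) -> dict:
    """Build a Chat Completions payload that asks for strict-schema JSON output."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": user_prompt}],
        "response_format": {
            "type": "json_schema",
            "json_schema": {
                "name": "code_review",   # arbitrary schema name
                "strict": True,          # enforce the schema exactly
                "schema": {
                    "type": "object",
                    "properties": {
                        "summary": {"type": "string"},
                        "issues": {"type": "array", "items": {"type": "string"}},
                    },
                    "required": ["summary", "issues"],
                    "additionalProperties": False,
                },
            },
        },
    }

payload = build_structured_request("gpt-4o-2024-08-06", "Review this function...")
```

Ask o1-mini to write this today and you’ll usually get a plain `gpt-4` chat call instead, which is the whole point of this thread.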

4 Likes

I completely agree.

It’s strange to me that o1-mini is supposed to be the best at coding tasks… but it isn’t trained on the recent OpenAI API features that I want to use it for.

2 Likes

I’ve heard other forum members say it’s difficult to bake this into the training data without risking quality regressions in future models, so the easiest workaround for now is to include information about the latest models and API features in the prompt you send.
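In practice that just means prepending a short system message that documents the current API surface before the user’s request. A sketch of that workaround — note the model names and feature list in the cheat sheet are a hand-written snapshot, not anything authoritative, and you’d have to keep them current yourself:

```python
# Sketch of the prompt-context workaround: tell the code-gen model about
# current OpenAI API features up front. The cheat sheet below is an
# assumption/snapshot that must be maintained by hand as the API evolves.

API_CHEAT_SHEET = """\
When writing code that calls the OpenAI API, follow these rules:
- Prefer current model names (e.g. "o1-mini", "gpt-4o") over "gpt-4" or "gpt-3.5-turbo".
- Use the Chat Completions endpoint, not the legacy Completions endpoint.
- For machine-readable output, use Structured Outputs via
  response_format={"type": "json_schema", ...} (or JSON mode as a fallback).
"""

def with_api_context(user_prompt: str) -> list[dict]:
    """Return a messages list with the API cheat sheet prepended as a system message."""
    return [
        {"role": "system", "content": API_CHEAT_SHEET},
        {"role": "user", "content": user_prompt},
    ]

messages = with_api_context("Write a script that asks o1-mini to summarize a file.")
```

It’s not elegant, but it reliably steers the generated code toward current model names until the base models catch up.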

2 Likes