Fine tuning with function calling / tools help!

guillermog · November 27, 2024, 12:09am

First post, long time reader.

I have a question regarding fine-tuning a model with function calling / tools.

I want to fine tune 4o-mini based on the good responses from 4o. Basically my prompt starts with a set of instructions and tools, then I let the model execute the instructions with the tools provided. 4o is really good at doing this, but it is costly. So I thought I could collect a lot of “good responses” from the main model and use that to train the mini model. So I did this, but the fine tuned model is worst now than just 4o-min.

To fine tune the model I used the OpenAI dashboard / playground, which automatically use all the stored calls, but this also includes the intermediate steps, like when you send the result of a tool execution and you wait for a response.

So, after all this introduction, my question is: should I be only fine tuning the model with full length conversations (so 1 full complete execution log), rather than than with intermediate steps like is done via the dashboard?

MARK0 · November 27, 2024, 7:25am

Hello! I don’t think it’s necessary to include an intermediate step here. The model should have a clear understanding of when and how to call the appropriate tool directly. My approach would involve training the mini-model using user messages that already include responses gathered from the tools.

guillermog · November 27, 2024, 11:13pm

Thank you! That is what I thought. The OpenAI Dashboard feature of storing and training is quite convenient but the downside is that you can’t pick and choose which of the conversations to store. I will try setting up a clean training set with conversations end to end.

Topic		Replies	Views
Finetuning with tool calls and tool responses API fine-tuning , tools	3	933	July 23, 2025
Token Optimization with fine tuning + Function Calling API fine-tuning , function-calling	2	448	January 10, 2025
Gpt-4o-mini fine-tuning with only 10 lines of code API gpt-4o-mini	27	2938	September 5, 2024
Fine tuning - how exactly does it work? API	5	2801	April 12, 2023
Gpt-4o-mini stops following instructions after a few turns Prompting api , gpt-4o-mini	5	540	September 12, 2025

Fine tuning with function calling / tools help!

Related topics