Using a fine-tuned model with the Assistants API

Hello everyone! I’m curious if anyone here has experimented with integrating a fine-tuned model into the Assistant API. Specifically, I’m wondering about the challenges and benefits you’ve encountered in tailoring the Assistant’s responses using a fine-tuned model. Any insights or experiences in this area would be greatly appreciated. Thank you!

My understanding was that the Assistants API removes most of the need for fine-tuning by allowing you to inject specific knowledge and behaviour.

From what I’m aware of, the largest benefit of fine-tuning over prompting is possibly saving money if you go through a massive number of prompts and tokens.

I may be wrong though; if there are other benefits, I’d be interested to hear them.

Hi,

Have you tried replacing the model name with your fine-tuned model in the creation object?

const assistant = await openai.beta.assistants.create({
  name: "Math Tutor",
  instructions: "You are a personal math tutor. Write and run code to answer math questions.",
  tools: [{ type: "code_interpreter" }],
  model: "your_model_name"
});
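For reference, the model names returned by the fine-tuning endpoint start with an `ft:` prefix rather than a plain base-model name, so the `model` field would look something like the sketch below (the org name and job suffix here are made-up placeholders):

```javascript
// Hypothetical fine-tuned model id; the real value is returned by your
// fine-tuning job in its `fine_tuned_model` field.
const fineTunedModel = "ft:gpt-3.5-turbo-1106:my-org::abc123";

// Quick sanity check before passing the name to assistants.create:
// fine-tuned model ids always begin with the "ft:" prefix.
function isFineTunedModelId(name) {
  return typeof name === "string" && name.startsWith("ft:");
}
```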

The documentation states:

  • Model: you can specify any GPT-3.5 or GPT-4 models, including fine-tuned models. The Retrieval tool requires gpt-3.5-turbo-1106 and gpt-4-1106-preview models.
But I think it will not support the Retrieval tool.

Do you think that using Retrieval and uploading knowledge-base documents (in CSV or PDF) will train or fine-tune the model? Or not?

If your requirement is for the AI to process your data and use it as context, then you should use Retrieval as described in the documentation. Note that you can use the Playground uploader to get files into your assistant if you do not wish to do it programmatically.

That’s right, but it doesn’t give you the option to fine-tune it.

So the real question is: does uploading documents from the Playground fine-tune the gpt-4-1106 model? Or is it just used as context, with no training happening?

No, uploading data to an assistant does not fine-tune the model; fine-tuning is a separate endpoint and workflow.

I see. So right now, there’s no way to use a fine-tuned model + the Retrieval tool?

Correct, the documentation states that the Retrieval tool requires the gpt-3.5-turbo-1106 and gpt-4-1106-preview models.

Ohh, so I think we can use a fine-tuned gpt-3.5-turbo-1106 + the Retrieval tool.

thanks for that @Foxalabs

@nikkdev did this work for you? Were you able to use a fine-tuned GPT-3.5 turbo 1106 + retrieval tool with the Assistants API? Many thx.

I haven’t tried it, but I think not. We can’t use a specific fine-tuned model with the Assistants API.

Does anyone know if we’ll be able to use the Retrieval tool with our fine-tuned models in the future?

Has anybody actually managed to get the Assistant API to work when using a fine-tuned model? I know the docs say this is possible, but I can’t get it to work in practice. If I create a new assistant using “gpt-3.5-turbo-1106”, it works; but if I specify a fine-tuned model (with gpt-3.5-turbo-1106 as a base) instead, I get an error when trying to create the assistant. FWIW, I am not using the Retrieval tool, only function calling.

I need to be able to combine those 2 as well. Hope it gets implemented soon.

Is there any update on this? It would be nice to be able to use a fine-tuned model with the assistant, so you can provide fewer instructions and more examples of what an ideal output would look like.

+1
This feature would be super useful.

@dbenito: “I know the docs say this is possible, but I can’t get it to work in practice”

That’s been my experience, as well.

I’ve found the best solution is fine-tuning only with limited-length text files. Anything else and I’m unable to get it to work.

Start with fine-tuning a model with a single text file, just to confirm you’re doing everything correctly. Once you’ve done that, you can continue to move forward with a better idea of where it breaks.

I hope that can help! 🙂

I create a new assistant using “gpt-3.5-turbo-1106”, it works, but if I specify a fine-tuned model (with gpt-3.5-turbo-1106 as a base) instead, I get an error when trying to create the assistant.

Any progress on this? I see the Assistant setup in the Playground does not allow selecting my fine-tuned models.
I wonder if the Assistants API allows it.

The Assistants API does not currently allow fine-tuned models to be used; this appeared in the documentation for a brief period, but that was in error. Hopefully this becomes a feature at some point in the future.
