Tips & Tricks for Keeping Fine-Tunes active?

jxl38 · May 15, 2022, 7:11pm

So, I’ve built and application that uses many different fine-tune models interchangeably.
The issue is they’re always asleep! I get the error message “That model is loading, please try again.” inconsistently. Even if the model completed a successful completion a few seconds ago, sometimes it will return the loading error on the next query. Does anyone have any tips for keeping the models awake?

tolga · May 16, 2022, 10:37am

From my experience, you need to give a bit of time after fine-tuning is completed, even if you haven’t used it for a while. As I understand the models are waiting in a cold state until get a request. My model needs approx one minute to go hot state.

daveshapautomator · May 16, 2022, 2:08pm

One thing you can do is combine models to do multiple tasks so you’re only using one fine-tune the whole time. This means it’s more likely to stay in memory. As far as I know, finetuning is still in beta so perhaps this will be fixed before it goes GA.

jxl38 · May 17, 2022, 2:54pm

Thanks for the replies, guys. @daveshapautomator I’ve thought about doing that. I was just concerned about wires getting crossed on specialized tasks. I wonder if I can set up automation to keep them hot.

Topic		Replies	Views
'Model still being loaded' error for finetuned models? API	15	1479	September 23, 2022
High latency for fine-tuned gpt-4o-mini API	4	923	November 26, 2024
Fine-tuned gpt-3.5-turbo latency Feedback fine-tuning-problems	15	3783	November 15, 2024
Fine tuned model does not exist! API gpt-35 , fine-tuning , api	12	5064	December 9, 2023
Fine-tuning Beta Release Announcements	55	8793	December 25, 2023

Tips & Tricks for Keeping Fine-Tunes active?

Related topics