The OpenAI Platform docs mention that fine-tuning examples are limited to 4096 tokens. Is this still true for the 16k-context models?
File validation would fail on example conversation lines longer than that.
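A rough pre-check like this can catch over-long lines before upload. Note the 4-chars-per-token heuristic and the 4096 budget here are assumptions for illustration; tiktoken would give exact counts for the actual tokenizer.

```python
import json

# Rough pre-check of a JSONL fine-tuning file before upload.
# TOKEN_LIMIT and the chars/4 heuristic are assumptions, not the
# tokenizer OpenAI actually uses; tiktoken gives exact counts.
TOKEN_LIMIT = 4096
VALID_ROLES = {"system", "user", "assistant"}

def estimate_tokens(text):
    # Crude heuristic: roughly 4 characters per token for English text.
    return len(text) // 4 + 1

def check_line(line):
    """Return a list of problems found in one JSONL line."""
    problems = []
    try:
        example = json.loads(line)
    except json.JSONDecodeError:
        return ["not valid JSON"]
    messages = example.get("messages")
    if not isinstance(messages, list) or not messages:
        return ["missing 'messages' list"]
    total = 0
    for m in messages:
        if m.get("role") not in VALID_ROLES:
            problems.append(f"bad role: {m.get('role')!r}")
        total += estimate_tokens(m.get("content", ""))
    if total > TOKEN_LIMIT:
        problems.append(f"~{total} tokens exceeds {TOKEN_LIMIT}")
    return problems

good = '{"messages": [{"role": "user", "content": "hi"}, {"role": "assistant", "content": "hello"}]}'
too_long = json.dumps({"messages": [{"role": "assistant", "content": "x " * 9000}]})
print(check_line(good))      # -> []
print(check_line(too_long))  # -> token-limit problem reported
```

Running this over every line of the training file before uploading avoids burning a job slot on a file that validation would reject anyway.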
I tested it: I packed one example with an extra "assistant" message holding about 4100 tokens of padding, plus just nine more placeholder examples.
The padding was ~4101 tokens sanitized from the AI models list page, added as a RAG-style assistant message after the usual system, user, and assistant turns. Whereas a deliberately bad JSON file had failed validation earlier, this file passed and the job is in the queue now.
And the fine-tune GUI has no place to choose the number of epochs, so I'll probably pay 8x more for this little experiment.
So lesson: don't use the GUI to fire off fine-tune jobs. No suffix, no control over epochs; they could run the thing at 50x your token count on a whim.
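Going through the API instead gives you both knobs. A minimal sketch of the job request, where the file ID and suffix are placeholders; with the openai Python SDK the actual call would be `client.fine_tuning.jobs.create(**payload)`:

```python
# Sketch of launching a fine-tune via the API instead of the GUI,
# so epochs and suffix stay under your control.
payload = {
    "training_file": "file-abc123",      # placeholder: ID of your uploaded JSONL
    "model": "gpt-3.5-turbo",
    "suffix": "my-experiment",           # appears in the resulting ft: model name
    "hyperparameters": {"n_epochs": 1},  # pin epochs; the default is auto-chosen
}
print(payload)
```

Pinning `n_epochs` to 1 is what prevents the "8x more than expected" surprise from an auto-selected epoch count.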
Good to know. By the way, do you know whether continuing to fine-tune a model after a fine-tuning job finishes deletes the intermediate model? That is, if you start a second fine-tuning job on the result, does the intermediate model get deleted once the second job ends?
The first model stays available under its original name until you delete it. Just specify that ft: model name as the base instead of the original API model name.
Nice! But then why am I seeing the same model name, at least while the second fine-tuning job is running?
If you look at the report above, it says base model: gpt-3.5-turbo. I'd expect that once the job is underway you'd instead see your previous fine-tune listed as the base model. You then receive a new model name when it's done.
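A small sketch of reading the job object to tell the two apart. The payloads here are made up for illustration; the real object would come from something like `client.fine_tuning.jobs.retrieve(job_id)`:

```python
import json

# Hypothetical job payloads: while running, "fine_tuned_model" is null;
# when finished it carries the new ft: name, while "model" still shows
# the base the job started from.
running = json.loads(
    '{"model": "gpt-3.5-turbo", "status": "running", "fine_tuned_model": null}'
)
done = json.loads(
    '{"model": "gpt-3.5-turbo", "status": "succeeded",'
    ' "fine_tuned_model": "ft:gpt-3.5-turbo:org::abc123"}'
)

def new_model_name(job):
    # Only a finished job has a usable fine-tuned model name.
    return job["fine_tuned_model"] if job["status"] == "succeeded" else None

print(new_model_name(running))  # -> None
print(new_model_name(done))     # -> ft:gpt-3.5-turbo:org::abc123
```

So seeing the base model name mid-job is expected; the new ft: name only appears once the job succeeds.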
Great! Thank you very much for the info
So anything more than 4k tokens per example doesn't work for fine-tuning? I'm a bit confused.