You are right that the training examples for gpt-3.5-turbo-0125 should allow for up to 16k tokens. What led you to conclude that it is limited to a 4k context?
When I try the base gpt-3.5-turbo-0125 model and my fine-tuned version in the Playground, the maximum length is 4095, while gpt-3.5-turbo-16k lets you work with the promised 16k.
I haven’t been able to do a proper test with the Completions API because the outputs are incredibly buggy: some answers are in Korean (my dataset and prompt were in English), some are superimposed incomplete tokens, some are a few words with symbol gibberish in between, and so on.
The maximum OUTPUT of the newer models is 4k. That is the max_tokens value, and it is what the slider controls: response tokens. In practice the model’s training will constrain it to even shorter outputs, unless you fine-tune it specifically to write very long form. And you cannot fine-tune gpt-3.5-turbo-16k-0613, the one model without a hard cap on output.
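To make the arithmetic concrete, here is a minimal sketch of the budget split, assuming the documented 16,385-token context window and 4,096-token output cap for gpt-3.5-turbo-0125 (and that you have the tiktoken package installed):

```python
import tiktoken

CONTEXT_WINDOW = 16385  # documented context window for gpt-3.5-turbo-0125
MAX_OUTPUT = 4096       # documented hard cap on completion tokens

enc = tiktoken.encoding_for_model("gpt-3.5-turbo")

prompt = "..."  # your actual prompt here
prompt_tokens = len(enc.encode(prompt))

# The context window covers prompt + completion, but the completion
# alone can never exceed the 4k output cap:
output_budget = min(MAX_OUTPUT, CONTEXT_WINDOW - prompt_tokens)
print(f"prompt: {prompt_tokens} tokens, max completion: {output_budget} tokens")
```

So a 12k-token prompt still leaves the full 4k of output available, but no prompt size ever unlocks more than 4k of output.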
That’s strange; the description of the slider says:
And now that I’m checking again, the fine-tuned version is limited to 2k, not the 4k of the base gpt-3.5-turbo-0125. I guess that’s a bug, though, same as the weird API completions.
That aside, the dataset used for fine-tuning was long form, so it should be giving long answers too.
Good observation! The same applies to me when I check my fine-tuned models in the Playground; since I normally don’t consume them there, I never realized it until now.
Have you tried using them via a regular API call outside of the Playground, with the max_tokens parameter set to 4k?
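Something like this minimal sketch, assuming the openai Python SDK v1.x; the fine-tuned model ID here is a placeholder, so swap in your own:

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# "ft:gpt-3.5-turbo-0125:my-org::abc123" is a hypothetical model ID;
# replace it with the ID of your own fine-tuned model.
response = client.chat.completions.create(
    model="ft:gpt-3.5-turbo-0125:my-org::abc123",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Give me a long, detailed answer."},
    ],
    max_tokens=4096,  # explicitly request the full 4k output budget
)

print(response.choices[0].message.content)
print(response.usage)  # prompt_tokens / completion_tokens / total_tokens
```

The usage object would at least tell you whether the 2k limit is just a Playground slider quirk or an actual cap enforced on the completion.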