Gpt-3.5-turbo-16k, but not for responses?

josh8 · June 14, 2023, 3:34am

I’m testing out gpt-3.5-turbo-16k on a dev server and I’m giving it the max tokens, however, it seems that it can’t output more than maybe 2,000 tokens? I’ve tested raising the presence penalty, I’ve lowered and raised the temperature, I’ve lowered the top p, I’ve changed wording on prompts, but it seems to always top out around 1,000 - 2,000 tokens in response.

Also, gpt-4-0613 seems more like gpt-3.5-turbo-pro than it does gpt-4.

Is anybody else having problems with the 16k not outputting more then 1K-2K tokens?

Also, can we get some playground settings that actually match what’s available… penalties go negative, but not on playground. Also, max_tokens goes way higher than that on all models now. It would make testing parameters a lot easier.

baoboochat · June 14, 2023, 2:20pm

Yeah,I have the same problem.Even though I set maxresponse to 8k token,But api response token less than 3k.

novaphil · June 14, 2023, 5:22pm

Is the response cut off or just not as long/detailed as you want?

lmwfh2rq44u · June 15, 2023, 3:40am

Please note that the date for this transition on 27 June.

You may find the link as below, it’s for your information -

https://openai.com/blog/function-calling-and-other-api-updates

hascdev · June 15, 2023, 12:20pm

I have a similar problem. I enter a long context to answer the question but it’s not all used and I get a shorter answer. (Compared to gpt-3.5-turbo)

ricardodg · July 20, 2023, 1:04am

Hi, maybe someone can help

I have the 16K available on playground and working great but not on the API, I get a response response:{ “error”: { “message”: “The model gpt-3.5-turbo-16K does not exist”,

Are you guys able to access it via API?

Thanks

novaphil · July 20, 2023, 3:23am

It’s lowercase k
gpt-3.5-turbo-16k

ricardodg · July 21, 2023, 4:00am

Great thanks that worked

Topic		Replies	Views
Gpt-3.5-turbo-16k with long context not work API gpt-35-turbo	6	3653	November 1, 2023
Finetuned gpt-3.5-turbo-0125 has a 4k context window, instead of the 16k promised API	5	1383	March 27, 2024
Chat GPT4 1106 vs ChatGPT 4: Impressive drop in quality API gpt-4 , chatgpt	27	15618	February 14, 2024
GPT-4 Turbo Long response issues (Lazy ? Restricted to 1xxx tokens?) Bugs gpt-4-turbo	2	1024	February 23, 2024
GPT-4 128K only has 4096 completion tokens API gpt-4	9	27346	February 27, 2024

Gpt-3.5-turbo-16k, but not for responses?

Related topics