GPT-4o 2024-08-06 - Context Output 16k Tokens - My Requests Max Tokens Around ~3k

guile.brazil · December 28, 2024, 6:27pm

Hello everyone,

According to the documentation for GPT-4o, the output window is approximately 16k tokens. However, in my requests, I can’t seem to generate responses that exceed 3.1k tokens.

Could someone kindly guide me on this?

Thank you!

guile.brazil · December 28, 2024, 6:31pm

[
{
“id”: “chatcmpl-AjVlynmQu4R7OXV5ky8iau295QcVa”,
“object”: “chat.completion”,
“created”: 1735410258,
“model”: “gpt-4o-2024-08-06”,
“choices”: [
{
“index”: 0,
“message”: {
“role”: “assistant”,
“content”: "{content} ",
“refusal”: null
},
“logprobs”: null,
“finish_reason”: “stop”
}
],
“usage”: {
“prompt_tokens”: 2699,
“completion_tokens”: 2542,
“total_tokens”: 5241,
“prompt_tokens_details”: {
“cached_tokens”: 2432,
“audio_tokens”: 0
},
“completion_tokens_details”: {
“reasoning_tokens”: 0,
“audio_tokens”: 0,
“accepted_prediction_tokens”: 0,
“rejected_prediction_tokens”: 0
}
},
“system_fingerprint”: “fp_d28bcae782”
}
]

sps · December 28, 2024, 6:41pm

Hello @guile.brazil,

Welcome to the forum.

The maximum output tokens is the upper limit up to which the model can generate tokens. Any outputs exceeding this limit will have the ”finish_reason” : “length”.

In your case, the completion reached its logprob-ablistic end, i.e., the model finished generating tokens and emitted the default stop sequence.

PaulBellow · December 28, 2024, 7:11pm

Can you share your prompt?

Also, the latest 4o usually gives me longer output if I feed it a good prompt.

Topic		Replies	Views
GPT-4o-mini max token 16,384 API gpt-4 , api	2	1845	August 31, 2024
Impossible to generate texts of more than 600 words API	5	3306	December 18, 2023
Chat Completions output cutting off without hitting max_tokens limit API gpt-35-turbo , api , token , gpt-0125	1	903	July 14, 2024
Optimizing Token Utilization for GPT-4 with Vector Database: Overcoming 1000-Token Limit Challenges Community gpt-4 , api , assistants-api	2	402	October 9, 2024
Gpt-4-1106-preview: 400 This model's maximum context length is 4097 tokens API api , token , gpt-4-turbo	8	5500	March 18, 2024

GPT-4o 2024-08-06 - Context Output 16k Tokens - My Requests Max Tokens Around ~3k

Related topics