Hello, I am using gpt-4o-mini to run inference on an input of about 23k tokens. The model's responses are being cut off after about 100 tokens. I tried gpt-4o for the same task and it works as expected. The max_tokens parameter is set to 16384, so that is not what's causing the issue. This task has worked with gpt-4o-mini before; the problem started only about 15 minutes ago. I tried both the Playground and the OpenAI Python SDK, and the same thing happens. Is anyone facing the same issue?
Look at the finish_reason in the API response. It may be content_filter, which can happen when the output looks like it is reproducing trained content — a "lyric search engine" AI, for example, won't work.
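A quick sketch of how to check this. The response is shown here as a plain dict in the API's JSON shape (with the official Python SDK you would read `resp.choices[0].finish_reason` instead); the helper name and the explanation strings are illustrative, not part of the API:

```python
# Sketch: report why a chat completion stopped generating.
# "length" means the output-token cap was hit; "content_filter" means the
# output was blocked, which matches the truncation behaviour described above.
def explain_finish_reason(response: dict) -> str:
    reason = response["choices"][0]["finish_reason"]
    explanations = {
        "stop": "model finished naturally",
        "length": "hit the max_tokens / max_completion_tokens limit",
        "content_filter": "output was blocked by the content filter",
    }
    return explanations.get(reason, f"unrecognized reason: {reason}")

# A response truncated at ~100 tokens would typically show "length" or
# "content_filter" here rather than "stop".
sample = {"choices": [{"finish_reason": "content_filter"}]}
print(explain_finish_reason(sample))
```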
You can also use the newer max_completion_tokens parameter instead of max_tokens.
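A minimal sketch of the swap, assuming the openai v1.x Python SDK. The request is built as a plain kwargs dict so the parameter change is visible; the prompt text is just a placeholder, and the actual API call (commented out) needs an API key:

```python
# Sketch, assuming the openai v1.x Python SDK. max_completion_tokens is the
# newer name for the output-token cap and replaces max_tokens on recent models.
request = {
    "model": "gpt-4o-mini",
    "messages": [{"role": "user", "content": "Summarize the attached text."}],
    "max_completion_tokens": 16384,  # cap on generated tokens, not input
}

# With a configured client this would be sent as:
#   from openai import OpenAI
#   client = OpenAI()  # reads OPENAI_API_KEY from the environment
#   resp = client.chat.completions.create(**request)
print(sorted(request))
```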