Under token limit, response is cut off regardless

polar · April 16, 2024, 3:00am

Hello,

I am at Tier 1 of api usage. I have found that regardless of coming in well under a given model’s token limit (in this case gpt-3.5-turbo-0125, but happens for all), my responses are getting cut off. Measuring my latest input using the tokenizer is 290 tokens, and the response that I received was 295. The response was cut off mid-sentence.

This has occurred for me regardless of the model, and regardless of whether I set the max_tokens parameter or not.

Any ideas?

polar · April 16, 2024, 3:28am

Unfortunately this looks to have been caused by user error. My db tool has a hover functionality which I just found out cuts off past a certain number of characters. After using the django ORM to query, found the completion was giving full result.

Topic		Replies	Views
Issues with Truncated Responses API	3	1253	April 22, 2024
Streamed response truncating under token limit Bugs gpt-4 , api	0	143	May 29, 2024
Trying to understand why I'm hitting token limit with API API gpt-4 , api	8	3310	February 27, 2024
Max_tokens not set, truncated return with "finish_reason": "stop" API gpt-4 , api	9	2696	April 24, 2024
Chat Completions output cutting off without hitting max_tokens limit API gpt-35-turbo , api , token , gpt-0125	1	407	July 14, 2024

Under token limit, response is cut off regardless

Related topics