max_completion_tokens not working with openai.beta.threads.runs.stream

    const stream = openai.beta.threads.runs.stream(
        threadId,
        { assistant_id: assistantId },
        { max_completion_tokens: 50 },
        eventHandler,
    );

This doesn't put any constraint on the length of the answer. What could I be doing wrong?

As a rough rule of thumb, 4 characters ~ 1 token, or 75 words ~ 100 tokens; you can estimate based on that. You can also lower the top_p value so that only higher-probability tokens are sampled, which tends to shorten the response. Hope this helps - Cheers!

What I meant was that I can still receive a 1000-word answer even after setting a limit of 50 tokens, so the limit is clearly not being applied. What am I doing wrong?