I build a lot of chatbots so I immediately started testing gpt4o. It works great with the prompts used with gpt-4-x, and the speed is amazing.
EXCEPT it does not seem to stick to the maximum words.
Let’s say that I set max_tokens to 500, and in the prompt I ask it to stick to maximum 150 words (in a number of prompt variations). → This works well for gpt4, but gpt4o frequently runs out of tokens before it has completed its sentences.
This may have something to do with the new tokenization used by gpt4o, or something else. Please let me know if you find successful prompts that gpt4o sticks to