When using Playground, what happens if total system/user/assistant prompts exceed max token length

Just to note: I identify the usage as being the chat completions endpoint, from the language of writing a “system message”. Not the Assistants agent.

The chat completions endpoint on API with your own code in non-streaming mode also gives you insights into the token consumption. You may want to transition from playground testing to code-based testing, getting the usage out of the full response.

1 Like