Assistant with Schema - Request too large - TPM Limit 10000 for simple "Hi"

Jetro223 · October 3, 2024, 8:15pm

I’m trying to build an assistant for invoice recognition. I have entered the following system instructions in the Playground:

You are an assistant which recognizes invoice data from pdf files uploaded to you.
You recognize the fields from the invoice and output JSON in the schema provided with the extracted data from the invoice.
The output is only the parameters to the function call formatted as JSON. Fields you can not find in the invoice will be empty in the JSON response.

I also added a json_schema describing an invoice.
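For readers unfamiliar with this setup, a response format of this kind could look roughly like the sketch below. The schema name and all field names are hypothetical placeholders, not the actual schema from this post; only the outer `type` / `json_schema` wrapper follows the structured-output format.

```python
import json

# Hypothetical invoice schema for illustration only; the real schema
# used in this post is not shown and is likely larger.
invoice_schema = {
    "type": "object",
    "properties": {
        "invoice_number": {"type": "string"},
        "invoice_date": {"type": "string"},
        "vendor_name": {"type": "string"},
        "total_amount": {"type": "number"},
    },
    # Strict mode expects every property to be listed as required
    # and additional properties to be disallowed.
    "required": ["invoice_number", "invoice_date", "vendor_name", "total_amount"],
    "additionalProperties": False,
}

# Wrapper passed as the assistant's response format.
response_format = {
    "type": "json_schema",
    "json_schema": {
        "name": "invoice",  # hypothetical name
        "strict": True,
        "schema": invoice_schema,
    },
}

# The serialized size gives a rough feel for how much text the schema
# adds to the assistant configuration.
schema_chars = len(json.dumps(response_format))
print(schema_chars)
```

A schema of this size is small, so on its own it would not obviously account for a five-digit token count.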

With the model gpt-4o-2024-08-06 (the only one I could find that supports json_schema), I get the following error, even for a simple “Hi”:

Request too large for gpt-4o in organization org-XYZ on tokens per min (TPM): Limit 10000, Requested 16481. The input or output tokens must be reduced in order to run successfully.
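The two numbers in the error can be pulled out to see how far over the limit the request is. This is a minimal sketch, assuming the message keeps the "Limit N, Requested M" wording shown above; `parse_tpm_error` is a hypothetical helper, not part of any OpenAI library.

```python
import re

# Error text copied from the message above.
ERROR = (
    "Request too large for gpt-4o in organization org-XYZ on tokens per min (TPM): "
    "Limit 10000, Requested 16481. The input or output tokens must be reduced "
    "in order to run successfully."
)


def parse_tpm_error(message):
    """Extract the TPM limit and requested token count from the error text.

    Returns None if the message does not contain the expected wording.
    """
    match = re.search(r"Limit (\d+), Requested (\d+)", message)
    if match is None:
        return None
    limit = int(match.group(1))
    requested = int(match.group(2))
    return {"limit": limit, "requested": requested, "over_by": requested - limit}


result = parse_tpm_error(ERROR)
print(result)
# {'limit': 10000, 'requested': 16481, 'over_by': 6481}
```

The request is counted as 6481 tokens over the 10000 TPM limit, so the bulk of the count must come from something other than the two-character user message.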

Can somebody help and explain the high token count?
