Token Optimization Question

seadude · May 11, 2023, 6:07pm

Hello,

I’m sending requests to an Azure OpenAI GPT-4-32k instance. I want to optimize the prompt token counts by removing dead spaces; maybe minify prompts, etc.

When I paste a raw JSON prompt into the OpenAI Tokenizer, I see ~17.5k tokens:

When I paste in a semi-processed JSON prompt ( replace(‘’, ‘<no-space’), I see ~5.5k tokens:

<can’t paste the image here due to new member restrictions>

Pretty substantial diff.

However, when I submit the raw JSON prompt to my Azure OpenAI resource, I see the token usage come back as ~5.5k.

So the question: Does Azure OpenAI optimize the prompt? How is the 17.5k token prompt being reduced to ~5.5k tokens?

sps · May 11, 2023, 6:16pm

Welcome to the community @seadude

When you copy-paste from code editor(assuming it’s formatted), it copies the formatting tabs or spaces, used for indentation, as well.

When you pass directly to the API (likely stringifying json before passing) it sends it as a string with the indentation removed.

Hence, it’s not the API that’s doing the work, it’s your code stringifying the json.

seadude · May 11, 2023, 6:31pm

Aha! Ok, thank you. That makes sense.

Postman’s code in this case!

Topic		Replies	Views
Question about function completion model tokenization API	3	429	July 12, 2023
Different prompt tokens betwen OpenAI tokenizer or Azure OpenAI and OPENAI API via python library API gpt-35-turbo , chatgpt , api , chat-completion , azure	4	1961	April 16, 2024
Prompt tokes are much lower than the number mentioned in the response API	6	118	January 10, 2025
Is JSON Mode supposed to result in a higher prompt token count? API	2	1522	December 1, 2023
Using the API the token count is off API	10	1687	January 16, 2024

Token Optimization Question

Related topics