I’m in the same exact boat, trying to figure out the root cause of this.
I’ve tried varying the parameters. Using semantic search with only the top 5 or top 3 documents, and asking a basic question (5-10 tokens, with a ~100-token system prompt), the responses from Azure OpenAI report that I’m using 6,000 prompt tokens on average (GPT-4, 8K).
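For what it’s worth, this is roughly the kind of check I’ve been running to see where the tokens are going: count the system prompt, the question, and the retrieved chunks locally with tiktoken, then compare against the `usage.prompt_tokens` the service reports. The strings and variable names below are just placeholders, not my actual setup.

```python
# pip install tiktoken
import tiktoken

# GPT-4 uses the cl100k_base encoding
enc = tiktoken.get_encoding("cl100k_base")

def count_tokens(text: str) -> int:
    return len(enc.encode(text))

# Placeholder values, roughly matching what I described above
system_prompt = "..."                      # ~100 tokens
question = "..."                           # ~5-10 tokens
retrieved_chunks = ["...", "...", "..."]   # top 3 documents from semantic search

accounted_for = (
    count_tokens(system_prompt)
    + count_tokens(question)
    + sum(count_tokens(c) for c in retrieved_chunks)
)
print("Prompt tokens I can account for:", accounted_for)

# Then compare against what the API actually reports, e.g.:
# response = client.chat.completions.create(...)
# print(response.usage.prompt_tokens)
```

In my case the number I can account for is nowhere near 6,000, which is why I suspect the retrieval step is injecting far more document text than the top-3/top-5 setting suggests.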
After chunking my data into 200-token chunks, I was able to get prompt tokens down to 4,000.
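The chunking itself was nothing fancy; a rough sketch of the idea is below (my real splitting logic isn’t important, the point is just capping each chunk at 200 tokens so each retrieved document contributes less to the prompt).

```python
import tiktoken

enc = tiktoken.get_encoding("cl100k_base")

def chunk_by_tokens(text: str, max_tokens: int = 200) -> list[str]:
    """Split text into pieces of at most max_tokens tokens each."""
    tokens = enc.encode(text)
    return [
        enc.decode(tokens[i : i + max_tokens])
        for i in range(0, len(tokens), max_tokens)
    ]
```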
Still, this just seems extremely high and I cannot pinpoint what I am doing wrong.
Is this what you are also seeing?