Hi, I am building a system using the Assistants API.
I use the Assistants API to read PDF files and chat with the data.
I upload files to OpenAI and attach them to an assistant:
assistant = get_openai_client().beta.assistants.create(
    name="Disclosure Gpt4 1106 preview",
    model="gpt-4-1106-preview",  # model assumed from the assistant's name
    tools=[{"type": "retrieval"}], file_ids=[uploaded_file.id])  # uploaded_file: the file returned by the upload call
The PDF files are 50 to 100 pages in total, and I've found that my API calls use a lot of tokens. Any suggestions for reducing the cost? Thanks!
It looks like all the files are being converted to tokens and sent to GPT, so every call carries tokens for the entire file set.
The simplest way to reduce this would be to skip the built-in file retrieval and run your own semantic matching instead: extract only the relevant passages yourself and feed those into the GPT prompt.
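A minimal sketch of that idea: chunk the document, embed each chunk, rank chunks by similarity to the question, and send only the top hits to the model. The `embed` function here is a toy bag-of-words stand-in for a real embedding model (e.g. OpenAI's text-embedding-ada-002); the chunk texts and function names are illustrative, not from the original post.

```python
import math
import re
from collections import Counter

def embed(text):
    # Stand-in for a real embedding model (e.g. OpenAI's
    # text-embedding-ada-002): a simple bag-of-words count vector.
    return Counter(re.findall(r"\w+", text.lower()))

def cosine(a, b):
    # Cosine similarity between two sparse word-count vectors.
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0

def top_chunks(question, chunks, k=2):
    # Rank document chunks by similarity to the question and keep the top k;
    # only these get pasted into the prompt, instead of the whole PDF.
    q = embed(question)
    return sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)[:k]

chunks = [
    "Revenue grew 12% year over year.",
    "The board approved a new buyback program.",
    "Net revenue was driven by subscription growth.",
]
print(top_chunks("What drove revenue growth?", chunks, k=1))
# → ['Net revenue was driven by subscription growth.']
```

With real embeddings the structure is identical; you'd just replace `embed` with an API call and cache the chunk vectors so you only pay for them once.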
Can you share tools and techniques for doing what you suggest? Do you recommend using LangChain with a vector DB, or some other techniques and tools?
LangChain would be a good way to go about this, especially if the files keep changing and you have to recreate embeddings frequently.
However, if the files are static, you could use a vector database like Pinecone to store the embeddings long term.
For similarity, once you have embeddings, cosine similarity is the way to go.
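For dense embedding vectors, cosine similarity is just the dot product divided by the product of the vector norms. A small NumPy version (the input vectors here are made up for illustration):

```python
import numpy as np

def cosine_similarity(a, b):
    # cos(theta) = (a . b) / (||a|| * ||b||)
    a, b = np.asarray(a, dtype=float), np.asarray(b, dtype=float)
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

print(cosine_similarity([1.0, 0.0, 1.0], [1.0, 1.0, 0.0]))  # ≈ 0.5
```

Note that OpenAI embeddings are normalized to unit length, so for those a plain dot product gives the same ranking without the division.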