Assistants API is Killing Me

You’re not doing anything wrong.

As others have suggested, a less expensive LLM such as GPT-3.5 is an option: because you're relying on RAG, the retrieved context does most of the heavy lifting and you don't need as much raw power from the model.
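
To illustrate, the model choice is just one parameter in a Chat Completions call, so the downgrade is a one-line change. This is a minimal sketch only; the prompt strings and model name are placeholders, not a recommendation for your exact setup:

```python
# Minimal sketch: swapping in a cheaper model for the RAG answering step.
# Prompt contents here are illustrative placeholders.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

response = client.chat.completions.create(
    model="gpt-3.5-turbo",  # cheaper model; the retrieved context carries most of the weight
    messages=[
        {"role": "system", "content": "Answer using only the provided context."},
        {"role": "user", "content": "Context:\n<retrieved chunks>\n\nQuestion: <user question>"},
    ],
)
print(response.choices[0].message.content)
```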

IMHO, the most effective option would be to chunk the document yourself, embed the chunks, and import them into a Pinecone vector DB (at this size it will be small enough to run on the free tier). That would substantially reduce costs and potentially increase accuracy, depending on how well the document's structure lends itself to chunking and embedding.
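
Here's a rough sketch of that flow, assuming a Pinecone serverless index named "docs" with dimension 1536 (to match text-embedding-3-small); the chunk size, file name, and question are placeholders you'd adjust to your document:

```python
# Sketch of chunk -> embed -> upsert -> query with Pinecone and OpenAI embeddings.
# Index name "docs", chunk sizes, and the sample document/question are assumptions.
from openai import OpenAI
from pinecone import Pinecone

client = OpenAI()
pc = Pinecone(api_key="YOUR_PINECONE_API_KEY")
index = pc.Index("docs")  # pre-created index, dimension 1536

def chunk_text(text: str, size: int = 1000, overlap: int = 200) -> list[str]:
    """Naive fixed-size character chunking with overlap."""
    chunks, start = [], 0
    while start < len(text):
        chunks.append(text[start:start + size])
        start += size - overlap
    return chunks

document = open("manual.txt").read()  # placeholder source document
chunks = chunk_text(document)

# Embed all chunks (one call is fine for a small document; batch if it's large)
# and keep the raw text in metadata so it can be returned at query time.
embeddings = client.embeddings.create(model="text-embedding-3-small", input=chunks)
index.upsert(vectors=[
    {"id": f"chunk-{i}", "values": e.embedding, "metadata": {"text": chunks[i]}}
    for i, e in enumerate(embeddings.data)
])

# At query time: embed the question, pull the top matches, and build the context
# string you pass to the (cheaper) chat model.
question = "How do I reset the device?"  # placeholder question
q_emb = client.embeddings.create(model="text-embedding-3-small", input=[question]).data[0].embedding
matches = index.query(vector=q_emb, top_k=5, include_metadata=True).matches
context = "\n\n".join(m.metadata["text"] for m in matches)
```

The upside of doing the chunking yourself is that you control chunk boundaries and top_k, which is where most of the cost and accuracy trade-off lives with a structured document.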
