I am in middle of situation where Retrieval method for file and its pricing, if i use new assistant every time which means creating new thread always, would it cost me more or using same thread which saved token (which i really dont need)would cost me more, ?
For me the retrieval method works really good with assistant than pinecone, I have one big file with unique numbers and data which AI needs to pick WRT some situation.
There’s no need to create a new assistant every time, assistant’s and thread’s are separate things, and you can use multiple threads with the same assistant
The thing that will save you the most tokens is creating an API that the assistant can fetch the unique numbers and data from.
2 Likes