Assistant Retrieval API with or without Threading and its cost effect

I am in middle of situation where Retrieval method for file and its pricing, if i use new assistant every time which means creating new thread always, would it cost me more or using same thread which saved token (which i really dont need)would cost me more, ?
For me the retrieval method works really good with assistant than pinecone, I have one big file with unique numbers and data which AI needs to pick WRT some situation.

There’s no need to create a new assistant every time, assistant’s and thread’s are separate things, and you can use multiple threads with the same assistant :laughing:

The thing that will save you the most tokens is creating an API that the assistant can fetch the unique numbers and data from.

2 Likes