Understanding the current Assistant Retrieval process

Right now its very expensive to use when its not doing a vector-db retrieval. I’ve got a chatbot and did just a few messages today and used over 65k tokens…

1 Like