I hope you’re well. I’m looking for some experts with the Assistants API. Currently, we are looking at if we should use Chat Completion or Assistants and are also struggling with understanding the costs of both.
If you have some experience with this, please do reach out to me and I will message you my Whatsapp.
I’ve noticed assistants api to be slow for my complex multi modal output use-case (Asking assistant api to generate code, run and capture the output to a chart
we are making an AI chatbot of athletes and personal trainers etc. we want it to capture their personality and chat like how an athlete would. Imagine an AI Lebron, we want it to speak like Lebron. What would you recommend and why wouldn’t assistants be good?
The AI is already trained up to a knowledge cutoff, Sept 2021, or on the latest “economy” model of GPT-4-turbo, to April 2023. That’s not quite good enough for “where are you playing tomorrow, sports ball man?”, but that is not really the entertainment factor of chatting with a simulated AI buddy.
With the latter model that has a very large possible input (50k words or so at increasing expense), you can place a whole bunch of prompt info like the above. Explore what else the AI knows first.
Your forum keyword for injecting even more knowledge from documents on demand is “RAG” (retrieval-augmented generation, using embeddings-based vector semantic search).
You pay for the data. You have to figure out and manage how much word (token) consumption is being done by each user query. Then price competitively.