Assistant's API speed and hallucination tricks i learned while building my production app

macroguy · January 15, 2024, 4:46am

Hi Folks,

I’ve spent the last two months using the assistant’s API to build out www.docmonster.ai and
i wanted to share the best tricks i’ve found to get to somewhat production speed. This is like under 1.2 sec

Context
Model: Gpt -3.5 turbo

Tip 1 for speed:
So all messages have three parts:
Thread,
User message,
run,

Runs have messages on them and are within a thread. My biggest speed improvement came from splitting part one. Basically when the user opens my bot by clicking on the + symbol, it sends a message to the backend to initalise a thread and have it ready. Then when the user send the message, the run is added to the thread. Thread ends when the user closes the bot.

Tip2 for speed:

use promises instead of running them one after the other.
const [sentMessage, run] = await Promise.all([sentMessagePromise, runPromise]);

This made a big difference as well.

Tip3 for speed: More dynos

My backend was running on heroku and i was using one eco dyno. Adding 2 standard dynos made a world of a difference. don’t judge the api on the speed it has on your local server. even without streaming it’s not horrible!

Tip for Hallucinations:
I had to keep gaslighting my bot to make it retrieve from the file retrieve. It would keep saying it was unable to access files and then when i replied with “try again you have access, it works”

I added this to the instructions when my bot is created to solve for this: “If the myfiles_browser tool doesnt work the first time, try again till it works’”

So myfiles_browser tool seems to be the file retrieval tool’s name. So adding this instruction at the creation and the message level has made it stop telling me it couldn’t access my files.

It’s possible this is coincidence and the instruction doesn’t really help, but it’s consistently worked for me.

Anyways, thats it for me. Hope these tips were useful. If you want to try the bot’s speed you can do it on my product for free at www.docmonster.ai to see what i mean. Plus i’m launching what i think may be one of the first few production apps using the assitant’s api so wish me luck!

PaulBellow · January 15, 2024, 7:45am

Thanks for sharing your tips and suggestions with us!

jay_mercu · January 15, 2024, 10:34am

Thanks for sharing!

Noob question: Why did you use the Assistants API instead of the Chat API?

If I understand docmonster’s use case correctly, you’re basically building “ChatGPT over my API docs”. Wouldn’t a simple RAG architecture on top of the Chat API have sufficed?

foschi · January 15, 2024, 1:46pm

RAG is included with Assistant. You can use Assistant for that, and that’s what OP did.

simmel · January 15, 2024, 3:08pm

Thanks a lot & good luck with Docmonster @macroguy! Very interesting.

Are you loading just one document into the assistant, or more?
Gaslighting also helped me, but only for a few attached docs (1-3), not the official 20.

And are you using the citations feature? that one was also not working most of the time.

yonatanab1 · January 15, 2024, 7:00pm

Well done! Thanks for the tips

jay_mercu · January 15, 2024, 8:59pm

Yeah, I’m aware. But what’s the benefit of using the Assistant API over the Chat API for this use case?

The Chat API is tried and tested, the Assistant API isn’t.

diego.cardenas · February 11, 2024, 8:58pm

The most important benefit is simply the easy of release. You don’t have to worry about context, history of messages, RAG, etc. You simply configure the Assistant and use the API. This is INCREDIBLE to get your product into production and test it to validate any early hypothesis.

I still have not seen any real advantage in the long run. You have more control creating your own RAG system.

stevenic · February 11, 2024, 9:07pm

Great tips… especially the speed tips. Well done

Topic		Replies	Views
How do you use the Assistants API? API assistants-api	21	8193	August 2, 2024
The optimal way to build AI Chatbots? API	7	376	March 6, 2025
9 months of using the OpenAI Assistants - API assistants-api	3	558	August 19, 2024
Why does my assistant find the right answer from file on Playground but not via API? API	6	1789	December 8, 2023
Chatbot Assistant Implementation Feedback API	2	424	March 24, 2024

Assistant's API speed and hallucination tricks i learned while building my production app

Related topics