Optimizing Runs that make a tool call

I’ve been using mini-4o and a chat completion to narrow down the selection of possible tools for a query to 0-2 (this takes about half a second), but even with mini-4o as the model for a Run, it still takes 5+ seconds for even a simple tool call. Has anyone found a way to optimize? How long are queries typically taking in your app?