Realtime: Calling a third-party API

For my current project, I make API calls to an AI agent I developed. Is there a way to bypass the tool (function-calling) mechanism and invoke my own model directly every time the user speaks? I only want the output from my model, without any additional processing or tool interaction.

So the only functionality I want from Realtime is voice input and output. And yes, I do already have a version of my project that uses separate STT and TTS steps, but it has higher latency.
