Use knowledge base with realtime api?

What’s the best way to use my own knowledge base with the realtime API?

Would it be through function calling? RAG? How would it work?

Thanks!


That’s an interesting question. Here’s how I would approach it, especially since every AI turn in a realtime session can be increasingly expensive.

Place RAG results retrieved from the user’s input into a section of the system message — that is, the `instructions` field sent via a `session.update` event (`session.update` → `session` → `instructions`). This can only work if you are using text input, or some very fast transcription of the user’s voice running outside of the realtime API.
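A minimal sketch of that injection step, assuming the `session.update` client-event shape from the Realtime API docs (the base instructions, heading text, and retrieval chunks here are illustrative):

```python
import json

BASE_INSTRUCTIONS = "You are a helpful voice assistant."  # hypothetical base prompt

def build_session_update(rag_chunks: list[str]) -> str:
    """Build a session.update event whose instructions embed RAG results.

    Assumption: the event shape {"type": "session.update", "session":
    {"instructions": ...}} matches the Realtime API client events.
    """
    knowledge = "\n".join(f"- {chunk}" for chunk in rag_chunks)
    event = {
        "type": "session.update",
        "session": {
            "instructions": (
                BASE_INSTRUCTIONS
                + "\n\n## Knowledge relevant to the user's next message\n"
                + knowledge
            ),
        },
    }
    return json.dumps(event)

# Before each user turn: run retrieval on the transcribed input,
# then send this event over the websocket before triggering a response.
payload = build_session_update(["Returns are accepted within 30 days."])
```

The key point is that the retrieval text lives in the session instructions rather than in a function-call round trip, so the turn is still a single model invocation.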

The benefit is that you haven’t automatically doubled the cost of the turn, and you avoid the added delay of having the AI write a function call, sending the return value back into the server-side chat history, and then running the whole input context again.

The voice conversation then appears continuous after the system message, so the AI should never drop out of voice.

If OpenAI ever implements context caching for significant discounts, such early changes in the context could break it. In that case you could consider two new messages per turn instead: one saying “here’s information relevant to the user’s next question”, and one for the user’s voice. That would take trials to judge the quality of the ongoing chat responses, plus the extra work of deleting the retrieval message back out of the chat quickly so those messages don’t pile up, increasing costs and reducing attention. It’s essentially a function call with no call.
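A sketch of that alternative, assuming the `conversation.item.create` / `conversation.item.delete` client-event shapes from the Realtime API docs (the item id, role choice, and retrieval text are illustrative):

```python
import json

def build_retrieval_item(item_id: str, rag_text: str) -> str:
    """Create a system-style message item carrying retrieval results for one turn.

    Assumption: message items accept a "system" role with "input_text" content,
    per the Realtime API conversation.item.create event.
    """
    return json.dumps({
        "type": "conversation.item.create",
        "item": {
            "id": item_id,
            "type": "message",
            "role": "system",
            "content": [{
                "type": "input_text",
                "text": "Information relevant to the user's next question:\n" + rag_text,
            }],
        },
    })

def build_retrieval_delete(item_id: str) -> str:
    """Delete the retrieval item once the turn completes, so items don't pile up."""
    return json.dumps({
        "type": "conversation.item.delete",
        "item_id": item_id,
    })

create_event = build_retrieval_item("rag_turn_001", "Shipping takes 3-5 days.")
delete_event = build_retrieval_delete("rag_turn_001")
```

Sending the delete event right after the assistant’s response keeps only the user turns in the ongoing history, which is the cleanup this approach requires.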

Having thought more about this, an easy approach could be to populate the “memory” (the session instructions) with the knowledge base up front, so the model has access to it and can draw on it as the conversation goes on.
