(Realtime API) Hows everyone managing longer than 30min sessions

andreskimlee · March 16, 2025, 7:17am

I’m building an application that requires longer session windows.

Curious to know how others have implemented this without disruptions to the user side.

My current thought is to simply refresh the ephemeral token every 29 minutes and update my webRTC connection on the client side by managing two sessions.

First session would be the one that’s about to expire
Second session is the new session that the user will switch over to when the connection is established

_j · March 16, 2025, 7:26am

I don’t know that anyone’s doing that as a matter of course. Only user audio goes in, so you can’t load up a chat again with turns of whatever audio you want the assistant to imagine it spoke in the past.

You’d have to create something you can place in the instructions as a chat summary and recent chat exchanges to give the illusion someone is continuing with a bit of memory.

andreskimlee · March 16, 2025, 7:42am

That would be the idea

Fetch new ephemeral token
Keep old session alive
Create summarization based on prior transcript
Connect to new session with summarization
Close old session

Topic		Replies	Views
Realtime API - session_expired? API realtime	11	1018	February 21, 2025
Handling early conversation closure API function-calling , long-context , voice , realtime	8	432	February 24, 2025
How can I pass additional context to the realtime API during a conversation API realtime , api-realtime , api-realtime-speech	8	477	February 19, 2025
OpenAI Realtime API Ephemeral Tokens API realtime , api-realtime	1	818	February 20, 2025
Realtime API and session costs API advanced-voice , realtime , api-realtime , api-realtime-speech	2	481	November 4, 2024

(Realtime API) Hows everyone managing longer than 30min sessions

Related topics