Realtime API with a Web APp with multiple users

aron22 · October 13, 2024, 2:47pm

Hi

I’m playing around with the Realtime API, it’s really cool and makes fun. But I came across one question in a production scenario.

Each Websocker Session in the Backend is equal to a chat session, this makes sens, but brings me to the question of how a system looks at scale.

For Example, GCP Cloud Run supports web sockets, but there is no real session affinity for connections there is 100% there is still a chance to get routed to a different instance.

Is there any recommendation to system design for a backend used by a Web App?
Or perhaps a paper how open AI scales the realtime api feature?

Thanks for help

Foxalabs · October 13, 2024, 11:44pm

The view I take on this is if there is not a service provider able to keep a websocket open right now, there will be in a few weeks when hosting services realise they can make money serving realtime-api clients.

What I tend to do is build apps making the assumption that money is a great and rapid motivator. Then look into the scaling issue once I have an application that I think has a chance to scale.

Also, I’m pretty sure Azure has services for this, not looked, but as all of this is sitting on Azure right now… it would make sense that there is suitable hosting for it.

aron22 · October 14, 2024, 9:45am

Thank’s for you thoughts. Unfortunately, I don’t have time currently to wait on such service, I will set up by myself.

Foxalabs · October 14, 2024, 10:48am

Please keep us informed of your progress, it will likely be of use to many.

Topic		Replies	Views
Serving multiple users simultaneously using realtime API API realtime , api-realtime , api-realtime-speech	9	530	December 17, 2024
JavaScript->FastAPI->AzureOpenAI API	2	769	September 8, 2023
How does one scale a chat application for production? Is async necessary? API api	2	3316	August 3, 2023
Questions on Creating 100K+ threads API threads	6	802	March 18, 2024
ChatGPT API Proxy Server on Heroku - Intermittent H12 Errors API api	6	1587	December 22, 2023

Realtime API with a Web APp with multiple users

Related topics