While the current system is great, its biggest flaw is that the connection has to be proxied through a backend.
This adds infrastructure costs on top of the model usage itself, and the extra hop adds latency, which is counterproductive for a realtime API.
I propose a client token / server token setup: a backend endpoint, authenticated with the normal server API key, that mints a short-lived client token (expiring after about a minute), which the client can then use to open a WebSocket directly to OpenAI.
This would lower complexity for devs, cut costs and latency, and should take very little work to implement.
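A minimal sketch of what this could look like, in TypeScript. The `/v1/realtime/client_tokens` endpoint, its request/response shape, and the subprotocol-based auth on the WebSocket are all assumptions for illustration, not a confirmed OpenAI API:

```typescript
// server.ts: backend endpoint that mints a short-lived client token.
// The server API key never leaves the backend.
import express from "express";

const app = express();
const OPENAI_API_KEY = process.env.OPENAI_API_KEY!; // server token, kept secret

app.post("/client-token", async (_req, res) => {
  // Hypothetical OpenAI endpoint that exchanges the server key for a
  // client token expiring in ~60s. URL and body shape are assumed.
  const r = await fetch("https://api.openai.com/v1/realtime/client_tokens", {
    method: "POST",
    headers: {
      Authorization: `Bearer ${OPENAI_API_KEY}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify({ expires_in: 60 }),
  });
  const { token } = await r.json();
  res.json({ token });
});

app.listen(3000);
```

The client then trades one round trip to the backend for a direct socket to OpenAI, keeping the realtime path proxy-free:

```typescript
// client.ts (browser): fetch the ephemeral token from our backend, then
// connect the WebSocket straight to OpenAI.
const { token } = await (await fetch("/client-token", { method: "POST" })).json();

// Browsers can't set an Authorization header on a WebSocket, so the token
// would have to ride in a subprotocol or query param (also an assumption).
const ws = new WebSocket(
  "wss://api.openai.com/v1/realtime?model=gpt-4o-realtime-preview",
  ["realtime", `bearer.${token}`]
);
ws.onopen = () => console.log("connected directly to OpenAI");
```

Since the client token expires after a minute, leaking it is low-risk, and the backend endpoint can still enforce its own auth and rate limits before minting one.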