Allow generating restricted client api tokens! GPT-4o low latency is wasted when forwarding client requests

michael67 · May 14, 2024, 1:33am

Please allow generation of restricted client api tokens!

Routing client requests through backend servers defeats the purpose of all the amazing work done on improving GPT-4o latency. It can double the latency in some cases (320ms is advertised for GPT-4o and a backend route can easily add another 300ms).

Minimum features requested:

API call to generate a client token.
Set expiry.
Set rate limit.
Set allowed endpoints.

If you agree, please add a like!

Topic		Replies	Views
Is a better System for Realtime Websockets in the works? API	0	44	October 18, 2024
Important feature requests for GPT-3 API (presigned requests & keys) API	4	976	June 14, 2023
Plus users should have limited access to API Feedback gpt-4 , chatgpt	0	535	February 9, 2024
Feature Request: Domain and App ID Restrictions for API Tokens Feedback api	4	326	June 17, 2024
Feature Request: Please allow me to pin a given api-key to a given model Feedback api , api-keys	5	556	March 12, 2024

Allow generating restricted client api tokens! GPT-4o low latency is wasted when forwarding client requests

Related Topics