How can a single instance of gpt handle multiple users at the same time?

Hello,

I have a prompt created via the OpenAI API and I’d like to deploy it.
I was wondering how simultaneous access by multiple users is handled. Is it something I should be worried about? Can the API handle this situation? Otherwise, what solutions exist for it (a queue, a job scheduler, etc.)?

Thank you for your help!

If you are talking about assistants built with the Assistants API, you would typically assign one thread per user.

No, I’m using ‘gpt-3.5-turbo-0125’ with the regular API.

Yeah, gpt-3.5-turbo-0125 is the model. Since you tagged this post as assistants-api, I am assuming you are using the Assistants API. Or do you mean the Chat Completions API?

Right, it’s the Chat Completions API.

If you are going to use the Chat Completions API, you need to manage the context messages yourself. What this means is that you will need something in the backend to keep track of the messages per user. That is the minimum you need to do.
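A minimal sketch of what "manage the messages per user" could look like. The class name, system prompt, and user IDs are placeholders I made up; an in-memory dict is only for illustration (a real deployment would use a database or cache):

```python
# Minimal sketch: one message list per user, rebuilt into a full
# context on every request, since Chat Completions is stateless.
from collections import defaultdict


class ConversationStore:
    """Keeps an independent conversation history for each user."""

    def __init__(self, system_prompt):
        self.system_prompt = system_prompt
        self.histories = defaultdict(list)  # user_id -> list of messages

    def messages_for(self, user_id):
        # The whole context is reconstructed for every API call.
        return [{"role": "system", "content": self.system_prompt}] + self.histories[user_id]

    def add(self, user_id, role, content):
        self.histories[user_id].append({"role": role, "content": content})


store = ConversationStore("You are a helpful assistant.")
store.add("alice", "user", "Hello!")
store.add("bob", "user", "Bonjour!")

# Each user's context is independent of every other user's:
print(len(store.messages_for("alice")))  # → 2 (system + 1 user message)
```

To get a reply for a given user, you would pass `store.messages_for(user_id)` as the `messages` argument of the Chat Completions call, then `store.add(user_id, "assistant", reply)` so the answer is part of that user's context next time.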

I understand, but my question is: can my model be accessed through the OpenAI API by multiple users at the same time? Will OpenAI be able to answer many users in parallel, each with their own history?

Completions and Chat Completions are completely stateless. They only “see” what you send them - you have to send the whole conversation each time to get an inference. And since you’re constructing the whole conversation object each time, it’s up to you what you put in it (or leave out).
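Because each request is stateless and self-contained, concurrent users are just parallel, independent requests (subject to your account's rate limits). A sketch of that shape, where `call_model` is a stand-in I invented for the real `chat.completions.create` call (which would need an API key):

```python
# Sketch: serving several users concurrently. Each request carries its own
# full conversation, so requests never interfere with one another.
from concurrent.futures import ThreadPoolExecutor


def call_model(messages):
    # Stand-in for the real API call; here it just echoes the last message.
    return f"echo: {messages[-1]['content']}"


def handle_request(user_id, history, user_text):
    # Build this user's full context from scratch for this one request.
    messages = history + [{"role": "user", "content": user_text}]
    reply = call_model(messages)
    return user_id, reply


users = {"alice": [], "bob": []}  # per-user histories (empty here)
with ThreadPoolExecutor() as pool:
    futures = [
        pool.submit(handle_request, uid, hist, f"hi from {uid}")
        for uid, hist in users.items()
    ]
    results = dict(f.result() for f in futures)

print(results["alice"])  # → echo: hi from alice
```

The point is that nothing is shared between the two in-flight requests except your own backend's per-user histories, so parallelism on OpenAI's side is not something you have to engineer around.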