Issues with API key - software gets stuck

Maxipa · October 29, 2024, 9:03am

Im a tier 2 user which allows me the following limits

gpt-4o-mini
2.000.000 TPM
5.000 RPM
20.000.000 TPD

Im doing around 7 requests per minute, each request having ±750 input tokens and ±50 output tokens.

My software seems to get stuck though relatively quickly and I don’t know if the issue is my software or OpenAI. Just would love to get an answer here so I know where to start digging

sps · October 29, 2024, 11:58am

@Maxipa You need to make these calls asynchronously for them to be processed concurrently.

merefield · October 29, 2024, 12:14pm

Yeah, your architecture is sub-optimal.

You need a queuing system (aka “batch”) with atomic jobs that retry on failure.

I personally use Sidekiq for this purpose.

Maxipa · October 29, 2024, 3:21pm

Thanks ill give it a shot!

Maxipa · October 29, 2024, 3:21pm

Thank you! I appreciate it!

Maxipa · October 29, 2024, 8:14pm

I have just talked with my programmer who responded

„You need to make these calls asynchronously for them to be processed concurrently.“ - we are already doing this and we couldnt see any errors related to calling chatgpt

We do have a retry mechanism in place

_j · October 29, 2024, 11:22pm

If this is a non-realtime process on an existing data set, you might consider using OpenAI’s own batch processing - you send a file of chat completions requests in API JSONL format for a 50% discount, a 24 hour turnaround for a file with responses (or recently for -mini, a 24 hour “wasn’t run” often).

Without even knowing what language you are using, it is hard to guess at any faults in techniques (although fault one is a programmer calling the API “ChatGPT”).

Maxipa · October 30, 2024, 9:42am

it is hard to guess at any faults in techniques

Absolutely! Please tell me, I am always managing to make the system work for a few minutes until all of a sudden it stops for one hour and works again. Is there any limitation from OpenAI in that scenario when there is always a one hour pause in between it not working and working again?

merefield · October 30, 2024, 11:55am

May I ask, what is your programmers explanation for the failure?

Shouldn’t they be taking ownership of the issue and posting here?

Topic		Replies	Views
Getting a 503 error for multiple requests API chatgpt , api , chat-completion	7	7959	December 18, 2023
Getting 429 errors without hitting limits API	11	2789	December 18, 2023
Concurrent request restriction API gpt-4	5	161	December 23, 2024
429:Too many requests while im in devlopment mode useing gpt3.5 but sent only 5requests today API chatgpt	8	1826	December 15, 2023
Assistants API - Too many requests API gpt-4 , api , assistants	9	1864	November 9, 2023

Issues with API key - software gets stuck

Related topics