Error while using Assistants api

adblkwrm · November 15, 2023, 7:24am

When I am trying to use the Assistants api I have kept getting this error
“last_error”: {
“code”: “rate_limit_exceeded”,
“message”: “You exceeded your current quota, please check your plan and billing details.”
},
But it works fine when using the chat api.
why?

_j · November 15, 2023, 7:30am

rate limit exceeded → the assistant is taking to much context length and looping and burning tokens until maxing out your account’s token-per-minute limit.

The rate limit saved your account from being emptied by this thing.

Not for experimentation. No success stories. Only $5 questions and $200+ an hour. Do not use.

Foxalabs · November 15, 2023, 7:33am

What Tier level is your account? and what are you attempting to do with the API?

adblkwrm · November 15, 2023, 7:38am

Assistants api is used by another account, just want to test new features

Foxalabs · November 15, 2023, 7:43am

Not sure I follow, what do you mean “another account” do you own both accounts?

adblkwrm · November 15, 2023, 7:56am

Yes. When I use the Assistants api with another account it tells me "You exceeded your current quota, please check your plan and billing details. "But this is normal when using the chat api.

merefield · November 15, 2023, 8:09am

Are you saying the Assistants API is flawed because it is too expensive and can too easily run away with itself causing too much cost?

adblkwrm · November 15, 2023, 8:14am

So in fact, I did not understand the meaning of his answer.

Foxalabs · November 15, 2023, 8:14am

Are you using multiple API keys within the same application?

_j · November 15, 2023, 8:15am

Basically yes. Unlike ChatGPT where the conversation is clipped to where people complain it doesn’t remember anything, assistants will run up the conversation to the maximum of the model when you continue to chat. 128k.

Also, when running your own vector database, for example with 1MB of your company’s tech support knowledge base and product offerings, you might have a threshold where only the top 5 chunks are fed to the AI, and only if they meet a semantic similarity threshold. Not the case with assistants - if you ask “how’s your day going”, the AI gets maximum retrieval placed into the context window.

Those are prices and anecdotes taken right from the forum. The AI looping until it hits your API rate limit and you get no answer. AI looping, calling your API over and over with the same query.

Until they offer transparency about billing and realitime per-call token usage, and allow controls over data and iterations similar to what a reasonable person may program themselves, I would have to say “program yourself”.

adblkwrm · November 15, 2023, 8:17am

No, I’ve only used that one key in the same application

merefield · November 15, 2023, 8:18am

Thanks, useful info.

You nailed it - my “home grown” bot picks a limited set of the semantic results currently and that limits the context/cost impact. It also has a failsafe so it doesn’t loop over a certain amount of times. I’m surprised there isn’t this safeguard?! That’s a showstopper for Production adoption of the Assistants API, surely?!

Sorry to take the Topic off on a tangent … but that’s critical information

Clearly, Assistants API service needs more work and more thought applied to it - but I guess that’s the point of the “preview” phase …

Foxalabs · November 15, 2023, 8:19am

Ok, I see, so how are you accessing another accounts assistants?

Foxalabs · November 15, 2023, 8:22am

It is all free while data is gathered, this is part of beta software development.

merefield · November 15, 2023, 8:24am

Fair enough (but only for two more days?!).

But this is basic stuff. Come on Open AI!

adblkwrm · November 15, 2023, 8:28am

Log in to another account to perform the operation. I’m not sure that’s what you want to ask.

adblkwrm · November 15, 2023, 8:30am

So we need to limit the search before we do it? What should I do?

Foxalabs · November 15, 2023, 8:30am

Ok, there is no login in with the API, you specify an API key and that is your credentials. Are you making API calls or using GPTs or… can you post a code snippet of your API calls please.

merefield · November 15, 2023, 8:37am

Wait until Open AI fixes this and improves the approach/algorithm before pushing this live.

adblkwrm · November 15, 2023, 8:38am

I followed the api documentation, and the code was pretty much the same.

Topic		Replies	Views
A single Assistant API method call exceeds Rate limit? Need advice API	5	2764	March 21, 2024
Is there a limit on the usage of the Assistant that is currently in beta release? Community assistants	10	1317	November 27, 2023
Assistants API (gpt-3.5-turbo-16k) usage exceeds limit due to message loop Bugs gpt-35 , gpt-35-turbo , chatgpt	18	6687	December 23, 2023
How to use Assistants with 128k? API gpt-4 , api	12	1331	January 22, 2024
Hitting Rate Limits with Multiple Assistant Calls on Tier 5 Account Bugs assistants-api	16	2102	March 8, 2024

Error while using Assistants api

Related topics