Error while using Assistants api

_j · November 15, 2023, 8:15am

Basically yes. Unlike ChatGPT where the conversation is clipped to where people complain it doesn’t remember anything, assistants will run up the conversation to the maximum of the model when you continue to chat. 128k.

Also, when running your own vector database, for example with 1MB of your company’s tech support knowledge base and product offerings, you might have a threshold where only the top 5 chunks are fed to the AI, and only if they meet a semantic similarity threshold. Not the case with assistants - if you ask “how’s your day going”, the AI gets maximum retrieval placed into the context window.

Those are prices and anecdotes taken right from the forum. The AI looping until it hits your API rate limit and you get no answer. AI looping, calling your API over and over with the same query.

Until they offer transparency about billing and realitime per-call token usage, and allow controls over data and iterations similar to what a reasonable person may program themselves, I would have to say “program yourself”.

Topic		Replies	Views
A single Assistant API method call exceeds Rate limit? Need advice API	5	2622	March 21, 2024
Is there a limit on the usage of the Assistant that is currently in beta release? Community assistants	10	1313	November 27, 2023
Assistants API (gpt-3.5-turbo-16k) usage exceeds limit due to message loop Bugs gpt-35 , gpt-35-turbo , chatgpt	18	6636	December 23, 2023
How to use Assistants with 128k? API gpt-4 , api	12	1294	January 22, 2024
Hitting Rate Limits with Multiple Assistant Calls on Tier 5 Account Bugs assistants-api	16	2052	March 8, 2024

Error while using Assistants api

Related topics