Using the vector store will quickly exhaust the rate limits for any model.
I just ran a couple of requests in the playground and hit the rate limits several times. This is clearly caused by the 20 relevant chunks (of about 800 chars each) that get injected into the prompt for every search against the vector store. To be fair, that is (at minimum) what you want from a search over a large text corpus.
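To get a feel for why this blows through the limits, here is a rough back-of-the-envelope calculation of the token overhead per search. The chunk count and size are the ones from above; the ~4 characters per token is just a common rule of thumb for English text, not an exact figure:

```python
# Rough estimate of the prompt tokens that retrieval injects per request.
CHUNKS_PER_SEARCH = 20   # default number of retrieved chunks (from the post above)
CHARS_PER_CHUNK = 800    # observed chunk size
CHARS_PER_TOKEN = 4      # rough heuristic, not an exact tokenizer count

def retrieval_tokens(chunks: int = CHUNKS_PER_SEARCH,
                     chars_per_chunk: int = CHARS_PER_CHUNK) -> int:
    """Approximate tokens added to the prompt by the retrieved chunks alone."""
    return chunks * chars_per_chunk // CHARS_PER_TOKEN

print(retrieval_tokens())  # ~4000 tokens per search, before the query itself
```

So every search adds on the order of 4,000 prompt tokens before you even count the user's question or the system prompt, and a handful of those per minute is enough to trip a tokens-per-minute limit on lower tiers. If the API exposes a way to cap the number of returned chunks (for example a `max_num_results` option on the search tool), lowering it would cut this overhead proportionally, at the cost of recall.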
So the vector store seemed like a nice addition, since you don't have to set up a database and compute the embeddings yourself. But it is simply not usable if every other request trips a rate limit.