Is there a way to call the embeddings batch API synchronously?

I saw in LlamaIndex that they use a method called `get_text_embedding_batch`, which takes a list of strings and returns the embeddings in the response. But according to the docs, the Batch API is asynchronous in nature, and its response returns an ID with which you have to track the status of the job.

I just want to confirm: is there a batch API for embeddings that can be called synchronously?

You can send multiple strings to an embedding model in a single request, which isn't the same thing as using the Batch API, which lets you submit many requests asynchronously.

With just the embeddings endpoint you can send up to 2,048 strings in an array.

See:

https://platform.openai.com/docs/api-reference/embeddings/create#embeddings-create-input

> **input** (string or array), Required
>
> Input text to embed, encoded as a string or array of tokens. To embed multiple inputs in a single request, pass an array of strings or array of token arrays. The input must not exceed the max input tokens for the model (8192 tokens for `text-embedding-ada-002`), cannot be an empty string, and any array must be 2048 dimensions or less. Example Python code for counting tokens.
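
For illustration, here's a minimal sketch of that single synchronous request using the official `openai` Python SDK. The model name and the example texts are just placeholders:

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

texts = ["first document", "second document", "third document"]

# One synchronous request with many inputs: the response contains
# one embedding per input string, in the same order as the input list.
response = client.embeddings.create(
    model="text-embedding-3-small",  # example model name
    input=texts,                     # up to 2,048 strings per request
)

embeddings = [item.embedding for item in response.data]
print(len(embeddings), len(embeddings[0]))
```

The call blocks until all embeddings are computed, so there is no job ID to poll.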

The Batch API just lets you queue up to 50,000 such requests (up to a file size of 100 MB, anyway), as sketched below.
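
For contrast, here's a rough sketch of the asynchronous Batch API flow, where each line of an uploaded JSONL file is one full `/v1/embeddings` request (file name and model are placeholders; the JSONL shape follows the Batch API docs). Note that the response is only a job ID, which is the behavior the original question describes:

```python
import json
from openai import OpenAI

client = OpenAI()

# Each JSONL line is a complete request against the embeddings endpoint.
with open("requests.jsonl", "w") as f:
    for i, text in enumerate(["first document", "second document"]):
        f.write(json.dumps({
            "custom_id": f"req-{i}",
            "method": "POST",
            "url": "/v1/embeddings",
            "body": {"model": "text-embedding-3-small", "input": text},
        }) + "\n")

# Upload the file, then create the batch job.
batch_file = client.files.create(file=open("requests.jsonl", "rb"), purpose="batch")
batch = client.batches.create(
    input_file_id=batch_file.id,
    endpoint="/v1/embeddings",
    completion_window="24h",
)

# You get back a job ID and must poll for completion later,
# e.g. with client.batches.retrieve(batch.id).
print(batch.id, batch.status)  # "validating" -> "in_progress" -> "completed"
```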


Thanks @anon22939549 for the reply.

Is there any difference in pricing between the two? For example, one API call with a list of 100 strings vs. 100 API calls, each with a different string?

Same price either way. But sending one request is faster and easier, and depending on the exact nature of your requests and your usage tier, it will help you manage your rate limits better.