Why is Openai Embeddings API returning multiple vectors for one very long string?

tylersuard · July 9, 2023, 5:16am

I am using the Embeddings API. I am using the text-embedding-ada-002 model, which has a max token length of 8191. My string is much shorter, only 3000 characters, but it is still returning multiple embedding vectors for that one string. What is going on here?

Foxalabs · July 9, 2023, 5:34am

Welcome to the forum!

Can you please show your code and output, trying to work out if you are seeing the contents of a single embedding, which is a multi dimensional array, or actually many embeddings returned.

tylersuard · July 9, 2023, 5:52am

It has to do with one of the quirks of Langchain. I was doing embeddings.embed(text) when I should have been doing embeddings.embed([text])

Topic		Replies	Views
Question on Embedding - Embedding Length is uniform? API	4	753	December 18, 2023
Understanding "text-embedding-ada-002" vector length of 1536 API	5	22312	January 21, 2024
Embedding tokens vs embedding strings? API	12	8308	February 11, 2024
Problems using Embedding API API embeddings	2	2554	December 18, 2023
Embedding model token limit exceeding limit while using batch requests API embeddings , token , batching	8	24984	October 15, 2023

Why is Openai Embeddings API returning multiple vectors for one very long string?

Related topics