Parallelism/scaling in embedding endpoint

Yes, you're right, thank you for the additional context. I do need to investigate making parallel requests, though honestly this is something the API should handle for me. I noticed https://github.com/openai/openai-cookbook/blob/main/examples/api_request_parallel_processor.py, but I will have to port it to the language I am working with.
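In the meantime, here's the rough shape of what I'd be porting: a minimal sketch of fanning embedding requests out over a thread pool, in Python for illustration. `embed_batch` is a stand-in for the actual HTTP call to the embeddings endpoint (stubbed here so the snippet runs on its own); the real cookbook script also handles rate limits and retries, which this sketch does not.

```python
from concurrent.futures import ThreadPoolExecutor

def embed_batch(batch):
    # Placeholder for the real embeddings API call; returns one fake
    # vector per input so the example is self-contained.
    return [[float(len(text))] for text in batch]

def embed_all(texts, batch_size=4, max_workers=8):
    # Split inputs into batches, then run batches concurrently.
    batches = [texts[i:i + batch_size] for i in range(0, len(texts), batch_size)]
    vectors = []
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map preserves batch order, so outputs line up with inputs.
        for result in pool.map(embed_batch, batches):
            vectors.extend(result)
    return vectors

if __name__ == "__main__":
    vecs = embed_all([f"doc {i}" for i in range(10)])
    print(len(vecs))  # 10
```

Threads are fine here since the work is I/O-bound; swapping the stub for a real request and adding backoff on 429s is where the cookbook script earns its keep.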

What are you referring to with language inference? Are we still talking about embeddings? :slight_smile: