I passed a 1 GB file to OpenAI's text-embedding-3-small model and I'm getting an OOM error. These are the limits applied to each API call (using multithreading):
MAX_TOKENS_PER_DOC = 8192
MAX_DOCS_PER_BATCH = 2048
MAX_TOKENS_PER_BATCH = 300000
Is there a way to achieve high throughput, so we can embed files that are gigabytes in size in much less time? How can I speed up the embedding process?
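For context on what I've tried: a common cause of OOM here is loading the whole file and all batches into memory at once. Below is a minimal sketch (names like `batch_stream` are my own, and the whitespace-based token count stands in for a real tokenizer such as tiktoken's `cl100k_base`) of a generator that streams documents and yields batches respecting both limits, so only one batch lives in memory at a time:

```python
from typing import Callable, Iterable, Iterator, List

MAX_DOCS_PER_BATCH = 2048
MAX_TOKENS_PER_BATCH = 300_000

def batch_stream(
    docs: Iterable[str],
    count_tokens: Callable[[str], int],
) -> Iterator[List[str]]:
    """Yield batches that stay under both the doc-count and token limits.

    Consumes `docs` lazily, so a multi-GB corpus never needs to be
    fully resident in memory.
    """
    batch: List[str] = []
    batch_tokens = 0
    for doc in docs:
        tokens = count_tokens(doc)
        # Flush the current batch before it would exceed either limit.
        if batch and (
            len(batch) >= MAX_DOCS_PER_BATCH
            or batch_tokens + tokens > MAX_TOKENS_PER_BATCH
        ):
            yield batch
            batch, batch_tokens = [], 0
        batch.append(doc)
        batch_tokens += tokens
    if batch:
        yield batch

# Stand-in token counter; swap in a real tokenizer for production use.
def approx_tokens(text: str) -> int:
    return len(text.split())
```

Each yielded batch can then be submitted to the embeddings endpoint from a bounded thread pool, and the results written to disk immediately instead of accumulated in RAM.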