Token Limit For Embeddings vs. text-davinci-003

@SomebodySysop from my linguistics background I can say that a chunk of almost 4k tokens will contain several ideas, because a complete idea usually fits in one paragraph, or at most three, which works out to roughly 500-600 tokens per idea at most. The goal of vector search is to match one idea (the query) to another idea (the source chunk) as closely as possible. Embedding more than one idea per chunk dilutes the precision of the vector search (the concept match) and makes a near-perfect match almost unachievable.
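
To make that intuition concrete, here is a minimal sketch of how the query-to-chunk match is scored. It assumes numpy and a hypothetical `embed()` helper (not any specific API); the commented usage lines are illustrative only:

```python
# Minimal sketch: scoring a query against a chunk by cosine similarity.
# embed() is a hypothetical function returning one embedding vector per text.
import numpy as np

def cosine_similarity(a: np.ndarray, b: np.ndarray) -> float:
    """Cosine similarity between two embedding vectors."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

# query_vec = embed("When was the contract signed?")      # one idea
# chunk_vec = embed(paragraph_about_the_signing)          # one idea
# doc_vec   = embed(whole_4k_token_document)              # many ideas blended
# In line with the argument above, cosine_similarity(query_vec, chunk_vec)
# will typically beat cosine_similarity(query_vec, doc_vec), because the
# long document's vector averages several unrelated concepts.
```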

Embedding chunks of text that big makes sense when you need vectors for clustering or classifying entire documents, or for subject-level search. But when you need to search for facts inside the documents, you need precision, and in that case it doesn't make sense to me to vectorize texts longer than one "idea" (1-3 paragraphs, or roughly 200-600 tokens).
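
If it helps, here is a rough sketch of what I mean by chunking per "idea". It assumes paragraphs separated by blank lines and uses tiktoken for token counting; the 600-token budget is just illustrative, not a fixed recipe:

```python
# Sketch: group consecutive paragraphs into ~one-idea chunks under a token budget.
import tiktoken

ENC = tiktoken.get_encoding("cl100k_base")
MAX_TOKENS = 600  # roughly one "idea": 1-3 paragraphs

def chunk_by_idea(text: str, max_tokens: int = MAX_TOKENS) -> list[str]:
    """Split text on blank lines, then pack paragraphs until the budget is hit."""
    paragraphs = [p.strip() for p in text.split("\n\n") if p.strip()]
    chunks, current, current_tokens = [], [], 0
    for para in paragraphs:
        n = len(ENC.encode(para))
        # Start a new chunk if adding this paragraph would exceed the budget.
        if current and current_tokens + n > max_tokens:
            chunks.append("\n\n".join(current))
            current, current_tokens = [], 0
        current.append(para)
        current_tokens += n
    if current:
        chunks.append("\n\n".join(current))
    return chunks
```

The point of packing whole paragraphs rather than cutting at an arbitrary token count is that a paragraph boundary is usually also an idea boundary, so each chunk you embed stays close to a single concept.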

I would revisit your approach and check whether that is the underlying issue…
