I experimented with the Assistant in Playground by enabling the retrieval feature and uploading an ebook as a text file, roughly 120K tokens in size. I instructed the assistant to impersonate a character from the book and to reference the text file in its responses.
Monitoring the usage dashboard, I observed that each query I posed consumed about 30K tokens of context input.
Here’s what I think happens:
- Once the ebook is uploaded, OpenAI splits it into approximately four chunks, each close to 30K tokens.
- Then, embeddings are created for these segments and stored in a vector database.
- For each question I ask, the retrieval function searches the vector DB, selects the chunk most relevant to my query, and incorporates the actual text of that chunk into the prompt as additional context.
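To make sure I'm describing the pipeline correctly, here is a minimal sketch of the flow I have in mind. This is my own toy illustration, not OpenAI's actual implementation: `chunk_text`, `embed`, and `retrieve` are hypothetical helpers, the chunker splits on words rather than real tokens, and the "embedding" is just a bag-of-words vector with cosine similarity standing in for a learned embedding model and vector DB.

```python
import math
from collections import Counter

def chunk_text(text, chunk_size):
    # Step 1: split the document into fixed-size chunks.
    # (Word-based here; the real system would count tokens.)
    words = text.split()
    return [" ".join(words[i:i + chunk_size])
            for i in range(0, len(words), chunk_size)]

def embed(text):
    # Step 2: embed each chunk. Toy stand-in: a bag-of-words
    # frequency vector instead of a learned embedding.
    return Counter(text.lower().split())

def cosine(a, b):
    dot = sum(a[w] * b[w] for w in a if w in b)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query, chunks):
    # Step 3: at query time, return the chunk most similar
    # to the question; its full text goes into the prompt.
    q = embed(query)
    return max(chunks, key=lambda c: cosine(q, embed(c)))

# Tiny fake "ebook" for demonstration.
book = ("the dragon guarded the mountain " * 5
        + "the knight rode a horse to the castle " * 5)

# Smaller chunk_size -> less text injected per question,
# which is exactly the knob I'm hoping is configurable.
chunks = chunk_text(book, chunk_size=30)
context = retrieve("who rode the horse", chunks)
prompt = f"Context:\n{context}\n\nQuestion: who rode the horse"
```

If this matches what retrieval actually does, then the ~30K-token context I'm seeing would just be one retrieved chunk, and shrinking the chunk size would directly shrink the per-question token cost.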
Is my interpretation accurate? And if so, is there a setting that controls how the text is divided, so I could use smaller chunks? I'm looking to reduce the token count added to the prompt for each question.
Thank you for your insights.