The Assistants API accepts files up to 512 MB. It has the tool myfiles_browser, which exposes an open_url function. When you ask for a summary of a file, it calls open_url and processes the result.
However, 512 MB is roughly 128M tokens, about 1,000 times larger than the 128k-token context window of the largest-capacity model, GPT-4-turbo.
Does it call a hidden function to summarize recursively, or does it do something else?
My understanding is that file upload is basically RAG, is it not? If so, the file must be fragmented so that each fragment can be associated with a vector. But I have no idea how that would help build a summary.
It is ‘chunking’ your file, embedding each chunk, and storing the chunks in a vector database. Breaking the file down keeps each piece within the token limit, enabling the system to handle more tokens in aggregate.
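The chunk-and-embed pipeline described above can be sketched roughly like this. This is an illustrative sketch, not OpenAI's actual implementation: the chunk size, overlap, and the toy letter-frequency "embedding" are all stand-ins (a real system would call an embedding model and a proper vector store).

```python
from math import sqrt

def chunk(text: str, size: int = 800, overlap: int = 100) -> list[str]:
    """Split text into overlapping chunks so no single piece exceeds the context window."""
    step = size - overlap
    return [text[i:i + size] for i in range(0, max(len(text) - overlap, 1), step)]

def embed(text: str) -> list[float]:
    """Toy stand-in for a real embedding model: normalized 26-dim letter-frequency vector."""
    vec = [0.0] * 26
    for ch in text.lower():
        if 'a' <= ch <= 'z':
            vec[ord(ch) - ord('a')] += 1.0
    norm = sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]

def cosine(a: list[float], b: list[float]) -> float:
    return sum(x * y for x, y in zip(a, b))

def build_index(text: str) -> list[tuple[list[float], str]]:
    """'Vector database': a list of (embedding, chunk) pairs."""
    return [(embed(c), c) for c in chunk(text)]

def retrieve(index, query: str, k: int = 3) -> list[str]:
    """Return the k chunks most similar to the query; only these go into the prompt."""
    q = embed(query)
    scored = sorted(index, key=lambda pair: cosine(pair[0], q), reverse=True)
    return [c for _, c in scored[:k]]
```

At query time only the top-k retrieved chunks are placed in the prompt, which is why the whole 512 MB file never needs to fit in the context window at once.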