Is there any way by which I can let GPT-4 API summarize large PDF texts?

thanks for clarifying.

Unfortunately, every API call is treated separately - there is no “memory function” or some such. Depending on the specifics of what you want your summary to look like there are different approaches that you consider.

Option 1 involves summarizing all PDFs individually and then creating an aggregate summary that combines the individual summaries.

Option 2 would be a model whereby you summarize one document and then include the summary of that document as additional context for the summary of the next document. This way you are establishing some relationship between the summaries.

When you use one of the models with a longer context window, i.e. GPT-4-turbo models, you of course have the option to include multiple documents at a time for summarizing reducing the number of API calls required to summarize all 60+ PDFs.

I’ll dig up some additional threads discussing summarization approaches shortly.

EDITED: Here’s a thread on the topic that you may find helpful - might add others later:

3 Likes