Optimizing Retrieval Time for Single-Page PDF Data Extraction

Sylvester1212 · December 8, 2023, 8:19pm

We’ve been testing the retrieval process for single-page PDFs containing minimal content, specifically extracting person’s names, times, and dates. However, each test takes approximately 25 seconds to complete. Is there a way to improve and expedite the retrieval process for such simple data extraction tasks? We’re looking for suggestions to optimize the speed while maintaining accuracy. Your insights and recommendations would be greatly appreciated!

_j · December 8, 2023, 8:23pm

Without information about the techniques, endpoints, and models you are using, along with a bit more details, I have no guess if you are either doing something completely backwards or have already invented breakthrough techniques unparalleled in industry…

If time-sensitive, the biggest boost would be in making parallel calls: chunking the documentation into a blast of dozens of calls with a completion in seconds expected. You then just have the likely requirement of algorithmic de-duplication.

Topic		Replies	Views
Efficiently Interacting with super super Long PDFs/documents API gpt-4	0	753	November 13, 2023
Text-davinci-003 response time slowing beyond 30-45 seconds for completion API api	2	377	December 25, 2023
Best way to process PDF File that has over 100k lines? API embeddings , gpt-35-turbo , api	5	5759	November 27, 2023
Completion Speeds - ridiculously Slow - waiting over a minute Community chatgpt , api	4	992	May 17, 2023
Completion Speeds - How can we optimise speeds! URGENTLY! API	8	1290	December 25, 2023

Optimizing Retrieval Time for Single-Page PDF Data Extraction

Related Topics