PDF hierarchy extraction not fully done stops abruptly with partial output

k_v_mahesh · May 18, 2024, 3:02am

I have a pdf roughly 2 MB having 20 to 30 pages max. Using gpt4-turbo model.

When I pass the pdf asking to format in specific way section, sub section, title as per pdf structure it does for 2 pages and stops.

How do I get it for entire document. I need to pass the entire document for model to get the full context and continuity across pages.

I tried to split pdf as pages to pass to model and later tried to assemble it my self…still it stops at some random point with full completion of documents.

What am I missing or is there different way to prompt. 128k is large context for my doc should be easily manage able.

Topic		Replies	Views
Is there any way by which I can let GPT-4 API summarize large PDF texts? API gpt-4 , api	10	8083	May 6, 2024
What are the limitations of GPT-4 in analyzing PDF text? Prompting gpt-4	6	20704	March 12, 2024
Sending large document via API call and asking for a question over complete document? Prompting api	3	1170	February 26, 2024
Obtaining correct PDF page number in the response using GPTs Prompting gpt-4 , gpts	10	2679	August 13, 2024
Gpt-4-1106-preview does not read the whole file like web ChatGpt4 does API	5	696	January 25, 2024

PDF hierarchy extraction not fully done stops abruptly with partial output

Related Topics