PDF or DOCX - which is easier for GPT to read?

Sciss99 · July 16, 2024, 4:41am

I use GPT-4 for analyzing and processing my own university course notes. The whole set consists of about 20 docx files.

I found that uploading 20 docx files is not working well. So, I integrated all of the files into one single file, separating the different documents by giving each one a level 1 header, and then produced a meta-index on page one. However, I am not sure if GPT-4 can really read the file well.

So, I converted all 20 docx files into PDF files, then merged all with DEFTPDF DOT com, and again created a meta-index on page one. But then again, when I upload the file and ask about sources, after an analysis, page numbers and even cited text parts are all invented and wrong.

So, what format is better for GPT-4, PDF or Docx? Can it really read one or the other, or does it create for itself a sort of summary after uploading, and thus never can actually read single pages or topics, or understand the structure of a text at all?

masonrogers · April 24, 2025, 2:00am

I am wondering the exact same. Has anyone been able to help with this question?

tboorman-p · July 27, 2025, 5:44pm

Also wondering this.
In general have folks had better results uploading .docx or .pdf files?
Does length of document or number of documents steer individuals one way or the other.

DeadlyNightshade · November 17, 2025, 4:12pm

I have found that the context window can easily overflow when uploading a large pdf so I have used stages of refinement to reach more specific answers from a pdf. This is annoying, topic specific, and not scalable. So for now I have found that doing ctrl+f and finding groups of pages referencing something and just handing only that page set is most effective (keep context window small)

Topic		Replies	Views
What are the limitations of GPT-4 in analyzing PDF text? Prompting gpt-4	7	35064	December 28, 2025
Maximizing CustomGPT Performance: Exploring Alternative File Formats to PDFs GPT builders gpt-4 , pdf , custom-gpt	0	1025	March 4, 2024
My GPT - Knowledge base - Best practices GPT builders	8	24825	December 28, 2025
What is the best type of format to use for uploaded documents for GPTs? Plugin store gpts	3	4504	January 6, 2024
What is the best way to parse a PDF file with ChatGPT? API	10	52128	January 10, 2026

PDF or DOCX - which is easier for GPT to read?

Related topics