I use GPT-4 to analyze and process my own university course notes. The whole set consists of about 20 DOCX files.
I found that uploading 20 separate DOCX files does not work well. So I merged all of them into a single file, gave each original document a level 1 heading to keep them apart, and added a meta-index on page one. However, I am not sure whether GPT-4 can really read that file properly.
Next, I converted all 20 DOCX files to PDF, merged them with deftpdf.com, and again added a meta-index on page one. But when I upload that file, request an analysis, and then ask for sources, the page numbers and even the quoted passages are invented and wrong.
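To make "invented and wrong" concrete, this is the kind of check a cited passage can be put through. It is only a minimal sketch: it assumes the text of each PDF page has already been extracted into a list of strings (the extraction step is not shown), and the names `find_quote` and `check_citation` as well as the sample pages are hypothetical, made up for illustration.

```python
import re


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so line breaks do not hide a match."""
    return re.sub(r"\s+", " ", text).strip().lower()


def find_quote(pages: list[str], quote: str) -> list[int]:
    """Return the 1-based numbers of all pages whose text contains the quote."""
    needle = normalize(quote)
    if not needle:
        return []
    return [
        number
        for number, page in enumerate(pages, start=1)
        if needle in normalize(page)
    ]


def check_citation(pages: list[str], quote: str, cited_page: int) -> str:
    """Compare a quote and the page number given for it against the real pages."""
    found = find_quote(pages, quote)
    if not found:
        return "quote not found"
    if cited_page in found:
        return "ok"
    return f"wrong page: found on page(s) {found}"


# Hypothetical sample data standing in for the extracted page texts.
pages = [
    "Index\nLecture 1 ... page 2\nLecture 2 ... page 3",
    "Lecture 1\nEntropy measures\nuncertainty.",
    "Lecture 2\nGradient descent minimizes a loss.",
]

print(check_citation(pages, "Entropy measures uncertainty", 2))
print(check_citation(pages, "Entropy measures uncertainty", 3))
print(check_citation(pages, "Bayes theorem", 1))
```

The match is a plain substring test after whitespace normalization, so it only catches verbatim quotes; a paraphrased passage would be reported as not found even if the content is on the page.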
So, which format is better for GPT-4, PDF or DOCX? Can it really read either one, or does it build some sort of summary for itself after the upload, so that it can never actually read individual pages or topics, or understand the structure of a text at all?