Hey everyone,
I’m working with GPT-4o both via the API and in ChatGPT for tasks related to PDFs. I’ve noticed a significant difference in performance between the two. Specifically, ChatGPT (web) is outperforming the API when it comes to intelligently identifying and extracting labels from PDFs that contain a lot of unstructured or random data.
My goal is to extract meaningful labels and map them to form widgets in the PDF. This is necessary because the widget field names are often generic, like “Text Field 1”, and don’t offer any context. ChatGPT handles this quite well, inferring logical labels and relationships, but the API responses are not nearly as accurate or useful.
Has anyone else experienced this kind of discrepancy between the API and ChatGPT web interface?
Also, I’d appreciate any best practices or tips for working with PDFs and form fields in this context.
Thanks!