Does ChatGPT’s browsing tool extract JSON-LD (schema) along with visible HTML?

eslam_saker · June 8, 2025, 4:42pm

Hi everyone, I’m trying to understand exactly how ChatGPT’s browsing feature works under the hood. In particular:

When using the browsing plugin, does the tool fetch both visible page content (e.g. <h1>, <p>, etc.) and any <script type="application/ld+json"> blocks?
If so, is there any official documentation or authoritative statement from OpenAI confirming that JSON-LD is scraped and merged into the text sent to the model?
My own tests show that when a page includes schema markup, ChatGPT’s answers include details (emails, trial lengths, certifications) that are only present in the JSON-LD—not visible in plain HTML. This suggests the browsing tool is harvesting that structured data. Can anyone point me to definitive docs or clarify how the browsing tool handles schema?

Thanks in advance for any insights or links to relevant docs!

sergeliatko · June 9, 2025, 7:04am

Not sure, but what I saw in responses it gave me that data was ignored unless available in body.

Topic		Replies	Views
Understanding ChatGPT web browsing - methodologies for accessing and interpreting web pages Plugins / Actions builders gpt-4	3	6442	May 15, 2023
Learning More About GPTs Browsing Functionality GPT builders	2	1427	January 29, 2024
GPTBot - how does it scrape JavaScript SPAs? Documentation chatgpt	1	837	September 28, 2023
Query on how ChatGPT parses HTML Inputs API gpt-4 , chatgpt , api	0	289	May 31, 2024
Do chatGPT 4o Plus API has web browsing access API	4	6934	February 20, 2025