Hi everyone, I’m trying to understand exactly how ChatGPT’s browsing feature works under the hood. In particular:
- When using the browsing plugin, does the tool fetch both visible page content (e.g.
<h1>
,<p>
, etc.) and any<script type="application/ld+json">
blocks? - If so, is there any official documentation or authoritative statement from OpenAI confirming that JSON-LD is scraped and merged into the text sent to the model?
- My own tests show that when a page includes schema markup, ChatGPT’s answers include details (emails, trial lengths, certifications) that are only present in the JSON-LD—not visible in plain HTML. This suggests the browsing tool is harvesting that structured data. Can anyone point me to definitive docs or clarify how the browsing tool handles schema?
Thanks in advance for any insights or links to relevant docs!