API Limitation: is there no way to retrieve the web searches that were used for a completion?

farmerhk · July 10, 2024, 5:28pm

Context

I’m building an app such as the following: I share a topic I want to learn more about and I ask ChatGPT to share 3 links I can read to learn more.
I’m using the API currently (specifically with Assistants, but I’m flexible).
The API will respond with links, but over 50% are varied types of hallucinations: broken links, 404 pages, etc.
When I use the ChatGPT UI, I see that there’s a “Searched X sites” (X varies per completion), that shows referenced links, and those are great!

Is there a way to programmatically get the links which were referenced? Based on my look at the docs and checks in the OpenAI Discord, I believe the answer is no.

farmerhk · July 10, 2024, 5:33pm

For anyone that’s also interested in possible resolutions to the core application need, here are some I’m aware of:

Other APIs: I might be better off using other APIs such as Google or Bing searches to find links.
Function Calling: I’ve considered that I could create a function to validate the “links” which are returned in the output.
Fine-Tuning or RAG: I considered these briefly, but I don’t believe either are relevant solutions. This doesn’t seem like a context problem. It’s a slight behavioral problem, but maybe too inherent to LLMs to be solvable without some extreme fine-tuning.
Alternative Out of the Box Solutions: Claude seems to have similar out-of-the-box limitations. However, the Perplexity LLM might be well-suited for this challenge out-of-the-box. Anecdotally, it worked well on quick tests.

jr.2509 · July 10, 2024, 5:43pm

This is the way to go. The API - unlike ChatGPT - does not come natively with the ability to search the web and the issue you are currently experiencing is the “expected behaviour”. A common approach to replicate the functionality using the API is to use function calling and then rely on search APIs such as the ones you mentioned to obtain actual links.

farmerhk · July 10, 2024, 5:44pm

Ah, key detail there that I missed before: “The API - unlike ChatGPT - does not come natively with the ability to search the web…”

I’ll check the docs to confirm that but was not aware before. I assumed the search was happening in the backend but not being exposed.

jr.2509 · July 10, 2024, 5:45pm

No worries. This is not explicitly addressed in the docs however.

farmerhk · July 10, 2024, 6:07pm

Woops - accidentally deleted my own response

Thanks to @jr.2509 for the pointers. Those led me to follow up searches and I realized there’s existing conversation on this.

In addition to @jr.2509’s responses here and elsewhere on the forums, here’s one useful thread: How to implement GPT4 API with internet access? - #9 by raymondyeh

And another here: How to get GPT-4 API access with Internet

Topic		Replies	Views
ChatGPT can now access the live Internet. Can the API? API	15	72896	January 5, 2025
Connect via API to the web for APA references API chatgpt , gpt-4-turbo	2	1512	December 25, 2023
API for searching the latest information on the internet API gpt-4 , chatgpt , plugin-development , api , chatgpt-plugin	10	8519	February 20, 2025
Gpt4o api not searching the web? API	7	2723	October 3, 2024
GPT4 API Internet Access -- What to Do API	5	8726	May 2, 2024

API Limitation: is there no way to retrieve the web searches that were used for a completion?

Related topics