Query string improperly formatted for API REST call to wikidata

paul.fishwick · December 30, 2023, 9:34pm

I am building a Wikidata helper by using an action. When I say “list all cats”, this is how the GPT formulates the REST HTTP endpoint call:

[debug] Calling HTTP endpoint
{
“domain”: “query.wikidata.org”,
“method”: “get”,
“path”: “/query”,
“operation”: “GetQueryResult”,
“operation_hash”: “0377bd9dafa61ec388091f01fba3b83fdb5cdfc9”,
“is_consequential”: false,
“params”: {
“query”: “SELECT ?cat ?catLabel WHERE {\n ?cat wdt:P31 wd:Q146 .\n SERVICE wikibase:label { bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en". }\n}”
}
}

The query is not formulated correctly. If you do a select on the SELECT string by first removing \n and then removing the \ in front of ", this SELECT string works in:

https://query.wikidata.org/

Here is the corrected SPARQL string that I’d like to be the query value:

SELECT ?cat ?catLabel WHERE {?cat wdt:P31 wd:Q146 . SERVICE wikibase:label { bd:serviceParam wikibase:language “[AUTO_LANGUAGE],en”. }}

I removed the \n and the \ in front of ". If you copy and paste this into the query service URL, it works correctly.

I tried multiple ways of getting the GPT to fix this string in Instructions, but failed.

Any ideas on how to proceed? The good news is that the Wikidata helper does a great job at formulating the SPARQL text as long as I do not create an action. But an action would allow us to see the results of the query and not just the SPARQL query.

Diet · December 30, 2023, 10:16pm

Have you tried instructing the model to not include newline characters in the sparql string?

paul.fishwick · December 30, 2023, 11:30pm

Yes I tried that but it ignored the instruction. Perhaps I can rephrase but so far no luck

anon22939549 · December 31, 2023, 5:56am

The two options I see are this,

Continue revising the instructions. The model is generating some formatted text for the SPARQL, most likely because that’s how it appears in most of the training data it has been exposed to. One possible way to rectify this is by including a one-shot to few-shot example of the construction of a properly formatted query in your instructions with a note that the query will be sent as a parameter in a REST API call.
Create a workaround. On a computer you control (e.g. not query.wikidata.org) set up essentially an API proxy relay. The computer you control accepts the query however it is created by the model, rectifies it into something which will be accepted by the endpoint, makes the request, receives the response and forwards it back to the model. It’s an extra hop in the chain but shouldn’t add too much latency overall, plus it gives you a lot more control and let’s you do things like cache API calls, etc if the response is unlikely to change in realtime.

The key thing you need to understand is that the models are stochastic, you cannot guarantee they will always correctly follow specific instructions, so it is very useful (and often necessary) to construct backstops to ensure the models cannot screw up too badly.

In time, eventually, that may change and they may get to the point where we can absolutely rely on their unfailing adherence to our instructions, but that’s not today.

paul.fishwick · January 2, 2024, 5:09pm

Really good detailed comments. I am continuing to do #1. I tried one shot, but maybe more than one is a good approach. I am getting GPT-4 usage cap messages, so have to wait a couple of hours to continue testing.

#2 is curious and I follow your logic, but unsure how to go about this proxy relay.

We need more guidance from OpenAI on Actions. Or maybe I have not read enough of what they have in the way of documentation.

Topic		Replies	Views
Summarizing or question answering from long Wikipedia articles? API	25	4027	January 4, 2024
GPT: Action that queries REST API returning hallucinated data GPT builders	7	817	January 22, 2024
Custom GPTs - We NEED Verbatim Outputs OR Longer Instruction Capacity GPT builders chatgpt	23	2279	March 26, 2024
What kind of academic resources has GPT-3 been trained on? API	12	1408	April 20, 2024
Actions refuse to run because "to=api" has been disabled Plugins / Actions builders actions	3	272	July 21, 2024

Query string improperly formatted for API REST call to wikidata

Related topics