Well, this is one reason why OpenAI requires human input in the generation workflow: the model can generate a lot of plausible-sounding but fake information.
You only control the prompt and the settings, so try playing with those. For example, a lower temperature and a sufficiently strict prompt help produce more consistent and predictable results.
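As a rough sketch of what "lower temperature and a strict prompt" looks like in practice, here is how one might build a chat-completion request payload. The model name and prompt wording are placeholders, not recommendations; the point is the `temperature` setting and the restrictive system message.

```python
# Sketch: biasing generation settings toward consistent, predictable output.
# Model name and prompts are illustrative placeholders.

def build_request(question: str) -> dict:
    """Build a chat-completion payload biased toward consistency."""
    return {
        "model": "gpt-4o-mini",  # assumption: any chat model works here
        "temperature": 0,        # low temperature -> less random sampling
        "messages": [
            {
                "role": "system",
                "content": (
                    "Answer only from well-established facts. "
                    "If you are not sure, say 'I don't know'. "
                    "Do not invent names, dates, or citations."
                ),
            },
            {"role": "user", "content": question},
        ],
    }

payload = build_request("When was Wikidata launched?")
```

The same payload shape works whether you send it through the official client library or a plain HTTP POST.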
I’m not an ML engineer, but I can imagine this issue could be mitigated by comparing the output against a set of facts from a fact-based knowledge base (a knowledge graph?) and classifying each claim as true or not:
1. Get the model's output
2. Extract the entities and the connections between them
3. Identify the claims the text makes (based on the connections between the entities)
4. Check each claim against the knowledge base and flag the unsupported ones
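The steps above can be sketched as a toy pipeline. Here the knowledge base is a hand-written set of (subject, relation, object) triples and the extraction step is a stub; in a real system that step would be an NER and relation-extraction model, not hard-coded patterns.

```python
# Toy sketch: check claims extracted from model output against a
# knowledge base of (subject, relation, object) triples.

KNOWLEDGE_BASE = {
    ("Paris", "capital_of", "France"),
    ("Wikidata", "operated_by", "Wikimedia Foundation"),
}

def extract_claims(text: str) -> list:
    # Stub for entity/relation extraction: recognizes only a couple of
    # hard-coded patterns, purely for illustration.
    claims = []
    if "Paris" in text and "France" in text:
        claims.append(("Paris", "capital_of", "France"))
    if "Paris" in text and "Germany" in text:
        claims.append(("Paris", "capital_of", "Germany"))
    return claims

def classify(claims):
    """Label each claim as supported (True) or unsupported (False)."""
    return {claim: claim in KNOWLEDGE_BASE for claim in claims}

verdicts = classify(extract_claims("Paris is the capital of France."))
```

Anything classified as unsupported isn't necessarily false, of course; it may just be missing from the knowledge base, which is the hard part of this approach.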
Wikidata - a structured knowledge base and API connected to Wikipedia and the other Wikimedia projects. The data isn't always verified, but in some cases it can still be useful as a first-pass check.
Wikidata Query Service: https://query.wikidata.org
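For reference, the query service speaks SPARQL. A minimal sketch, only constructing the request so the example stays offline; sending it with `requests.get(url, params=params)` would return JSON. Q142 is France's Wikidata item and P36 is the "capital" property.

```python
# Sketch: a SPARQL request against the Wikidata Query Service.
# The request is built but not sent, to keep the example self-contained.

url = "https://query.wikidata.org/sparql"

# Ask for the capital of France (item Q142, property P36 = "capital").
query = """
SELECT ?capitalLabel WHERE {
  wd:Q142 wdt:P36 ?capital .
  SERVICE wikibase:label { bd:serviceParam wikibase:language "en". }
}
"""

params = {"query": query, "format": "json"}
```

Note that the service asks clients to set a descriptive User-Agent header and rate-limit themselves, so any real fact-checking loop should batch and cache these lookups.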