How to effectively validate the answers generated by LLMs?

Hello, I am currently using LangChain with GPT-4 to build an SQL agent.
I have prepared 100 questions to test the SQL agent, and I want to evaluate the correctness of its answers.
However, I want to avoid manually verifying the accuracy of these 100 answers.

Are there any other methods to assess the SQL agent’s response accuracy?

Thank you!

One option is to create an adversarial "verifier" agent whose job is to check the generated SQL queries. This agent would take the user's question together with the SQL query produced by your system and cross-check them against the database schema, deciding whether the query is valid for that schema and actually answers the question. A minimal sketch of this idea is shown below.
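
This is only a rough sketch, assuming the `langchain-openai` package and an `OPENAI_API_KEY` in the environment; the helper name `verify_sql` and the prompt wording are illustrative, not part of any LangChain API.

```python
from langchain_openai import ChatOpenAI
from langchain_core.prompts import ChatPromptTemplate

# Verifier model (assumes OPENAI_API_KEY is set in the environment)
llm = ChatOpenAI(model="gpt-4", temperature=0)

verifier_prompt = ChatPromptTemplate.from_messages([
    ("system",
     "You are a strict SQL reviewer. Given a database schema, a user question, "
     "and a candidate SQL query, decide whether the query correctly answers the "
     "question. Reply with PASS or FAIL followed by a one-sentence reason."),
    ("human",
     "Schema:\n{schema}\n\nQuestion:\n{question}\n\nCandidate SQL:\n{query}"),
])

verifier_chain = verifier_prompt | llm


def verify_sql(schema: str, question: str, query: str) -> str:
    """Return the verifier's PASS/FAIL verdict for one generated query."""
    return verifier_chain.invoke(
        {"schema": schema, "question": question, "query": query}
    ).content


# Hypothetical usage over your 100 test cases:
# results = [verify_sql(schema, q, sql) for q, sql in zip(questions, generated_queries)]
```

The verifier's verdicts are themselves LLM output, so they are not ground truth, but they let you triage the 100 answers and only manually review the ones flagged as FAIL (or spot-check a sample of the PASSes).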