Calculating the Confidence Scrore for the Responses to the Prompts in case of Text 2 SQL application

ramichetty99 · July 4, 2024, 10:59am

We are working on the Text 2 SQL application is built using Langchain and Python. Most of the time wth the table data, DDL given the SQL agent is generates the right queries that in turn return back the right response. Using the Langsmith also for measuring the accuracies. The ask is that for every prompt that is issued by the user, what can be the approach to provide the score that the returned response is correct? If anyone has implemented the same the information can be helpful. Here the Model Accuracy is not being looked at the Model Level. It is at the prompt level , Query formed and the response that is returned.

Topic		Replies	Views
Gpt-4o-mini response evaluation Community gpt-4 , rag , evals	3	242	February 17, 2025
How to get a Text to SQL Model to not answer if a question is too vague Prompting langchain , gpt	1	1270	January 23, 2024
How to effectively validate the answers generated by LLMs? Community chatgpt	1	272	November 26, 2024
Text-to-SQL or Text-to-Dataframe Community gpt-4	0	199	July 28, 2024
Structured Output Confidence Score API gpt-4	1	274	March 20, 2025

Calculating the Confidence Scrore for the Responses to the Prompts in case of Text 2 SQL application

Related topics