Evaluating LLM Chat Responses without Evaluation Dataset?

anon10827405 · June 14, 2024, 7:46pm

What do you expect is really what it boils down to.

You can easily give the essence of the “answer” to an LLM for grading. It doesn’t need to be exact. All that matters is that the semantics somewhat resemble what’s expected & capture what you’re expecting

Topic		Replies	Views
Evaluating the effectiveness of text generation API	1	986	November 12, 2021
Need human like response to test the model performance API	3	1458	November 29, 2023
How to evaluate chat conversations (not just question-answer pairs) GPT builders gpts	5	2678	February 15, 2024
Fine-tuning GPT-3 on entire conversations to mimic style and extract relevant knowledge API	13	5017	December 16, 2023
How to get personalized responses? Prompting	13	1093	June 13, 2021

Evaluating LLM Chat Responses without Evaluation Dataset?

Related topics