I'm working on an LLM application and now need to test its performance on a given set of articles. How can I create a set of questions with good, human-like answers, so that I can compare them with the model's generated answers? The questions should cover different types, such as how, why, what, and more.

I need human-like responses to test the model's performance.

20euai044 November 29, 2023, 10:24am 4

I’m using a pre-trained GPT model for my use case, so I need ground truth to evaluate the results and compute metrics.
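
To make the goal concrete, here is a rough sketch of how a ground-truth set could be scored once it exists. It is only an illustration, not an established method from this thread: the token-overlap F1 metric is one common choice among many, and `token_f1`, `evaluate`, `ground_truth`, and `generate_answer` are all hypothetical names. `generate_answer` stands in for whatever function calls the model.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())


def token_f1(reference, generated):
    """Token-overlap F1 between a reference answer and a generated answer."""
    ref_tokens = tokenize(reference)
    gen_tokens = tokenize(generated)
    # If either side is empty, score 1.0 only when both are empty.
    if not ref_tokens or not gen_tokens:
        return float(ref_tokens == gen_tokens)
    # Multiset intersection counts each shared token at most as often
    # as it appears in both answers.
    overlap = sum((Counter(ref_tokens) & Counter(gen_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(gen_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


def evaluate(ground_truth, generate_answer):
    """Average token F1 over all question/answer pairs.

    ground_truth: list of {"question": ..., "answer": ...} dicts (assumed layout).
    generate_answer: hypothetical callable mapping a question to the model's answer.
    """
    scores = [
        token_f1(item["answer"], generate_answer(item["question"]))
        for item in ground_truth
    ]
    return sum(scores) / len(scores)
```

Token overlap only measures wording, not meaning, so a correct paraphrase can score low. It is probably best treated as a first baseline next to human review or a semantic similarity measure.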
