LLM and Prompt Evaluation Frameworks

Diet · September 18, 2024, 8:31am

I wonder if prompt evals actually work, or if they give people a false sense of security

they also seem to be advertising hallucination countermeasures using perplexity. I’m not sure you if you can infer any hallucination probability by just adding logprobs .

Topic		Replies	Views
Tools for Testing Custom GPT Prompts Prompting prompt-engineering	55	14796	March 12, 2025
Prompt Regression Testing - API Usage Prompting api , prompt-engineering	10	261	February 14, 2025
How do you measure prompt performance? API	4	4405	August 3, 2024
Managing prompts in production Prompting api , prompt , prompt-engineering	11	4229	January 22, 2025
Is an LLM which both generates and critiques its output a contradictory practice? Prompting gpt-4	3	147	November 23, 2024

LLM and Prompt Evaluation Frameworks

Related topics