Prompt Evaluations at Scale for Production

raivat1 · March 24, 2024, 12:18pm

Hi everyone,

I wonder how everyone is comparing their prompts when building for production. For example, say you make a small change to your prompt and wanna see how it behaves overall (So need to test this n times). Except for writing a script, what are other alternatives? I’ve heard of solutions like LangSmith but I’m not sure how useful these tools are and how widely they’re used.

Thanks,
Raivat

parth4 · August 4, 2024, 12:01pm

Hey @raivat1, we are building getmaxim.ai , its experimentation suite addresses all your prompt engineering needs, helping you rapidly and systematically iterate on prompts.

You can:

Test, iterate, manage, and version prompts. You can organize prompts in folders and sub-folders and attach tags to them. You can also version your prompts with custom description allowing you to easily track changes and compare across versions.
Run side-by-side bulk experiments on playground different permutations and combinations of prompts and models to identify the right prompt-model combination for your use case.
Run tests on large test suites and simplify decision-making by comparing output quality, cost, and latency across different combinations of prompts, model, and model parameters
Deploy prompts with different deployment variables and experimentation strategies without any code changes, enabling teams to seamlessly execute prompt A/B testing

Topic		Replies	Views
Managing prompts in production Prompting api , prompt , prompt-engineering	11	4144	January 22, 2025
Seeking Prompt Evaluation Tool with GPT-4-1106-Vision-Preview Support Prompting gpt-4 , prompt , help-needed	0	833	February 5, 2024
Tools for Testing Custom GPT Prompts Prompting prompt-engineering	55	14570	March 12, 2025
Online tool available for writing effective prompts Prompting api	5	641	January 27, 2025
Do you use ChatGPT in your product? Community prompt-engineering , tools	2	681	July 16, 2024

Prompt Evaluations at Scale for Production

Related topics