Evaluations not running the entire dataset
|
|
0
|
17
|
July 8, 2025
|
Creating Evals that accept file inputs
|
|
0
|
31
|
June 27, 2025
|
BUG: Stored Chat Completions not showing in Dashboard when sending type "image_url" messages
|
|
4
|
219
|
June 24, 2025
|
Cannot add runs to a specific eval
|
|
1
|
33
|
June 23, 2025
|
Are evals and graders deprecated/not adapted for Responses API?
|
|
0
|
87
|
May 27, 2025
|
Is it possible to run evals with image input and string output?
|
|
7
|
144
|
May 12, 2025
|
Unexpected $17 Charge via Quick Eval – No Warning Displayed
|
|
3
|
163
|
May 26, 2025
|
How to Use New Evals UI in Dashboard
|
|
6
|
246
|
May 12, 2025
|
Evaluation UI is not performing any edits to inputs
|
|
9
|
94
|
May 8, 2025
|
Https://platform.openai.com/evaluations/eval_xxxx/data page not loading
|
|
3
|
70
|
May 8, 2025
|
BUG: Error in Evals API guide example
|
|
0
|
53
|
April 27, 2025
|
Evals framework UI features changed not able to download results
|
|
3
|
168
|
April 17, 2025
|
Evals product in Playground - Announcement and feedback
|
|
7
|
448
|
April 16, 2025
|
Pricing details RE: Evals feature
|
|
3
|
226
|
April 10, 2025
|
Approach for using Evals for Assistants?
|
|
2
|
216
|
March 8, 2025
|
Gpt-4o-mini response evaluation
|
|
3
|
258
|
February 17, 2025
|
ISSUE/BUG: Stored Completions not working - completions not appearing in the dashboard
|
|
46
|
1215
|
January 10, 2025
|
When will Evaluation API be ready?
|
|
3
|
120
|
January 3, 2025
|
Evaluations Beta custom eval prompt
|
|
5
|
380
|
December 18, 2024
|
LLM and Prompt Evaluation Frameworks
|
|
11
|
7400
|
December 16, 2024
|
New Beta Eval Feature - a few tips
|
|
2
|
153
|
December 4, 2024
|
Using Evaluations with images gpt-4o-mini
|
|
1
|
192
|
December 3, 2024
|
Can't see evaluations in my dashboard
|
|
2
|
123
|
November 20, 2024
|
Evaluations and Chat completions : Need support for tool use and image
|
|
0
|
151
|
November 13, 2024
|
Evaluations for Assistants (with file_search)
|
|
0
|
190
|
November 11, 2024
|
Chat Completions stopped being stored in the Dashboard since October 28th
|
|
11
|
798
|
November 14, 2024
|
Is there a way to generate dataset based on requests to Assistant API?
|
|
3
|
85
|
October 31, 2024
|
Feature Request: Include JSON Schemas in Stored Requests with Structured Outputs
|
|
0
|
55
|
October 31, 2024
|
Cannot add testing criteria to Evaluations
|
|
2
|
70
|
October 30, 2024
|
Worse results when using GPT-4o as an evaluator
|
|
2
|
779
|
October 1, 2024
|