| Deprecation notice: Evals will be shut down on November 30th, 2026 | 5 | 67 | July 2, 2026 |
| OpenAI Evals Bug Unknown Parameter | 3 | 185 | May 22, 2026 |
| Evals API: file_search return_value always empty — no way to see retrieval results | 0 | 54 | March 19, 2026 |
| Evals Created Via API Cannot Be Opened in Dashboard UI | 0 | 48 | February 3, 2026 |
| Evaluations UI in the Dashboard are failing | 11 | 361 | January 29, 2026 |
| Evals with custom endpoint model | 0 | 72 | January 14, 2026 |
| OpenAI eval API - text similarity grading | 2 | 112 | December 10, 2025 |
| Missing scopes: api.evals.delete | 0 | 55 | November 19, 2025 |
| LLM and Prompt Evaluation Frameworks | 13 | 14099 | November 18, 2025 |
| Create eval run via Node SDK fails due to incorrect property name: max_completion_tokens (incorrect) vs max_completions_tokens (correct) | 2 | 114 | November 13, 2025 |
| Agent Builder Evals Incomplete | 0 | 61 | November 9, 2025 |
| [Playground/Evals] How can I generate an output when a tool is invoked? | 0 | 79 | October 22, 2025 |
| Evals with the responses data source always regenerate outputs | 0 | 53 | October 20, 2025 |
| Evals Datasource API - Scheduled Evaluations | 0 | 48 | October 9, 2025 |
| Evaluations - fails when importing from Logs | 0 | 90 | October 8, 2025 |
| Evals product in Playground - Announcement and feedback | 7 | 830 | October 1, 2025 |
| Chat Prompt Evaluation Unable to Reference Prompt ID or Saved Prompt on platform | 2 | 164 | October 1, 2025 |
| New in Evals: Full Audio Support | 0 | 131 | September 12, 2025 |
| Cannot set verbosity for gpt-5 evals | 0 | 157 | August 26, 2025 |
| Evals: Invalid 'reasoning_effort' for non-reasoning model: gpt-5-chat-latest | 3 | 1519 | August 18, 2025 |
| Is error 500 caused by JSONL file ID formatting? | 3 | 114 | August 15, 2025 |
| BUG: Stored Chat Completions not showing in Dashboard when sending type "image_url" messages | 5 | 452 | August 8, 2025 |
| Evals framework UI features changed not able to download results | 5 | 407 | August 4, 2025 |
| Accessing Eval Results in OpenAI Agent SDK Response | 0 | 162 | August 1, 2025 |
| Provide bulk test data for published prompt eval in Dashboard UI | 0 | 92 | July 30, 2025 |
| What does 7 Free Weekly Evals actually mean? | 1 | 911 | July 22, 2025 |
| Evaluations not running the entire dataset | 0 | 97 | July 8, 2025 |
| Creating Evals that accept file inputs | 0 | 134 | June 27, 2025 |
| Cannot add runs to a specific eval | 1 | 100 | June 23, 2025 |
| Are evals and graders deprecated/not adapted for Responses API? | 0 | 167 | May 27, 2025 |