On this topic, according to the simple evals OpenAI released, GPT-4o currently surpasses anthropic opus on the human eval bencmark: 91% to 85%.
Related topics
Topic | Replies | Views | Activity | |
---|---|---|---|---|
GPT-4-Turbo models perform better the older GPT-4 models in LMSys benchmark | 14 | 6560 | May 13, 2024 | |
Which AI is best for Python coding? | 6 | 14394 | February 27, 2025 | |
Thoughts on GPT-3.5-Turbo vs. Claude 3 Haiku | 4 | 10043 | April 13, 2024 | |
GPT-4o vs. gpt-4-turbo-2024-04-09, gpt-4o loses | 38 | 14864 | June 11, 2024 | |
Testing New GPT-4o vs Top 5 AI | 0 | 3006 | May 14, 2024 |