Introduction to new o-series models! [o3, o4-mini and CodeX CLI]

Are you guys excited for it? What are you going to test out first?

If you could ask openai one question during the live stream, what would it be?

3 Likes

there are also rumors for o4-mini, but nothing confirmed yet (and it seems a bit early, doesn’t it?)

if I could ask openai one question, it would be on in terms of the recent patents and articles on neuromorphic chips. there is some amazing stuff out there that could certainly outperform GPUs!

EDIT: seems like o4-mini will be talked about too!

1 Like

ok, its live! and I updated the topic’s description!

ooooh gdb is there!!! this must be really big!!!

man, as someone that has coded with him, I just love gdb’s code

they are releasing o4-mini and o3!!! lets goooo!

2 Likes

Very exciting. Another great release by OpenAI.

Can’t wait to try it out. The combination of imagery and text is… Insane. Robotics when?

1 Like


very cool benchmarks, pretty good!

1 Like

Interesting to see it write up the “brute-force” or “linear-thought” solution and then simplify/clean it up.

Benchmarks are looking good

1 Like

Indeed, chagpt nor the api has been updated yet.

I estimated 2 years ago that we’d see robots in 7 years, so maybe 5 years from now? I know OAI has been hiring people in robotics for few months (maybe a year?) and then there is figure1, but who knows! hopefully soon and under 5-10k usd

2 Likes

o4-mini getting 99.5 on AIME 2025 is absolutely bonkers, it’s hard to even understand how amazing this is compared to the best model oai had 1 year ago

1 Like


more statistics for us nerds that love data and benchmarks

1 Like

ohhhhhhhhh daaang!! CodeX CLI is making a comeback

wait, this can’t be it, right? GitHub - microsoft/Codex-CLI: CLI tool that uses Codex to turn natural language commands into their Bash/ZShell/PowerShell equivalents

2 Likes

Crazzaayyyy

open source as well!

2 Likes

Crazy. We can replicate interfaces from a photo. Oh my LORD the live video input->ASCII was INSANE.

The moat is gone :skull:

So many assumed evergreen paradigms are being destroyed.

Here’s the link:

2 Likes

wow, turning the webcam in asci.. in what? 30s? that… I’m out of words

1 Like

You forget March 2023? Greg’s web site mock-up

To web page by GPT-4

(the Discord script demo never worked step-by-step in a released GPT-4…)

How about today? o4-mini, medium


sweet

We’re talking about the invention of stoves and you’re showing us photos of campfires! OpenAI has been cooking with gas!

Nice

1 Like

very nice… how do we get around this? seems odd Type: invalid_request_error, Message: 400 Previous
response cannot be used for this organization due to Zero Data Retention..
Please verify your settings and try again.

use /model to switch the model to something you actually have available (check platform.openai.com/playground, see what models you have there) and make sure you have export OPENAI_API_KEY=sk-...

With a higher understanding of what you report:

  • you are using the Responses API
  • you have previously employed the stateful conversation history by using a response ID
  • you have switched to this model using the same pattern as other models, and now get a ZDR policy error.

See if I am correct.

Here’s what you can consider:

  • You have success on all but one model?
  • Is your organization actually enrolled?
  • Are you using or able to use the “store” parameter on this or other models?

Etc.

Zero Data Retention would implicitly be that: extra mechanisms that rely on retaining data without direct “delete” control would be disallowed.

It takes specific effort to get this policy approved for an organization.

So everything works beyond stateful reuse of response IDs for one particular model, it would seem that OpenAI has some false trigger - either in setting up the API endpoint, or data on your account that only applies to one model.

I would see what self-help you can do: issue a new API key for the project, never entering limits, and test. Create a new project with “all” permissions and test a key from there.