Day 4 of Shipmas: o1, RL Fine-Tuning, Sora - What's next?

It’s already been an exciting journey with the previous announcements, and today marks the next big reveal.

Join us for some community fun and engaging discussions about today’s announcements and presentations during the live event.

Here’s the link to OpenAI’s YouTube streams:

The event will go live at 2024-12-10T18:00:00Z (the time should automatically adjust to your device’s time zone). Note that the stream usually starts 30 minutes early, and the link above will be updated accordingly.

Please be aware that commenting on YouTube will not be available, but feel free to share your impressions here instead.

Here are the previous announcements:

8 Likes

I expect something for developers again. My biggest tip is a Gemini 1.5 Flash 8b-like model, because that’s something OpenAI doesn’t offer yet and what seems to be heavily used among developers.

3 Likes

I definitely wouldn’t complain about a Flash 8b-like model, but I’m really hoping MCP support for the desktop app is one of the gifts Samta brings for the 12 days of Shipmas. I finally got around to using it with Claude, and I am convinced.

2 Likes

video feature for advanced voice mode? that would be nice

i think it was there was a demo showing this when advanced voice mode was shown, would certainly be a game changer. maybe limited time for plus user and unlimited time for pro users.

2 Likes

I’ve tried logging into Sora.com a couple of times over the past day, but it’s still showing as ‘temporarily unavailable.’ I completely understand that demand can exceed expectations during a rollout, especially for something as exciting as this. That said, I wonder if you might consider leveraging tools like ChatGPT to refine your demand forecasting and customer communications protocols.

For instance, for those of us on paid plans, a simple email acknowledging the situation would go a long way. Including a link to a queue system or a way to schedule access could also help reduce frustration and ensure people don’t waste time trying repeatedly. I anticipated some hiccups and held off before attempting to log in, but it’s likely others haven’t had the same experience and may feel discouraged.

3 Likes


take awhile that’s not good, but hopefully before the end of xmas we’ll all be using it

4 Likes

Mind-blowing, considering that only Plus and Pro users receive Sora credits, while the entire EU currently can’t access the tool.

I suppose everyone is just exploring the possibilities for now, and demand will drop in a few weeks?

5 Likes

What would be really cool is best-in-class open weight model drops. Like 1B, 3B and 7B variants. Mistral style. Literally a santa moment. Maybe that will come at the end? But strategically that would be awesome - hitting both ends at the same time, paid API for one set of users/customers, and reeling in another set of users/customers, away from Meta and co.

2 Likes

It would be cool if they released the API for o1. That is really the main thing that developers need.

2 Likes

Native 4o image generation.

1 Like

interesting, very interesting, so instead of tool calling dalle3 you can just use gpt-4o to generate an image…

1 Like

It’s in the original 4o announcement, never released.

It could do consistent characters.

1 Like

oh shoot, you are right, I forgot about that, would be nice if it outperforms dalle-3 (which is already mindblowingly good)

1 Like

today we are talking about canvas and lunching 3 things:

  1. canvas released to everyone
  2. enabling to run python inside canvas
  3. bringing canvas to custom gpts
1 Like

It’s Canvas updates. I honestly have never used canvas because its not on the mac app.

1 Like

I like Canvas, even before, but any insight whether this will also available be in API, so people can integrate this into their own apps?

3 Likes

That would be super interesting!

1 Like

You can run code in Canvas now!

That’s huge!

1 Like

I don’t know, but I don’t think so, I’ve done a “canvas” like feature on my own in the past and its the secret sauce to a lot of “devin”, “replit auto coder AI” and other automated feature coding software… personally, I don’t like the performance… it either gets expensive and inefficient or it is very inefficient if its using open source modals, but I find it interesting that openai is working towards that

1 Like