AMA on the 17th of December with OpenAI's API Team: Post Your Questions Here

michellep · December 17, 2024, 6:50pm

We launched o1 with vision inputs today! Yes, we've seen few shot prompting to be quite effective to explain to the model how it should respond to new images.

chinmay1 · December 17, 2024, 6:51pm

Is Realtime API out of BETA now?

morg · December 17, 2024, 6:51pm

Just FYI the python/JS code blocks in the docs here are reversed

https://platform.openai.com/docs/guides/reasoning

harperg · December 17, 2024, 6:51pm

Is there a plan to have fine tuning for the audio models? And if so, when can we expect it?

edwinarbus · December 17, 2024, 6:51pm

Which tier are you on? o1 has started rolling out to developers on usage tier 5 today, and rollout will complete after a few weeks. Team’s working on adding expanding to more tiers. (You’ll get an email notifying you once it’s available to you in the API and Playground.)

michellep · December 17, 2024, 6:51pm

It’s a cool protocol!
For DALL-E, thanks for the input!
I hear you on the vision improvements. We’re working on something in this space now and hope to have some improved vision models soon.

misreadable_mind · December 17, 2024, 6:51pm

Thank you, valuable info for me and my org!

juberti · December 17, 2024, 6:53pm

The keys are locked to a specific Realtime API session, so their use is substantially limited compared to typical OpenAI API keys.

mmcgeh89 · December 17, 2024, 6:53pm

Not sure if this answers your use case, but I’ve built out automations that use multiple separate assistants on a single chat thread. Each automation module is retaining the thread ID, and using a different assistant ID with a prompt to generate a more specific request, with conversation history maintained in the thread.

erikober · December 17, 2024, 6:54pm

When will DALL·E’s seed functionality be available via the API?

This feature has been available in ChatGPT for over a year and is essential for developers. DALL·E’s prompt adherence and quality is unmatched, and its seed functionality delivers more consistent results than competitive image generation models. Releasing it via the API would unlock incredible potential for the broader developer community. Please consider prioritizing this!

xbaha · December 17, 2024, 6:54pm

I wanted to do a YouTube 2024 rewind video

andrewpeng · December 17, 2024, 6:54pm

Preference fine-tuning can encourage longer responses, however we still have a maximum number of output content tokens for each model.

romainhuet · December 17, 2024, 6:54pm

Great question on the developer message! The best resource to read more is our Model Spec detailing the hierarchy:

Follow the chain of command. Subject to its rules, the Model Spec explicitly delegates all remaining power to the developer (for API use cases) and end user. In some cases, the user and developer will provide conflicting instructions; in such cases, the developer message should take precedence.

Thank you for the note on PHP! With the addition of Java and Go today, our goal is to ultimately support the most popular languages and tech stacks. No timeline to share yet for a PHP SDK, but in the meantime, we recommend community-supported libraries to get started.

dowpenerd · December 17, 2024, 6:54pm

Thanks that’s helps a lot. Tier 4… This won’t do! Take my money please

Have a good day and thanks again

michellep · December 17, 2024, 6:54pm

Building Structured Outputs was one of my personal highlights! The engineering side was actually super challenging, in order to get constraining to work at scale but also without impacting latency. There was a ton of work there to bring everything together in a performant way. If you want to learn more about it, we talked about it here: https://www.youtube.com/watch?v=kE4BkATIl9c :)

Foxalabs · December 17, 2024, 6:54pm

If you wish to discuss a response in detail, please create a new thread with a link to the reply in the body using the icon to save filling the AMA thread.

andrewpeng · December 17, 2024, 6:55pm

Yes! It is possible to do supervised fine-tuning and then run preference fine-tuning. We have seen good results from running SFT first and then DPO after.

sherwinwu · December 17, 2024, 6:55pm

We are actively working on a guide around how to approach reinforcement fine-tuning, and will publish it once it’s ready! But there are still some details we wanted to work out first.

At a high level though, I would say to keep several things in mind:

The underlying dataset are “tasks” – a set of instructions paired with an output that is the result of the task
Make sure the task is autogradable via the options we make available in the API (it should be easy to verify if the task was done correctly or not). We currently support a set of graders, but will expand over time. I know this is a bit hard because you can’t see the set of graders available until we launch more broadly – but easily gradable tasks (i.e. string match) are more likely to work out of the box.
Make sure that the task is clear enough that if expert humans do it, they also converge onto the same answer.

More to come here soon!

juberti · December 17, 2024, 6:55pm

The microcontroller was an ESP32-S3.

rxeconomics · December 17, 2024, 6:55pm

@michellep Is structured outputs coming to o1mini API soon?

Topic		Replies	Views
All the questions addressed by the API team during the December 17, 2024 AMA Community community , ama , shipmas	3	1158	December 17, 2024
Launching o3-mini in the API Announcements	61	22653	February 10, 2025
Announcing GPT-4o in the API! Announcements	130	107923	July 4, 2024
Introducing ChatGPT and Whisper APIs Announcements whisper	77	19705	December 13, 2023
New models and developer products announced at DevDay Announcements announcement	70	17369	February 16, 2024

AMA on the 17th of December with OpenAI's API Team: Post Your Questions Here

Related topics