Since this is an AMA for the API team, I wonder: do you have plans to release documentation examples of frontend microphone usage in different languages and frameworks, published as working examples in a GitHub repository?
I feel like that would help many of us build more reliable features and integrate faster.
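For illustration, here’s a minimal sketch of the kind of example I mean: capturing microphone audio in the browser with the standard getUserMedia / MediaRecorder Web APIs. The upload URL at the end is a placeholder for whatever backend would forward the audio to the API.

```typescript
// Minimal browser microphone capture sketch using only standard Web APIs.
// Where the recording is sent afterwards is an assumption; adapt to your backend.
async function recordMicrophone(durationMs: number): Promise<Blob> {
  // Ask the user for microphone access.
  const stream = await navigator.mediaDevices.getUserMedia({ audio: true });

  const recorder = new MediaRecorder(stream);
  const chunks: Blob[] = [];
  recorder.ondataavailable = (event) => chunks.push(event.data);

  // Record for the requested duration, then stop and release the microphone.
  recorder.start();
  await new Promise((resolve) => setTimeout(resolve, durationMs));
  recorder.stop();
  await new Promise((resolve) => (recorder.onstop = resolve));
  stream.getTracks().forEach((track) => track.stop());

  // A single Blob with the recorded audio (the codec depends on the browser).
  return new Blob(chunks, { type: recorder.mimeType });
}

// Usage: record five seconds and POST it to a placeholder backend route.
recordMicrophone(5000).then((audio) => {
  const body = new FormData();
  body.append("file", audio, "recording.webm");
  return fetch("/api/upload-audio", { method: "POST", body });
});
```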
We’re seeing a lot of developers building with the Assistants API, and we’re continuing to invest in the tools developers need to build agents. Next year, we plan to bring o1 to the Assistants API and will have more to share soon!
Ooh. I would love to automate building relaxing, tranquil looping background videos to go alongside my custom-made music.
I’d also like to generate N videos for a prompt and be able to approve them as future stock videos. Maybe even incorporate a vision model to rank them before they’re sent to me for approval.
Can’t wait for o1 with vision!
I really think ANY business can profit from this, as all of them deal with badly formatted, scanned-in, or hand-annotated PDFs.
Looking forward to replacing a pipeline of 20 LLM calls with a single call.
That leads me to my question:
with the release of better models (GPT-3 > GPT-4 > GPT-4o > o1), do you see a noticeable drop in overall tokens used via the API as people replace longer prompts or multiple inference calls with simpler prompts or fewer calls?
More generally, what are we as developers not doing as much as you think we should? What do you wish we did differently, or more or less of? We take constructive criticism too.
Nothing to share yet on Whisper v3 in the API. But for both audio understanding and TTS, do check out the new GPT-4o mini audio preview model. It’s got state-of-the-art speech understanding, and you can prompt the model directly to control how it hears and speaks! For example, give it a prompt like "Say the following in a somber tone, and make sure to pause your speech appropriately: "
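To make that concrete, here is a rough sketch with the Node SDK. The model snapshot, voice, and the sentence being read out are placeholders; substitute whatever you actually have access to.

```typescript
import OpenAI from "openai";
import { writeFileSync } from "node:fs";

const client = new OpenAI(); // reads OPENAI_API_KEY from the environment

async function main() {
  // Ask the audio preview model to speak with a specific tone.
  // Model snapshot, voice, and the spoken sentence are illustrative placeholders.
  const response = await client.chat.completions.create({
    model: "gpt-4o-mini-audio-preview",
    modalities: ["text", "audio"],
    audio: { voice: "alloy", format: "wav" },
    messages: [
      {
        role: "user",
        content:
          "Say the following in a somber tone, and make sure to pause your " +
          "speech appropriately: Thank you for everything. Goodbye for now.",
      },
    ],
  });

  // The spoken reply comes back base64-encoded alongside a text transcript.
  const audio = response.choices[0].message.audio;
  if (audio) {
    writeFileSync("reply.wav", Buffer.from(audio.data, "base64"));
  }
}

main();
```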
If you wish to discuss a response in detail, please create a new thread with a link to the reply in the body, to avoid filling up the AMA thread.
One more for the Assistants API: it would be really great to have the Realtime API able to interact with assistants. That would enable really cool, tailored interactive scenarios for users.
It’s something we care about! Giving the model more context and examples is a great way to get smarter responses. Nothing to announce just yet, but stay tuned in 2025!
I’m sorry if this question has been asked already, and I absolutely appreciate you all and the accomplishments you have contributed to so many communities and the world as a whole. However, I’m wondering: is o1 now available in the Playground under Chat? At this time it’s still only showing o1-preview.
Is an ephemeral API key subject to abuse? For example, could a bad actor scrape a website every minute to keep getting new ephemeral keys to use for their own purposes?
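For context, here is a minimal sketch of the minting pattern the question is about, with a naive per-IP rate limit as one possible guard. The endpoint path, model name, and limits are assumptions on my part, not an official mitigation.

```typescript
import { createServer } from "node:http";

// Naive in-memory per-IP limiter: at most one ephemeral key per minute.
// (A real deployment would also gate this route behind its own auth/session.)
const lastMint = new Map<string, number>();
const MINT_INTERVAL_MS = 60_000;

const server = createServer(async (req, res) => {
  if (req.method !== "POST" || req.url !== "/ephemeral-key") {
    res.writeHead(404).end();
    return;
  }

  const ip = req.socket.remoteAddress ?? "unknown";
  const now = Date.now();
  if (now - (lastMint.get(ip) ?? 0) < MINT_INTERVAL_MS) {
    res.writeHead(429).end("Slow down");
    return;
  }
  lastMint.set(ip, now);

  // Mint a short-lived client secret for the browser. Endpoint path and model
  // name are assumptions; check the Realtime API reference for the real values.
  const upstream = await fetch("https://api.openai.com/v1/realtime/sessions", {
    method: "POST",
    headers: {
      Authorization: `Bearer ${process.env.OPENAI_API_KEY}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify({ model: "gpt-4o-realtime-preview", voice: "verse" }),
  });

  res.writeHead(upstream.status, { "Content-Type": "application/json" });
  res.end(await upstream.text());
});

server.listen(3000);
```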
I’ve got an application we’ve developed on the Assistants API, primarily because of how easy it makes the knowledge base and file uploads. Would it be possible to replicate that functionality with the new function calling feature, so we can use the newest features like fine-tuning and realtime voice on that endpoint while still keeping the ability to do RAG for context?
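For what it’s worth, this is the rough shape I have in mind: exposing our own retrieval step as a function tool on Chat Completions. The search_knowledge_base tool and the searchMyKnowledgeBase helper are hypothetical, standing in for whatever vector store or file index we end up using.

```typescript
import OpenAI from "openai";

const client = new OpenAI();

// Hypothetical helper that queries your own vector store / search index.
async function searchMyKnowledgeBase(query: string): Promise<string> {
  // ...call pgvector, Pinecone, a file index, etc. and return the top passages.
  return `Relevant passages for "${query}" would go here.`;
}

async function answerWithRag(question: string): Promise<string> {
  const messages: OpenAI.Chat.Completions.ChatCompletionMessageParam[] = [
    { role: "user", content: question },
  ];

  // Describe the retrieval step as a function tool so the model can request it.
  const first = await client.chat.completions.create({
    model: "gpt-4o",
    messages,
    tools: [
      {
        type: "function",
        function: {
          name: "search_knowledge_base",
          description: "Search the uploaded documents for relevant passages.",
          parameters: {
            type: "object",
            properties: { query: { type: "string" } },
            required: ["query"],
          },
        },
      },
    ],
  });

  const reply = first.choices[0].message;
  messages.push(reply);

  // If the model asked for retrieval, run it and feed the results back in.
  for (const call of reply.tool_calls ?? []) {
    const { query } = JSON.parse(call.function.arguments);
    messages.push({
      role: "tool",
      tool_call_id: call.id,
      content: await searchMyKnowledgeBase(query),
    });
  }

  const second = await client.chat.completions.create({ model: "gpt-4o", messages });
  return second.choices[0].message.content ?? "";
}
```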
It’s definitely worth retrying! Both GPT-4o and GPT-4o mini have improved meaningfully in multilingual understanding with the latest snapshots. We still use the same Whisper model to transcribe what the user said, but GPT-4o processes the audio directly and responds (without going through a transcription). Would love to hear what you find!
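If it helps while retesting: the same direct audio understanding is also exposed through the audio preview model on Chat Completions, so here is a minimal sketch of passing the user’s recording straight to the model. The model snapshot, voice, and file name are placeholders.

```typescript
import OpenAI from "openai";
import { readFileSync } from "node:fs";

const client = new OpenAI();

async function main() {
  // Read a local recording and send it to the model as base64 audio input.
  // "question.wav" and the model snapshot are illustrative placeholders.
  const base64Audio = readFileSync("question.wav").toString("base64");

  const response = await client.chat.completions.create({
    model: "gpt-4o-audio-preview",
    modalities: ["text", "audio"],
    audio: { voice: "alloy", format: "wav" },
    messages: [
      {
        role: "user",
        content: [
          { type: "text", text: "Answer the question in this recording, in the same language." },
          { type: "input_audio", input_audio: { data: base64Audio, format: "wav" } },
        ],
      },
    ],
  });

  // Text transcript of the spoken reply (the audio itself is in audio.data).
  console.log(response.choices[0].message.audio?.transcript);
}

main();
```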
Interesting question, and not one I've encountered before! It really depends on your schema and what you're looking for. You may want to consider a different schema for your second response, which has something like a "continuation" key to make it clear to the model that it is supposed to add to its earlier response.
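As a hedged sketch of what that could look like with Structured Outputs (the schema name and keys are made up for illustration):

```typescript
import OpenAI from "openai";

const client = new OpenAI();

// Illustrative second-turn schema: a "continuation" key signals to the model
// that it should extend its earlier answer rather than restate it.
const continuationSchema = {
  name: "continued_answer",
  strict: true,
  schema: {
    type: "object",
    properties: {
      continuation: {
        type: "string",
        description: "Additional content that picks up where the previous response ended.",
      },
      is_complete: {
        type: "boolean",
        description: "True once nothing further needs to be added.",
      },
    },
    required: ["continuation", "is_complete"],
    additionalProperties: false,
  },
};

// previousJson is the model's earlier structured response, passed back verbatim.
async function continueAnswer(previousJson: string) {
  const response = await client.chat.completions.create({
    model: "gpt-4o",
    messages: [
      { role: "assistant", content: previousJson },
      { role: "user", content: "Continue from where you left off." },
    ],
    response_format: { type: "json_schema", json_schema: continuationSchema },
  });
  return JSON.parse(response.choices[0].message.content ?? "{}");
}
```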
When can we expect the voices from Advanced Voice Mode to be made available in the API? Right now they are two different sets of voices. And when will you make tools available to control the voice’s emotion, tone, intonation, etc.?