[DevDay] Predictions & Desires of API Features?

Hey everyone,

DevDay kicks off TOMORROW!! :tada:

Get the popcorn ready, the drinks filled to the rim. Last years DevDay was jampacked with great information, and I have no doubt this will be a worthwhile watch.


@jr.2509 shared a screenshot from an OpenAI employee hinting of some of “the most requested” API features being announced during the keynote & it got me excited.


So. I’m wondering.

What are your most desired API features?

Mine off the top of my head:

  • Tooling usage for o1
  • Advanced Voice Mode
  • Assistants no longer in beta
  • Web Browsing
10 Likes

Here are the features I expect to be most requested:

  • Better cost control and reporting in the platform dashboard
  • Internet access as a tool for assistants (and other endpoints)
  • Improved tool handling and control for assistants, especially for RAG
  • Access to o1 for everyone
  • I wouldn’t be surprised if TTS (text-to-speech) and STT (speech-to-text) become additions for assistants.

However, to be honest, I’m not sure what the most requested features for chat.completions would be.

5 Likes

Assistant enhancements are also top of my mind, in particular greater control and customization of file search (such as the ability to include and filter by meta data).

Greater control over o1 reasoning tokens would be great although that seems too early.

Like @RonaldGRuckus I am also betting on AVM via API.

2 Likes

Looking forward to o1 API access for non Level 5 users ^^

3 Likes

Sora?

Although AVM via API would be pretty sweet. But it just came out.

Web browsing via API, that would be cool too.

Probably something to control assistants more as well.

Oh, and GPT-5 !!! :rofl: (not happening)

3 Likes

+1 on just better cost control and reporting, preferably via both a dashboard and API. E.g. simple API access to get stats on utlization - number of input and output tokens, per organization and per API key, and for specified time period.

I also agree, the killer announcement would be Perplexity competitor - web search via API, with citations!

4 Likes

Prediction: A robot will come out and present everything flawlessly? Okay, maybe next year! Haha…

5 Likes

are there any web events or something for those who cannot attend?

1 Like

You’re looking at it! :wink:

Seriously, though, I don’t think there’s anything specifically planned for online. Last year we did a Bingo thing as you probably recall, but no live presentation put a damper on that…

Will be good to discuss with people as the videos drop, though, I think.

4 Likes

I think the robotics event is October 10th Paul… but I have to say, it would be pretty neat to see an update on figure 1

1 Like

My list:

  • Search bar on ChatGPT history (ok fine, not an API feature, maybe with assistants?)

  • o1 release?

  • I don’t think AVM or tts-2 will be out, it would certainly be a surprise

  • sora-preview would be nice

  • option to view how much a request would cost (I know there’s a way to do this, but an improvement on this would be nice) - specially on o-models (o1-mini, o1, o1-preview)

  • search the web would also be nice, but I’m not sure that is something they’ll want to add to the API due to competitors

  • improvements to the assistant pipeline (I don’t use it much, but I suspect there will be many updates on this)

  • lower the price of models again?

  • SearchGPT for everyone would make my day, I know some users were selected to test it during the same time as AVM, so maybe (ok fine, probably not an API thing, but it would certainly create a little bit of excitement)

  • More image generation functionalities via the API?

  • Memory? I mean, most of us are using some database for this

  • “auto-gpt” mode? would certainly make us all spend more tokens, but I feel like there would need to be some IDE plugin to work with it (replit and devin are banking on this, so I guess it would make a lot of devs make happy noises)

  • GPT generation via the API? If hosted on OpenAI servers to deliver zero friction, that would be amazing, but I don’t think the metrics on GPT is all that strong for this to be entertained

  • Improvements to abstract more away of the fine tuning process?

  • simple commands to the api to view models and organization/personal metrics? for example: openai models on the terminal (yes, I’m aware we can do this ourselves with a little bit of code and some aliases, but still, it would be cool to see something like this as a natural evolution of the API’s maturity)

  • parsing of prompts longer than token limitations? similar to what gpt does (the “continue” button) but native to the API?

  • lower the costs and increase the rate limit, everyone is always happy if this happens

  • no breaking changes (as in, no changing from completions to chatcompletions like we had with gpt-4)

  • streaming on where there is no output streaming

  • a toggle-able progress bar for when you send a request (right now, when you send a prompt it just hangs for a few seconds before just giving you back the output)

Honestly, the API has progressed so much, I’ve sort of got used to it since gpt-4 (similar to python2 to python3) that I feel like my list is “weak”.

Can’t wait! I’ll see you all there!

2 Likes

Web Browsing in GPT-4o and Assistant coming out of beta is my guess. But I wanna be surprised so hoping for much more.

I hope it’s SORA.

2 Likes

Here’s a combination of real expectations and dreams come true:

  1. o1-preview and o1-mini being made available to all paid API users (or maybe tier 2 and up).
  2. o1 model being further teased, and hopefully released??
  3. Multi-modal reasoning models
  4. Advanced Voice mode made available through the API, likely with very heavy restrictions and rate limits
  5. Web browsing via the API (SearchGPT?)
  6. A new image or video generator to meet the competitor’s offerings
  7. Something open-sourced maybe?
  8. Lots of strawberry references
2 Likes

I also wonder if something related to greater determinism is on the table, i.e. advancements to the seed parameter.

some further changes to batches might also be interesting, i.e. different buckets for turnaround times.

2 Likes