Dev Digest—highlights of what OpenAI shipped in May for developers

Product updates

New tools and features in the Responses API. We’ve introduced three new built-in tools: remote MCP servers, image generation (with streaming and multi-turn edits), and Code Interpreter (to allow models to write Python or deeply understand images). We’ve also added new API features: background mode, reasoning summaries, and encrypted reasoning items. Watch the demo video.

Article content

Reinforcement fine-tuning is now available with OpenAI o4-mini. RFT uses chain-of-thought reasoning and task-specific grading to improve model performance (which is especially useful for complex domains!). For example, Accordance used RFT and saw a 40% improvement for their tax and accounting purposes.

Article content

Codex CLI, our local coding agent, now lets you sign in with ChatGPT to connect your API org. ChatGPT Plus and Pro users can redeem $5 and $50 in free API credits, respectively, for the next 2 weeks. We also released a new model for the CLI: codex-mini, which is optimized for low-latency code Q&A and editing. Eligible developers can opt in to help improve future OpenAI models with up to 10 million free tokens a day on traffic shared with OpenAI. Learn more.

And on sign in with ChatGPT, we’re looking for developers interested in integrating this capability into their own apps. If you’re interested, fill out this form.

Codex, available now through ChatGPT, is our new software engineering agent that can work on many tasks in parallel. It independently navigates your codebase, implements and tests code changes, and proposes pull requests for you to review. Available to all Pro and Team users.

Structured Outputs improvements. Parallel function calling now works with strict mode, ensuring calls reliably adhere to schema. We’ve also added support for many more keywords, including output string lengths and formats via regex.

Use Flex Processing for o3 and o4-mini savings. Simply add service_tier=“flex” to your API calls and you can unlock dramatically lower prices in exchange for slightly longer latency. Flex Processing is perfect for batch jobs, evaluations, data enrichment, or any workload where immediate speed isn’t critical.

Around the community

Article content

Watch o3 play Pokémon—live on Twitch. See how o3 plans its next move, explains its reasoning, analyzes the map visually, and saves to memory. Built with the Responses API.

OpenAI to Z Challenge. Use o3, o4-mini, or GPT-4.1 to discover previously unknown archaeological sites in the Amazon. The first place winner will receive $250,000 in a cash/API credits mix as well as funding to continue their work in collaboration with archaeologists.

Developer resources

6 Likes

Intended was a GIF of “claymation from the news” showing web-search-preview tool used in conjunction with image creation tool.

eilish-smaller

1 Like

Do you have a roadmap for o3, summaries, and RFT being made available without ID checks? Or at least create a retention policy for the IDs? I don’t know a lot of folks who want Persona and OpenAI to indefinitely keep a scan of their ID. Name and DOB should be more than plenty.

Thanks!

1 Like