Fine-tune the GPT-4.1 family using direct preference optimization

You can now fine-tune the GPT-4.1 family using direct preference optimization.

https://x.com/openaidevs/status/1932858051876565475?s=46

https://platform.openai.com/docs/guides/direct-preference-optimization
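For anyone who wants to try it right away, here's a minimal sketch of starting a DPO job through the fine-tuning API, following the guide linked above. The file ID is a placeholder, and the model snapshot name and beta value are assumptions; check the guide for current options.

```python
from openai import OpenAI

client = OpenAI()

# Start a DPO fine-tuning job on a GPT-4.1 snapshot.
# "file-abc123" is a placeholder for an uploaded preference dataset,
# and the snapshot name and beta value are illustrative assumptions.
job = client.fine_tuning.jobs.create(
    training_file="file-abc123",
    model="gpt-4.1-2025-04-14",
    method={
        "type": "dpo",
        "dpo": {"hyperparameters": {"beta": 0.1}},
    },
)
print(job.id, job.status)
```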


That’s a cool one; I immediately see a lot of applications. Thanks.


I saw the news as soon as it came out and immediately got to work. Even with just 30 examples and default hyperparameters, I’m already seeing (small) improvements to my SFT model. Specifically, I was able to get it to take a less negative tone and avoid instances of runaway gibberish. I think DPO is a great way to iron out the kinks you get from an SFT run.
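In case it helps anyone reproduce this, each training example is one preference pair in the JSONL format from the guide above: a prompt plus a preferred and a non-preferred completion. The snippet below just writes one record; the prompt and both completions are invented for illustration.

```python
import json

# One DPO preference pair: training pushes the model toward the preferred
# completion and away from the non-preferred one. All content here is
# made up for illustration.
example = {
    "input": {
        "messages": [
            {"role": "user", "content": "Summarize this support ticket in one sentence."}
        ]
    },
    "preferred_output": [
        {"role": "assistant", "content": "The customer cannot log in after updating to version 2.3."}
    ],
    "non_preferred_output": [
        {"role": "assistant", "content": "Ugh, another login complaint. They probably typed the wrong password."}
    ],
}

# Append the record to the training file (one JSON object per line).
with open("dpo_train.jsonl", "a") as f:
    f.write(json.dumps(example) + "\n")
```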