Structured Outputs not reliable with GPT-4o-mini and GPT-4o

Mini is, well, mini.

Faster, cheaper, but also fewer neurons.

I wouldn’t go for a weaker model if reliability is important. Depending on your exact use case, it might be possible to tweak the prompt to get a similar result, but I think what you discovered is generally true:

you can trade inference cost for engineering cost. decreasing one will probably increase the other.

7 Likes