Mini is, well, mini.
Faster, cheaper, but also fewer neurons.
I wouldn’t go for a weaker model if reliability is important. Depending on your exact use case, it might be possible to tweak the prompt to get a similar result, but I think what you discovered is generally true:
you can trade inference cost for engineering cost. decreasing one will probably increase the other.