Not necessarily; it’s not primarily down to you or the model.
For an explanation, please take a look at the screenshots:
Can you see which images received more likes?
Keyword: ‘Reinforcement learning from human feedback’ (RLHF). I’m currently in the process of posting an analysis of this.
