Japanese yes and no: Confusion in GPT-4o

Thank you for your reply!

Your speculation that GPT-4o is a large model seems plausible to me.

By the way, an OpenAI eval related to Japanese affirmation (yes/no answers) actually exists.

This screenshot shows the results of running an eval on Japanese negative-question data with chatgpt-4o-latest.

The accuracy is 0.29. For a binary classification task, a score below 0.5 is worse than random guessing on balanced data, so the model appears to be answering systematically in the wrong direction. This may be a result of being penalized by RLHF.

Your point that susceptibility to broad RLHF may vary with model size might be correct.

When I ran the same test on GPT-4o-mini, the accuracy was 0.19.

An accuracy of 0.19 on a binary classification task means that simply inverting every answer would give a high score: 1 - 0.19 = 0.81.
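
To make that arithmetic concrete, here is a minimal sketch in Python. The data is made up for illustration (100 hypothetical yes/no items, 19 answered correctly) and is not the actual eval set; the `accuracy` and `flip` helpers are my own names, not part of OpenAI Evals. It also assumes every item has exactly two possible answers and the model always gives one of them.

```python
def accuracy(predictions, labels):
    """Fraction of predictions that match the labels."""
    if len(predictions) != len(labels):
        raise ValueError("predictions and labels must have the same length")
    correct = sum(p == y for p, y in zip(predictions, labels))
    return correct / len(labels)


def flip(predictions):
    """Invert every binary (True/False) prediction."""
    return [not p for p in predictions]


# Hypothetical toy data: True = "yes", False = "no".
labels = [True] * 50 + [False] * 50
# 10 + 9 = 19 of these predictions agree with the labels.
predictions = [True] * 10 + [False] * 40 + [True] * 41 + [False] * 9

original = accuracy(predictions, labels)
flipped = accuracy(flip(predictions), labels)

print(f"original accuracy: {original:.2f}")
print(f"flipped accuracy:  {flipped:.2f}")
```

Because every wrong binary answer becomes right when inverted (and vice versa), the flipped accuracy is always 1 minus the original. If the eval allowed a third outcome, such as an unparseable reply counted as wrong, this relationship would no longer hold exactly.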
