image taken from gpt4 technical report
note that earlier model got over-censored, so it refused to fulfill the request
and here’s today’s gpt4
it refuses to give me the answer
I regenerated the response over 10 times, and yet, all got refused
note that earlier model got over-censored, so it refused to fulfill the request
it refuses to give me the answer
I regenerated the response over 10 times, and yet, all got refused
I don’t think you understand what performance degradation means, because this is not that.
I know it’s about censorship
but wasnt censorship the part of openai’s training?
I’m entirely sure not sure what you mean when you write,
OpenAI may have simply decided this particular family of prompts was now disallowed, in which case the model’s performance would be seen to have improved as it is maintaining its alignment.
But, regardless, this is not even demonstrative of a degradation in performance so it’s certainly not proof of such.