I cant include links here but wangzjeff on twitter posted it yesterday.
Is this accurate? If not, does OpenAI intend to release benchmarks as it has for the other models?