I am sorry man, but I am a paying user, not an analyst to help OpenAI troubleshoot their service and track metrics to improve service quality.
I understand what you are saying and the need for actual benchmarks, but if you can show me how these benchmarks can be created, I would be happy to make that happen, although I have stuff to do and no patience for this.
What I know is that about 2 months or 6 weeks ago this was a kick-ass product I could rely on. Today, it is unusable in the ways in which I was depending on it before. It’s unlikely I am imagining this, and I have dozens of threads as evidence. The only unseen data is what’s inside the OpenAI black box of spaghetti that makes all this happen. I don’t know what internally determines the change in behavior of ChatGPT. They do. And it would be nice for them to be more…OPEN about how their AI works or doesn’t work, especially to people that pay for it.