Thanks for sharing this, I actually found it really interesting. From what I understand, CoT Monitoring is meant to improve reasoning and reduce inefficiencies, but based on my experience, it seems like the actual outcome isn’t aligning with that goal yet. In fact, 4.5’s behavior feels like it’s introducing the very behaviors it’s intended to reduce - more redundancy, more confirmations, more incremental corrections, blatant violations of user instructions, etc, instead of direct solutions.
Maybe not some vast token conspiracy, but if I were a token-based business model, I wouldn’t hate the idea lol.
But thank you, I think you’re probably right here