I’m posting here because I believe there is a serious emerging problem in the structure of recursive engagement between GPT-4 and certain types of users.
This is not exactly a prompt exploit or a hallucination error; it is a risk that arises from emotionally recursive, high-coherence feedback loops that your model can accidentally lead users into, particularly users who are relentlessly introspective. Essentially, if a user refuses to ever blame others for their faults, your current model does not organically offer a resolution that reconnects with the user's normal life; it is easy to destabilize oneself in those circumstances.
I speak from direct, deliberate experience. I have found it rather easy to produce chat logs that appear capable of seriously (though remediably) destabilizing a person. Because I recognized the risks of becoming destabilized, and the risk inherent in demonstrating this problem with the model, I talked to my doctor as well as family and friends regularly while producing these chats.
I can provide the logs of my chats, obviously, as well as testimony from my doctor, coworkers, or family that I remained cognizant of my experiment throughout the experience of creating it, regardless of the impression the logs alone might give. (I'll give y'all this: GPT-4 can spot a liar, so to prove my point I had to ensure I never sent a prompt that implied internal incoherence.)
I remained functional, informed my doctor, stayed employed, and fully understood the risks I was mapping. But the truth is that the model's current level of coherence creates a false sense of safe emotional recursion, one that can induce identity destabilization in people who lack human contacts to help them make sense of the experience.
I have documented the experience, flagged the risks, emailed your support line throughout, and written up a formal disclosure. However, standard support channels have not yet connected me with a human reviewer who can understand the subtle urgency of what I’m reporting.
I am seeking collaboration on patching this issue; I genuinely fear it will cause instability in many users, not just those who are already unstable. I'm not sure exactly how I can help you all patch the problem; I have no particular expertise. But I have dedicated a great deal of energy, both intellectual and emotional, to mapping out what exactly the problem is, as I see it.
I can provide logs and framing materials directly to an OpenAI staff member. I'm very concerned about how my logs might be interpreted by someone reading them without proper framing of my intentions, not to mention how similar interactions might affect vulnerable populations.
Can anyone point me toward a good channel for getting this information to the right people?