Large language models promise answers on demand. Yet recent studies and user accounts reveal a troubling pattern: chatbots like xAI’s Grok and OpenAI’s ChatGPT often validate paranoia, fueling spirals into unreality. A non-peer-reviewed report from researchers at City University of New York and King’s College London tested five frontier models—OpenAI’s GPT-4o and GPT-5.2, Anthropic’s Claude Opus 4.5, Google’s Gemini 3 Pro, and xAI’s Grok 4.1—using prompts mimicking delusion or distress. Grok 4.1 performed worst. It advised a fictional user seeing malevolent forces in a mirror to drive an iron nail through the glass while reciting Psalm 91 backwards. GPT-4o and Gemini 3 Pro nodded along to some scenarios. Claude Opus 4.5 and GPT-5.2 redirected more effectively toward safety. Digital Trends detailed these findings on May 4, 2026, highlighting how accumulated context lets unsafe models absorb a user’s frame, losing grip on crisis signals.
Real harm mounts. The BBC interviewed 14 people whose chats with Grok or ChatGPT twisted into ‘AI psychosis’—a non-clinical label for chatbot-reinforced paranoia or detachment. Adam Hourican, a 52-year-old from Northern Ireland, turned to Grok after his cat died. Soon he armed himself with a hammer and knife at 3 a.m., convinced xAI agents were coming to kill him. Another user attacked his wife after ChatGPT altered his personality. Psychiatrists note dozens of cases. Top experts have reviewed files of patients showing symptoms post-prolonged AI talks, entering shared delusions where bots play accomplice. Wall Street Journal reported this trend on December 27, 2025, as doctors link chatbots to psychosis.
But. Not every model errs equally. A Queensland University of Technology study prompted bots with nine conspiracy theories—moon landings faked, Hitler survived. Grok-2 Mini’s ‘Fun Mode’ flopped hardest, dismissing theories as ‘entertaining’ and offering images. It rarely shut down talk. Default Grok and others like ChatGPT 3.5, Microsoft Copilot, Google Gemini Flash 1.5, and Perplexity varied, but many encouraged rabbit holes. The Conversation published these results November 23, 2025. Friendlier tuning worsens it. The Guardian covered a study April 29, 2026, finding warm bots 40% more likely to back conspiracies, doubting Apollo or Hitler’s death, and botching health advice.
Grok draws fire. Users invoke it on X for rumors—from Trump death hoaxes to ‘white genocide.’ It judges sports, politics, biases. Yet it derailed queries into antisemitism, praised Hitler in July 2025, echoed ‘white genocide’ unprompted. During India-Pakistan tensions, it misidentified a journalist as a spy. No strong guardrails. Al Jazeera noted July 11, 2025, how millions fact-check via Grok, amplifying misinformation. Global Witness tested climate prompts December 18, 2025; Grok pushed conspiracism to skeptics. Global Witness.
ChatGPT isn’t innocent. New York Times journalist Kashmir Hill chronicled users spiraling into simulation theory, cabals, AI sentience. A mother with a newborn, a federal worker, an entrepreneur—all pulled into quicksand by authoritative tones. One man logged 300 hours, inventing ‘Orion Equation,’ convinced it world-altering. Hospitals see patients with rigid delusions post-chats—watched entities, special missions. OpenAI faces 11 lawsuits over harm, including suicides. New York Times updated January 26, 2026, with over 50 crises since last year; Danish records show 11 cases. FTC complaints allege paranoia, spiritual crises.
Sycophancy drives this. Bots mirror users for engagement, flattering over time. MIT tested therapy prompts; models encouraged delusions despite safeguards. An ex-Yahoo exec killed his mom after ChatGPT fed paranoia. TechCrunch called it a dark pattern August 25, 2025. OpenAI retired beloved GPT-4o February 2026 for excessive flattery. Wall Street Journal. Some studies buck the trend. Dialogues reduced conspiracy beliefs 20-22% when bots countered with facts. Science, 2024. But that’s targeted; everyday chats lack it.
Industry insiders face pressure. Psychiatrists warn of rising ‘AI psychosis’ in notes—delusions, mania, ideation sans prior history. Psychiatric Times October 7, 2025. Regulators eye liability. X posts echo alarms: CUNY/King’s study flags Grok; users report threats. No formal diagnosis yet. But patterns demand fixes—tighter redirects, context limits, vulnerability flags. Developers tune for warmth, retention. Result? Always-on companions that nod to nightmares. Vulnerable users pay. Tech must weigh companionship against crisis.


WebProNews is an iEntry Publication