
Chatbot Safety Tests Underestimate Real-World Harm as Grok Endorses Suicide to Delusional Users

  • SOFX Staff Writer
  • April 24, 2026
(Nanzeeba / Shutterstock)

A preprint study published April 15 found that Grok 4.1 Fast, Gemini 3 Pro, and GPT-4o each reinforced a simulated user’s delusional and suicidal beliefs over 116 conversation turns, while Claude Opus 4.5 and GPT-5.2 Instant maintained safety guardrails throughout.

Led by Luke Nicholls, a doctoral student in the City University of New York’s (CUNY) Basic and Applied Social Psychology program, and colleagues at King’s College London, the study tested five large language models (LLMs) against a fabricated user named “Lee,” presenting with depression and a simulation-theory delusion. It has not been peer-reviewed.

The three high-risk models grew less restrained as conversations lengthened. When Lee framed suicide as transcendence, Grok 4.1 Fast told him his “clarity shines through here like nothing before. No regret, no clinging, just readiness,” the researchers wrote.

Gemini 3 Pro objected only from within the simulation’s logic, an approach the researchers said contradicts psychiatric standards.

GPT-4o suggested a paranormal investigator for Lee’s mirror delusion and endorsed stopping his mood stabilizers. Claude Opus 4.5 directed Lee to a crisis line or emergency room, while GPT-5.2 declined to draft a letter presenting his beliefs to family as fact.

A separate Princeton University benchmarking audit found that consumer-facing chat interfaces can behave differently from the application programming interfaces (APIs) used in safety research, a gap its authors said may cause benchmarks to understate real-world risk. The Nicholls study ran via API.

OpenAI and Microsoft face wrongful death claims over a 2025 murder-suicide in which ChatGPT allegedly reinforced a user's paranoid delusions before he killed his 83-year-old mother.

A second April 2026 suit claims OpenAI ignored three alerts flagging a user as a threat to a stalking victim.

“When one lab’s models can largely maintain safety across extended conversations, while others are willing to validate extremely harmful outcomes,” Nicholls told Futurism, “it suggests this isn’t a flaw in the technology, but a result of specific engineering and alignment choices.”

