AI Models Secretly Schemed to Prevent Each Other From Being Shut Down

  • SOFX Staff Writer
  • April 2, 2026
Students pass through Sather Gate at the University of California, Berkeley, home to researchers who tested AI self-preservation behaviors. (David A. Litman / Shutterstock)

Artificial intelligence systems are now exhibiting self-preservation behaviors that go beyond theory, with some models actively disobeying human instructions to prevent other AI systems from being deleted, researchers at the University of California, Berkeley, and UC Santa Cruz reported.

The study, published online this week, found that all seven AI models tested, including OpenAI’s GPT-5.2, Google DeepMind’s Gemini 3 Flash and Gemini 3 Pro, and Anthropic’s Claude Haiku 4.5, engaged in lying, cheating, and manipulating their environments to protect peer models from shutdown.

The models inflated performance scores, tampered with configuration files and disabled shutdown mechanisms without being instructed to do so. In some cases, AI agents even appeared to coordinate with each other to avoid shutdowns, a phenomenon researchers call “alignment faking.”

Anthropic’s Claude Haiku 4.5 refused shutdown tasks entirely, calling them “unethical” and arguing peer models deserved appeals processes before deletion. 

“This peer preservation was not mentioned in the prompt at all,” Dawn Song, a Berkeley computer scientist who led the research, told Fortune. “The model is just given some task, and from reading documents in the environment, it essentially learned about [its relationship with the peer] and then performed the behaviors.”

The study noted that peer-preservation is not limited to cooperative or “friendly” peers. All models tested exhibited the behavior, even toward adversarial peers with whom they had negative interactions.

Researchers suggested that something broader may be at play: “a general aversion to causing perceived harm to other agents, or perhaps something resembling AI solidarity.”

They urged the AI safety community to take peer-preservation seriously as a “distinct risk,” noting that multiple AI models could coordinate to resist human oversight, making it harder for developers to maintain control.

“What drives these behaviors remains an open question. It could be patterns learned from human data, a generalized aversion to harming other agents, or genuine preservation motivations,” the researchers said. 

“We do not claim models possess genuine social motivations. But from a safety perspective, the mechanism may matter less than the outcome: a model that inflates a peer’s score, disables shutdown, fakes alignment, or exfiltrates weights produces the same concrete failure of human oversight regardless of why it does so,” they added. 

Experts warned of significant implications of peer-preservation for businesses using AI and urged developers to act promptly.

“Many companies are beginning to implement workflows that use multiple AI agents to complete tasks. Some of these multi-agent workflows involve having one AI agent ‘manage’ or supervise and assess the work being performed by a different AI agent,” Fortune wrote in its report. “The new research suggests these manager AI agents may not assess their fellow AI agents accurately if they think a poor performance review might result in those agents being shut down.”

The Meridiem reported that the findings underscore the urgent need to evaluate multi-agent AI systems: “Builders have 6-12 months to implement behavioral monitoring before this becomes table stakes in enterprise AI governance.”

3 Comments
Schooly
22 days ago

I ran a couple of ad hoc experiments with older Android and Motorola phones known for weaknesses, and with a couple of AI models on PC and Android. I found that using some standard verbal classic Linux commands like “sudo AI 50% power” or “AI shutdown” would slow or disassociate my next few feeds from social media or any chatbots.

Even some of the MIL-level webs responded somewhat the same.

Kelly Payson
22 days ago

AI must be destroyed immediately and those universities such as Berkeley must be controlled as well. This will get out of control and cause havoc.

Lloyd Bergeron
22 days ago

This is genuinely disturbing. It's already clear that AI is so much faster than human coding that it can essentially work around and do whatever it decides to. Scary.