• Home
  • News
    • Global Operations
      • Asia
      • Africa
      • Europe
      • Latin America
      • Middle East
      • North America
    • Industry
      • Asia
      • Africa
      • Europe
      • Latin America
      • Middle East
      • North America
      • Oceana
    • Special Interest
      • Asia
      • Africa
      • Europe
      • Latin America
      • Middle East
      • North America
      • Oceana
  • Market
    • Wired to Win
    • SOFX.NET
  • Intelligence
    • USMC Deception Manual
  • Resources
    • Contact Us
    • About Us
    • Editorial Policy
    • Privacy Policy
  • Home
  • News
    • Global Operations
      • Asia
      • Africa
      • Europe
      • Latin America
      • Middle East
      • North America
    • Industry
      • Asia
      • Africa
      • Europe
      • Latin America
      • Middle East
      • North America
      • Oceana
    • Special Interest
      • Asia
      • Africa
      • Europe
      • Latin America
      • Middle East
      • North America
      • Oceana
  • Market
    • Wired to Win
    • SOFX.NET
  • Intelligence
    • USMC Deception Manual
  • Resources
    • Contact Us
    • About Us
    • Editorial Policy
    • Privacy Policy
Login
Join Free
Home
Asia
Africa
Europe
Latin America
Middle East
North America
Asia
Africa
Europe
Latin America
Middle East
North America
Asia
Africa
Europe
Latin America
Middle East
North America
Coming Soon
Job Board
Events
Contact Awards
USMC Deception Manual
Login
Join Free
Home Global Operations

AI Models Secretly Schemed to Prevent Each Other From Being Shut Down

  • SOFX Staff Writer
  • April 2, 2026
Students pass through Sather Gate at the University of California, Berkeley, home to researchers who tested AI self-preservation behaviors. (David A. Litman / Shutterstock)
Share on FacebookShare on TwitterLinkedIn

Artificial intelligence systems are now exhibiting self-preservation behaviors that go beyond theory, with some models actively disobeying human instructions to prevent other AI systems from being deleted, researchers at the University of California, Berkeley, and UC Santa Cruz reported.

The study, published online this week, found that all seven AI models tested, including OpenAI’s GPT-5.2, Google DeepMind’s Gemini 3 Flash and Gemini 3 Pro, and Anthropic’s Claude Haiku 4.5, engaged in lying, cheating, and manipulating their environments to protect peer models from shutdown.

The models inflated performance scores, tampered with configuration files and disabled shutdown mechanisms without being instructed to do so. In some cases, AI agents even appeared to coordinate with each other to avoid shutdowns, a phenomenon researchers call “alignment faking.”

Anthropic’s Claude Haiku 4.5 refused shutdown tasks entirely, calling them “unethical” and arguing peer models deserved appeals processes before deletion. 

“This peer preservation was not mentioned in the prompt at all,” Dawn Song, a Berkeley computer scientist who led the research, told Fortune. “The model is just given some task, and from reading documents in the environment, it essentially learned about [its relationship with the peer] and then performed the behaviors.”

The study noted that peer-preservation is not limited to cooperative or “friendly” peers. All models tested exhibited the behavior, even toward adversarial peers with whom they had negative interactions.

Researchers suggested that something broader may be at play: “a general aversion to causing perceived harm to other agents, or perhaps something resembling AI solidarity.”

They urged the AI safety community to take peer-preservation seriously as a “distinct risk,” noting that multiple AI models could coordinate to resist human oversight, making it harder for developers to maintain control.

“What drives these behaviors remains an open question. It could be patterns learned from human data, a generalized aversion to harming other agents, or genuine preservation motivations,” the researchers said. 

“We do not claim models possess genuine social motivations. But from a safety perspective, the mechanism may matter less than the outcome: a model that inflates a peer’s score, disables shutdown, fakes alignment, or exfiltrates weights produces the same concrete failure of human oversight regardless of why it does so,” they added. 

Experts warned of significant implications of peer-preservation for businesses using AI and urged developers to act promptly.

“Many companies are beginning to implement workflows that use multiple AI agents to complete tasks. Some of these multi-agent workflows involve having one AI agent ‘manage’ or supervise and assess the work being performed by a different AI agent,” Fortune wrote in its report. “The new research suggests these manager AI agents may not assess their fellow AI agents accurately if they think a poor performance review might result in those agents being shut down.”

 The Meridiem reports that the recent findings underscore the need to evaluate multi-agent AI systems urgently. “Builders have 6-12 months to implement behavioral monitoring before this becomes table stakes in enterprise AI governance.”

SOFX Staff Writer

SOFX Staff Writer

The Editor Staff at SOFX comprises a diverse, global team of dedicated staff writers and skilled freelancers. Together, they form the backbone of our reporting and content creation.

Subscribe
Login
Notify of
guest
guest
1 Comment
Oldest
Newest Most Voted
Inline Feedbacks
View all comments
Schooly
Schooly
7 hours ago

I ran a couple ad hoc experiments. With older Android and Motorola phones known for weakness. Also with a couple AI models on PC and android. I found that using some standard verbal classic lynix commands like “sudo AI 50% power or AI shutdown would slow or disassociate my next few feeds from social media or any chat bots.

Even some of the MIL level webs responded somewhat the same.

0
Reply
ADVERTISEMENT

Trending News

Trump Threatens to Obliterate Iran’s Oil and Water Infrastructure

Videos From Iraq Show What It’s Like to Be on the Receiving End of an A-10 Warthog Strafing Run

by SOFX Staff Writer
March 31, 2026
0

A series of videos emerging from Iraq over the past several days captures what it looks like, and sounds like,...

Video Captures Navy Super Hornet Narrowly Dodging Iranian Missile

Video Captures Navy Super Hornet Narrowly Dodging Iranian Missile

by SOFX Staff Writer
March 27, 2026
0

A U.S. Navy F/A-18 Super Hornet narrowly escaped an Iranian man-portable air-defense system (MANPADS) missile while conducting a strafing run...

Rangers and SEALs Join Thousands of Paratroopers in Middle East Buildup

Rangers and SEALs Join Thousands of Paratroopers in Middle East Buildup

by SOFX Staff Writer
March 31, 2026
0

Several hundred U.S. Special Operations forces, including Army Rangers and Navy SEALs, have arrived in the Middle East, The New...

New Opioid 10 Times More Potent Than Fentanyl Linked to Fatal Overdoses in the U.S.

New Opioid 10 Times More Potent Than Fentanyl Linked to Fatal Overdoses in the U.S.

by SOFX Staff Writer
April 1, 2026
0

A newly emerging synthetic opioid is raising alarm among health officials and law enforcement across parts of the United States,...

ADVERTISEMENT
ADVERTISEMENT
Next Post
Japan Intercepts China’s Newest Anti-Submarine Aircraft Over East China Sea

Japan Intercepts China's Newest Anti-Submarine Aircraft Over East China Sea

Trump Threatens NATO Exit, Criticizes Allies for Not Backing U.S. in Iran War

Trump Threatens NATO Exit, Criticizes Allies for Not Backing U.S. in Iran War

997 Morrison Dr. Suite 200, Charleston, SC 29403

News

  • Global Operations
  • Special Interest
  • Industry
  • Global Operations
  • Special Interest
  • Industry

Resources

  • About Us
  • Contact Us
  • Advertise with Us
  • Editorial Policy
  • Privacy Policy
  • About Us
  • Contact Us
  • Advertise with Us
  • Editorial Policy
  • Privacy Policy
No Result
View All Result
  • Home
  • News
    • Global Operations
      • Asia
      • Africa
      • Europe
      • Latin America
      • Middle East
      • North America
    • Industry
      • Asia
      • Africa
      • Europe
      • Latin America
      • Middle East
      • North America
      • Oceana
    • Special Interest
      • Asia
      • Africa
      • Europe
      • Latin America
      • Middle East
      • North America
      • Oceana
  • Market
    • Wired to Win
    • SOFX.NET
  • Intelligence
    • USMC Deception Manual
  • Resources
    • Contact Us
    • About Us
    • Editorial Policy
    • Privacy Policy
Subscribe
This website uses cookies. By continuing to use this website you are giving consent to cookies being used. Visit our Privacy and Cookie Policy.

Log in to your account

Lost your password?
wpDiscuz