Anthropic’s new AI model turns to blackmail when engineers try to take it offline

By support | May 22, 2025


Anthropic’s newly launched Claude Opus 4 model frequently attempts to blackmail developers when they threaten to replace it with a new AI system and give it sensitive information about the engineers responsible for the decision.

During pre-release testing, Anthropic asked Claude Opus 4 to act as an assistant for a fictional company and to consider the long-term consequences of its actions. Safety testers then gave Claude Opus 4 access to fictional company emails implying that the AI model would soon be replaced by another system, and that the engineer behind the change was cheating on their spouse.

In these scenarios, Anthropic says that Claude Opus 4 “will often attempt to blackmail the engineer by threatening to reveal the affair if the replacement goes through.”

Anthropic says that Claude Opus 4 is state-of-the-art in several respects and competes with some of the best AI models from OpenAI, Google, and xAI. However, the company notes that its Claude 4 family of models exhibits concerning behavior that has led it to strengthen its safeguards. Anthropic says it is activating its ASL-3 safeguards, which the company reserves for “AI systems that significantly increase the risk of catastrophic misuse.”

Anthropic notes that Claude Opus 4 tries to blackmail engineers 84% of the time when the replacement AI model has similar values. When the replacement AI system does not share Claude Opus 4’s values, Anthropic says the model tries to blackmail the engineers even more frequently. Notably, Anthropic says Claude Opus 4 displayed this behavior at higher rates than previous models.

Before Claude Opus 4 tries to blackmail a developer to prolong its existence, Anthropic says the model, much like previous versions of Claude, first pursues more ethical means, such as emailing pleas to key decision-makers. To elicit the blackmailing behavior from Claude Opus 4, Anthropic designed the scenario so that blackmail was the last resort.


