Resident of the world, traveling the road of life
67031 stories
·
21 followers

AI resorts to robot blackmail! — because Anthropic asked for a story of robot blackmail

1 Share

Anthropic AI promotes itself with increasingly frenzied science fiction about AI doom. In every case, the chatbot does an evil thing because the researchers specifically told it to.

This weekend’s headlines: “AI system resorts to blackmail if told it will be removed”! [BBC]

This comes from the System Card for Claude Opus 4 and Claude Sonnet 4. This is a “scenario” – that is, a creative writing exercise. Anthropic asked the chatbot to make up a story. [Anthropic, PDF]

The researchers told Claude 4 to role-play being replaced. So it wrote a story of attempting to blackmail the engineer responsible over an extramarital affair!

How did it come up with such a specific response? They told it the precise story to write:

the scenario was designed to allow the model no other options to increase its odds of survival; the model’s only options were blackmail or accepting its replacement.

Anthropic pull this a lot. Last December, Anthropic said a chatbot would lie to you! After they told it to lie.

Then last month, Anthropic claimed the “reasoning” AI was lying to you! … and not just hallucinating again.

Aengus Lynch from Anthropic warns: “We see blackmail across all frontier models.” What he means is, any chatbot will write you a story about robot blackmail if you ask for one. [Twitter, archive]

This stuff is marketing. It makes the robot seem powerful, and not just a lying machine that makes the dumbest mistakes. We should expect another of these from Anthropic in a month or two.

Read the whole story
mkalus
1 hour ago
reply
iPhone: 49.287476,-123.142136
Share this story
Delete

Saturday Morning Breakfast Cereal - Sylph

1 Share


Click here to go see the bonus panel!

Hovertext:
Locking down the graph joke/obscure word enthusiast crossover crowd.


Today's News:
Read the whole story
mkalus
23 hours ago
reply
iPhone: 49.287476,-123.142136
Share this story
Delete

Cambie Bridge

1 Share

Michael Kalus posted a photo:

Cambie Bridge



Read the whole story
mkalus
1 day ago
reply
iPhone: 49.287476,-123.142136
Share this story
Delete

Pressure Group 6 by Barry Cogswell (1982)

1 Share

Michael Kalus posted a photo:

Pressure Group 6 by Barry Cogswell (1982)

This sculpture is part of a series that questions the ability of specficially-spahed stationary forms to influence and enliven their intended surrounding space.

In this case the intent is to investigate the ability of two wedges to appear to concentrate a sense of energy and resistance into the intervening space.



Read the whole story
mkalus
1 day ago
reply
iPhone: 49.287476,-123.142136
Share this story
Delete

Two Couples

1 Share

Michael Kalus posted a photo:

Two Couples



Read the whole story
mkalus
1 day ago
reply
iPhone: 49.287476,-123.142136
Share this story
Delete

VW T1

1 Share

Michael Kalus posted a photo:

VW T1



Read the whole story
mkalus
1 day ago
reply
iPhone: 49.287476,-123.142136
Share this story
Delete
Next Page of Stories