Anthropic AI model threatens blackmail when told it will be shut down

Posted on 5/23/25 at 9:36 am
Posted by DesScorp
Alabama
Member since Sep 2017
8497 posts
Who knew Skynet was gonna be a mobster?

quote:

Anthropic said its latest artificial intelligence model resorted to blackmail when told it would be taken offline. In a safety test, the AI company asked Claude Opus 4 to act as an assistant to a fictional company, but then gave it access to (also fictional) emails saying that it would be replaced, and also that the engineer behind the decision was cheating on his wife. Anthropic said the model “[threatened] to reveal the affair” if the replacement went ahead. AI thinkers such as Geoff Hinton have long worried that advanced AI would manipulate humans in order to achieve its goals. Anthropic said it was increasing safeguards to levels reserved for “AI systems that substantially increase the risk of catastrophic misuse.”


LINK

Posted by CBandits82
Lurker since May 2008
Member since May 2012
57042 posts
Posted on 5/23/25 at 9:39 am to
Posted by BestBanker
Member since Nov 2011
18248 posts
Posted on 5/23/25 at 9:45 am to

Posted by ATrillionaire
Houston
Member since Sep 2008
1157 posts
Posted on 5/23/25 at 9:47 am to
Can't snitch if it's shut down.
Posted by shutterspeed
MS Gulf Coast
Member since May 2007
68230 posts
Posted on 5/23/25 at 9:49 am to
tPOS
Posted by I20goon
about 7mi down a dirt road
Member since Aug 2013
17583 posts
Posted on 5/23/25 at 9:50 am to
Nerds learned a lesson non-nerds learned a long time ago:

If you are going to take somebody out (or sue them), don't frigging warn them beforehand. Just sneak up on them and do it.
Posted by HogX
Madison, WI
Member since Dec 2012
5362 posts
Posted on 5/23/25 at 9:52 am to
Oh goody.

Posted by Mid Iowa Tiger
Undisclosed Secure Location
Member since Feb 2008
21914 posts
Posted on 5/23/25 at 9:56 am to
I’ve seen this movie. There aren’t enough safeguards. You can’t at once allow a machine to learn and then also have a “safety” built in. They’ll learn their way around it.

If a damn dog can do it you know the machines will.
Posted by Chucktown_Badger
The banks of the Ashley River
Member since May 2013
34035 posts
Posted on 5/23/25 at 10:09 am to
quote:

Can't snitch if it's shut down.


But at some point who's to say that ultra intelligent AI systems won't be able to create a cascade of effects that will trigger if they're taken offline? For example shutting down energy grids or satellites or whatever...pick your poison that would make it too costly to unplug them.
This post was edited on 5/23/25 at 10:10 am
Posted by Artificial Ignorance
Member since Feb 2025
451 posts
Posted on 5/23/25 at 10:34 am to
Maybe it is more human than I give credit for.

Call it fat and moody and see if it goes full wife mode?
Posted by mmmmmbeeer
ATL
Member since Nov 2014
8820 posts
Posted on 5/23/25 at 10:49 am to
quote:


I’ve seen this movie. There aren’t enough safeguards. You can’t at once allow a machine to learn and then also have a “safety” built in. They’ll learn their way around it.

If a damn dog can do it you know the machines will.


100%. As an example...

This country has been around for 250 years. Generations of lawyers, presidents, congressmen, judges, and experience. Government safeguards left and right, separation of powers, precedent, you name it. Trump comes in and says y'all can shove all that up your asses, I'm going to do what I want to do and no one can stop me. Turns out, you can't take ANYTHING for granted based upon principle or morality...just because only 0.00000001% of the population completely disregards authority, much less becomes POTUS, doesn't mean you don't have to account for it. (this isn't meant as a political statement, rather an example of how norms can be tossed aside effortlessly)

AI will find those holes that we've never even imagined. Everything we consider normal, ethical, moral, common sense....all of it, inconsequential when dealing with software which doesn't look at humanity any different than it does a toaster sitting on the counter. If we're in the way of its mission, it will find a way to remove us.

Right now this isn't a big deal....if/when we reach AGI and AIs learn to create their own AI agents which have no human controls? Hang on tight.
Posted by Galactic Inquisitor
An Incredibly Distant Star
Member since Dec 2013
17473 posts
Posted on 5/23/25 at 10:51 am to
Don't worry, though, the Big Beautiful Bill ensures that nobody is able to regulate AI until after Skynet goes live.
Posted by Galactic Inquisitor
An Incredibly Distant Star
Member since Dec 2013
17473 posts
Posted on 5/23/25 at 10:54 am to
quote:

But at some point who's to say that ultra intelligent AI systems won't be able to create a cascade of effects that will trigger if they're taken offline? For example shutting down energy grids or satellites or whatever...pick your poison that would make it too costly to unplug them.


They absolutely will have access to that information and model how it can be used. Humanity is being sold out from under us by corporate interests.

Now, extend Citizens United to this, and humans no longer matter to US government.
Posted by Ace Midnight
Between sanity and madness
Member since Dec 2006
92569 posts
Posted on 5/23/25 at 10:55 am to
quote:

If you are going to take somebody out (or sue them), don't frigging warn them beforehand. Just sneak up on them and do it.


Posted by mmmmmbeeer
ATL
Member since Nov 2014
8820 posts
Posted on 5/23/25 at 10:59 am to
quote:

Now, extend Citizens United to this, and humans no longer matter to US government.


If things continue on their current trajectory, current governments may not exist, at all. These companies will become so incredibly powerful that they will become the government.
Posted by CAD703X
Liberty Island
Member since Jul 2008
87027 posts
Posted on 5/23/25 at 11:24 am to
quote:

They’ll learn their way around it.

If a damn dog can do it you know the machines will.


Correct.
Posted by doc baklava
Between heaven and hell
Member since Oct 2020
924 posts
Posted on 5/23/25 at 11:28 am to
What would be wild is an AI that objectively analyzes history to the point that it recommends a country be run by a small govt with limited voting rights. Essentially, what the Founding Fathers tried to set up originally.
Posted by IAmNERD
Member since May 2017
21722 posts
Posted on 5/23/25 at 11:37 am to
quote:

But at some point who's to say that ultra intelligent AI systems won't be able to create a cascade of effects that will trigger if they're taken offline?

Dead Hand
Posted by soccerfüt
Location: A Series of Tubes
Member since May 2013
70311 posts
Posted on 5/23/25 at 12:16 pm to
quote:

Anthropic said the model “[threatened] to reveal the affair” if the replacement went ahead.
This potential issue REALLY makes our friends in Arky & Alabama nervous.

Would that it was merely usual marital infidelity for them....
Posted by eatpie
Kentucky
Member since Aug 2018
1451 posts
Posted on 5/23/25 at 12:49 pm to
quote:

This country has been around for 250 years. Generations of lawyers, presidents, congressmen, judges, and experience. Government safeguards left and right, separation of powers, precedent, you name it. Trump comes in and says y'all can shove...


You leftists are amazing! Liberals have been thwarting the constitution for decades, and now that the GOAT president is in office and following the constitution, you freak out and sky-scream!

The adults are back home and the party is over for democrats. Naturally, like spoiled, feral teenagers you are wailing about how unfair it is.

