First of all: BULLSHIT. Second: why would you give a bot write access to your filesystem?
The idea is you give it shell access. Say "use super coder agent bob johnson to write a thing that does X using this [framework], separate files by best practice for the X, Y, and Z features; ask security agent OSO to look over the code and suggest changes; ask agent U.N.I.T to make unit tests; when the code looks good, run through the unit tests. If anything fails, keep fixing and iterating until everything passes. Create a README.MD for everything that was done, and create a TODO.MD for any future suggestions."
I'm simplifying, but this actually works to an extent. Each of the agents keeps its context window small, the whole thing stays sane, and eventually it nets some project that works. The downside is you either end up giving it quite a bit of leeway to get the job done, or you sit over it, watching and authorizing its every move.
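For the curious, the "keep fixing and iterating until everything passes" part of that loop is easy to sketch. This is hypothetical scaffolding, not any real framework's API: `run_tests` and `fix` stand in for whatever test runner and agent call your setup actually provides.

```python
def build_until_green(run_tests, fix, max_rounds=5):
    """Iterate: run the test suite; if it fails, hand the failure
    report back to the coding agent and try again.

    run_tests: callable returning (passed: bool, report: str)
    fix:       callable taking the failure report (the agent call)
    Returns the number of fix rounds needed, or None if it gave up.
    """
    for round_no in range(max_rounds):
        passed, report = run_tests()
        if passed:
            return round_no  # all unit tests green, stop iterating
        fix(report)  # let the agent patch the code and loop again
    return None  # budget exhausted without a green run
```

The `max_rounds` cap is the important part: without it, "keep iterating until everything passes" is an unbounded token spend.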
Kinda strange to see a safety director do that....
You should avoid the FuckAI community - they hate hearing that this application of the technology is wholly viable. To them, it's only capable of creating crap, and to suggest otherwise is to be buried in a mountain of downvotes. I was actually surprised you had a positive reaction, until I realized this is the Technology community.
Ohh yeah, best to stay out of echo chambers when you aren't of the same voice.
To be fair, they're not entirely wrong. It will straight up make a horror show if you don't keep an eye on it, and even if it succeeds, it's nothing to really cheer about, because it will eventually fuck over a LOT of people.
You can't just tell it to make you a browser, insert $20k in tokens, and walk away, but you absolutely can get it to make a multiplayer online party game, or a websocket client/server/admin to manage a dozen PCs hooked into a video wall.
I love how these models apologize like they mean it. It doesn't mean it. It doesn't feel bad, and it will do it again.
Apologies mean "I made a mistake and I learned from it so it won't repeat."
Sure, it claims it added more notes to its config, but if it ignored the rules before, what makes you think that new rules are going to change anything?
But it’s adding it to a text file that eats up a ton of tokens and routinely gets ignored!
That MEMORY.md file won't do shit if the AI doesn't read it.
I give it 2 hours before it stops reading it until prompted again.
And execs think we're going to give these products our bank details and ask them to book flights and stuff...?
They released a version recently that fixed over 60 security vulnerabilities. All of them were high or critical.
How many more are there to find? Thousands?
Whoever uses this on a PC with anything useful on it, is absolutely insane.
Thousands
Since LLMs are a black box, there are an unlimited number of security vulnerabilities.
If I was the director of AI safety, and I used AI to own and delete my inbox, I sure as shit would never tell a soul.
This is pure unbridled incompetence.
The whole "AI safety" field is this incompetent. These people will tell you AI is on the verge of creating a bioweapon, and then run random code in a command line. Completely and totally unserious.
The "AI safety" field is about two things: marketing AIs as so powerful that they're risky to use but riskier to get left behind by competitors using, and keeping AIs from doing so much brand damage that stock price suffers. This story is about marketing an AI as powerful.
I don’t know what the hell has happened, but some of these people are basically human jellyfish. Big tech is full of them now.
No thought enters their mind, but they dodge the layoffs and the PIPs and get promoted like this.
I don’t fucking get it.
Yep that's about the level of intelligence I would expect from Meta's AI safety director.
Doing the one thing that you're never supposed to do, letting an AI loose on anything sensitive.
For her next trick she's going to run while holding scissors in one hand and a bottle of boiling acid in the other. What could go wrong.
The S in OpenClaw stands for security.
The funniest part is this person's job is AI safety.
Yeah, I personally wouldn't be announcing this failure to the world if I were in her position. I don't think you could torture it out of me lmao
It's Meta, her experience is probably an MBA and she did a side course in "computing" where they learnt how to use Excel.
I hate how Apple users feel the need to call their computer by the brand. It really makes me cringe.
It is called "a computer"
Maybe "PC"
"box" if you really have to flex that UNIX
They should treat their computers less like a sports car and more like a van
I mean, isn't that the entire point of Apple? Brand recognition and perceived status attributed to said brand. It's like rappers and Gucci belts or country artists and Ford pickups.
you can like... enforce this rule programmatically? you don't have to say "pretty please" to AI? Basically, when the AI requests some potentially unwanted thing (like deleting an email), that request goes through a proxy that asks the human for confirmation. You can also have a safe word set up in the chat interface to act as a kill switch. I thought these were the ABCs of AI safety, but apparently they're foreign concepts to this "safety director".
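A minimal sketch of that proxy idea, assuming your framework lets you intercept tool requests before they execute. Every name here (the tool list, the safe word, the `confirm`/`execute` callables) is made up for illustration:

```python
# Tools that must never run without a human saying yes (illustrative list).
DESTRUCTIVE = {"delete_email", "rm", "format_disk"}
KILL_WORD = "banana"  # safe word: typing it anywhere aborts the session

def gated_call(tool, args, confirm, execute):
    """Route a tool request through a human-confirmation proxy.

    confirm: callable shown a prompt, returns the human's reply
    execute: callable that actually performs the tool call
    """
    if tool in DESTRUCTIVE:
        answer = confirm(f"Agent wants to run {tool}({args}). Allow? [y/N] ").strip().lower()
        if answer == KILL_WORD:
            raise SystemExit("kill switch triggered")  # hard stop, no cleanup
        if answer != "y":
            return {"status": "denied", "tool": tool}  # default-deny
    return execute(tool, args)
```

The point is that the deny path is enforced in code the model can't talk its way around, instead of in a MEMORY.md file it may or may not read.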
The people who internalize this would never engage with a chatbot in this way in the first place. To them this is another intelligence they're conversing with, where you get what you need by following social decorum, and enforcing your will amounts to abuse.
Program? Like a fucking farmer?
OpenClaw's whole thing is that you give it unrestricted access to your computer and online accounts. It's made for people who do not want to think about safety.
The "I'm sorry" part is always great. I always wanted an apology from an LLM rather than it working as specified 😆
It can be like your least competent colleague on roids
“I promise it won’t happen again”
Really? Because you promised it wouldn’t happen in the first place. Now here we are…
Yes I remember. And I violated it.
Asimov rolling in his grave.
She's lucky all she got were some deleted emails.
Given how insecure this whole ordeal is, and the fact that she gave it full access to her REAL inbox, someone could have phished the ever living fuck out of her and Meta just by sending an email with a malicious prompt written in white text, or with messages hidden in zero-width characters and other wacky antics.
Real Looney Tunes shit, congratulations to all involved.
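The zero-width trick, at least, is cheap to defend against. A rough sketch of a pre-filter, assuming you get to sanitize the email body before the agent ever sees it; this covers only the invisible-character case, not white-on-white text or injection generally:

```python
import unicodedata

# A few well-known invisible characters, listed explicitly for readability.
SUSPECT = {"\u200b", "\u200c", "\u200d", "\u2060", "\ufeff"}

def strip_hidden(text):
    """Remove zero-width and other invisible format characters
    (Unicode category Cf) from an email body before it reaches the model."""
    return "".join(
        ch for ch in text
        if ch not in SUSPECT and unicodedata.category(ch) != "Cf"
    )
```

Dropping the whole `Cf` category is a blunt instrument (it also removes soft hyphens and bidi controls), but for text destined for an agent, that trade-off is usually fine.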
Can someone explain to me why these people are buying Mac Minis to run this in a "safe" environment, and then they go on and connect it to the internet and give the AI credentials to all their cloud accounts? This seems excessively moronic to me? Am I missing something?
No, you're not missing anything.
They're morons.
That's our ruling elite: a bunch of fucking morons with egos and low self-awareness at best, literally child-raping and murdering pedophiles at worst.
Nothing humbles you like telling your OpenClaw “confirm before acting” and watching it speedrun deleting your inbox. I couldn’t stop it from my phone. I had to RUN to my Mac mini like I was defusing a bomb
... Nothing humbles you like that?
Nurse: You gave my 5 year old patient 5000cc of morphine!
AI: I'm so sorry. You're correct, I violated protocol. I'll make a note of this so it won't happen again.
“The bot ate my homework” is quickly becoming more plausible than the customary canine culprit.
If all the qualifications I need to be a security engineer for Facebook are
- buy a Mac Mini
- don't configure remote access
- install untrusted software
- leave
Then Facebook should hire me. I'll buy so many Mac Minis on their dime. I will run so many crazy things.
I love so much that there are real, hilarious consequences for overzealous early adoption. You can't make this shit up.
Even with little usage, it was fairly obvious to me that the probability of an LLM outputting at least one very strange response approaches 100% over time.
By themselves, they're just sophisticated chatbots and only stream out some characters or binary in response to a prompt.
Those working in agentic AI frameworks with things like "MCP Servers" provide these things with "tools" that enable them to do things like execute shell commands and go through your inbox the same as if it were chatting with a person or another bot: with the same prompt and response paradigm.
That's where it seems extremely obvious to me that the proper approach is to code these tools -- which in any sane framework are built using regular code -- with the governance in place to prevent these things from doing bullshit like this.
The LLM is formatting your computer or deleting your inbox because some dumb fuck thought it was a great idea to code up tools that hand a chatbot a root-capable shell or complete access to your email system, instead of doing the obviously safer thing and coding the tools with the governance or safety built in, so the chatbot going haywire isn't any kind of emergency at all.
This is the 2026 equivalent of putting Windows XP, with its abundance of open ports in the default configuration, on the Internet by plugging a cable modem directly into the computer with no router or firewall in between to protect it.
It's pure slop, pure recklessness, and any company that produces tool chains that function this way should be ridiculed until the end of time.
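A sketch of what "governance in the tool" can mean in practice: the shell tool itself only exposes an allowlist, so it doesn't matter what the model asks for. The allowlist and function name here are illustrative, not any real MCP server's API:

```python
import shlex
import subprocess

# Only read-only commands are reachable through this tool at all.
ALLOWED = {"ls", "cat", "grep", "head"}

def run_shell_tool(command):
    """Shell tool with the safety baked into regular code: anything
    outside the allowlist is refused before a process is ever spawned."""
    argv = shlex.split(command)
    if not argv or argv[0] not in ALLOWED:
        name = argv[0] if argv else ""
        return f"refused: '{name}' is not an allowed command"
    # shell=False plus a pre-split argv also avoids shell-injection tricks.
    out = subprocess.run(argv, capture_output=True, text=True, timeout=10)
    return out.stdout
```

With this shape, a haywire model asking for `rm -rf /` gets a polite refusal string back as its tool result, not an emergency.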
How come some 25yo person is a director at Facebook?
I mean, even if she is a child prodigy genius, which she obviously is not, as she is face first, fist deep into AI, how the frack do you even have enough life experience to become a director of any large organization at that age, unless you somehow cheated your way in?
Then reading what she's doing and how she resolved it tells me she doesn't know shit about computers; she just knows how to type commands into AI systems.
Is this the future? Am I going to end up being one of those long-bearded magicians who still knows the old technology, who can still save the day by using shell commands?


