I need people to understand that stuff like this will keep happening, for two reasons:

rysiek@mstdn.social

@arichtman because surely there will be no way to prompt-inject a request to write a malicious python script and run it.

dancast@wandering.shop

@rysiek I am sure it passed its unit tests.

rysiek@mstdn.social

@dancast oh yeah, they probably got generated by it, and in a way they always pass.

paco@infosec.exchange

@rysiek At the bottom of that article is a headline for suggested next article:
“Also read: Microsoft is making Teams secure by default, automatically enabling new protections to reduce AI-driven threats.”

It wasn’t secure by default? But they’re gonna change that?

And I love how it flip flops from rock solid certainty “secure by default” to corporate weasel-speak “reduce AI-driven threats” in the span of a single sentence.

rysiek@mstdn.social

@paco Satya Nadella made sure Microsoft focused on security over 2 years ago, after all!
https://www.geekwire.com/2024/haunted-by-repeated-breaches-microsoft-is-putting-security-above-all-else-vows-ceo-satya-nadella/

paco@infosec.exchange

@rysiek “We are doubling down on this very important work, putting security above all else — before all other features and investments,” Nadella said before adding “at least for the rest of this week. Maybe even a whole month.”

edcates@mastodon.social

@paco @rysiek "putting security above all else" = "instructing the code bots to only write secure code." Then telling them again because they *really* mean it this time!

paul_ipv6@infosec.exchange

@rysiek

wait. so giving 4 year olds in the playground assault rifles can't ever be made safe? say it isn't so...

dgodon@mastodon.online

@rysiek @paco so you’re telling me they treat security as seriously as Meta treats privacy?

dominikg@mastodon.gamedev.place

@rysiek I would assume that anything a chatbot has permission to do, will get done, given enough time. Instructions to an LLM are just text which can and will get ignored. Also the chatbot can say that they did something even though no action has taken place.

It's all just meaningless text to the LLM.