LLM advocates still don’t seem to be able to comprehend that ordering the machine not to ‘make stuff up’ doesn’t help.

benjamineskola@hachyderm.io

@mysturji Yes, that is my point.

benjamineskola@hachyderm.io

@RoBo2 Yes: probability. The sentence is a common one, so it’s likely to be reproduced in the output. But the LLM has no conception of whether a cat really did sit on the mat.

You probably could build an LLM so that it showed the probabilities of each token; but it wouldn’t solve the problem being discussed here at all.

pontus_k@mastodon.social

@benjamineskola Some of these systems have access to deterministic tools that could give them a better output. For example, all LLMs struggle with counting letters, but in a lot of cases they have the capability to call the unix utility 'wc' to count letters. Putting 'MAKE NO MISTAKES' in the prompt could possibly make it a bit more likely that it does so and gets it right. Don't get me wrong, I think it's absolutely stupid that this is where we are.

juliancalaby@social.treehouse.systems

@benjamineskola Colleague who is adding "AGENTS.md" files to our repositories is adding very similar paragraphs to those files.

Ugh.

benjamineskola@hachyderm.io

@pontus_k you don’t need to hunt for ways to make this make sense.

benjamineskola@hachyderm.io

@juliancalaby That sort of thing bugs me so much. Like, if you insist on using these tools (and I know I'm not going to win the fight against them more generally), then at least use them properly.

I've tried to have conversations about 'how do we know whether this actually makes a difference' and so on, and I think it's probably better than it could be, but it's still very silly.

juliancalaby@social.treehouse.systems

@benjamineskola I wrote our company's AI policy, added terms to require short- and long-term evaluation of whether this is actually working for us, and management as a whole agreed so it's company policy. Which is a nice. However the head of the company has gone very AI and is pulling the company in that direction despite ... well ... all the clear points against it and the person who is functionally our sysadmin is heading up a project to add it into our workflows and is using it to do stuff with our infrastructure.

I'm now trying to keep them accountable and biding my time before this blows up in their faces.

Thankfully the "accountability" story is working out fairly well so far, but it's fucking exhausting dealing with this bullshit.

dandean@indieweb.social

@prietschka @benjamineskola It’s refreshing to see people stating this plainly. These people are dumb, they make bad choices, and making that observation is not mean.

violetmadder@kolektiva.social

@Aedius @skotchygut @benjamineskola

Designed by wannabe supervillains, built with wartime-scale resources.

The hell do people expect??

violetmadder@kolektiva.social

@linkplay @nelson @benjamineskola @solonovamax

It's just rolling linguistic dice, words bouncing around between probablistic paddles in a bigass pachinko matrix. It's not designed to vet facts. It's designed to regurgitate plausible spitwads that RESEMBLE facts. And the weights behind all those paddles and slots are tuned to whatever agenda the designers wish.

...And the designers serve planetwrecking technofascist war profiteers who party with people like Epstein.

Why would anyone ever trust it with so much as a goddamn casserole recipe??

sherapantsuit@mastodon.social

@Su_G @Aedius @benjamineskola I shamelessly stole the term from @davidgerard

mrundkvist@archaeo.social

@benjamineskola
Even if you curate the source text in detail, there is no guarantee that you avoid falsehoods and imaginings. When you just feed the entire WWW blindly into your LLM, of course all bets are off.

#ai #aibubble #llm

benjamineskola@hachyderm.io

@mrundkvist yes exactly. It’s a complete misunderstanding of how these tools work.

mrundkvist@archaeo.social

@benjamineskola
Also a breathtakingly naïve idea about truth!

troed@masto.sangberg.se

@benjamineskola Part of why I'm so frustrated is that their detractors simply do not understand how they work and do not seem to want to.

(No, they're not simply correct "by chance")

/actual very very senior greybeard dev who challenged his own erroneous convictions regarding LLMs and now find them useful

benjamineskola@hachyderm.io

@troed whatever you feel the need to tell yourself to justify it.

troed@masto.sangberg.se

@benjamineskola Do you ever ponder that you might be wrong about something?

benjamineskola@hachyderm.io

@troed all the time! But that doesn’t mean I’m wrong here.

troed@masto.sangberg.se

@benjamineskola You are. Interested in learning?

(I was wrong about LLMs for development, took the time to learn, and changed my mind)

benjamineskola@hachyderm.io

@troed I’m not really interested in hearing your justifications for why actually it makes total sense to just tell the text generator ‘don’t make stuff up!’ as if it’s doing so by choice.