I've never been opposed to the word "hallucinating" for describing how AI makes mistakes ... until now.

joriki@infosec.exchange

all the outputs of LLMs and the like are hallucinations, it's just that the "bell curve" of the outputs overlaps the appearance of most of what the user wants

buermann@mastodon.social

@grammargirl

They are AI mirages: they look like what you asked for but the closer you look the less there is.

Only users can hallucinate.

jesser29@social.vivaldi.net

@grammargirl I had a discussion with someone who thought the screen would go fuzzy or similar when AI was hallucinating. So they thought it would be obvious.

buermann@mastodon.social

@BenAveling @draNgNon @grammargirl

The AI is generating language from some matrix algebra that regurgitates transforms of the training data or mirages of it. Only users can hallucinate and believe the mirages are real, while a whirring vortex of vectors can't believe in anything.

downes@mastodon.social

@grammargirl I think it's funny that people who object to the use of 'hallucinate' because it anthropomorphises AI are nonetheless happy with their use of the word 'confident', as in 'confidently makes mistakes', in the same context.

danielmunoz@maly.io

@orionkidder @grammargirl I’ve heard the Spanish science communicator Ignacio Crespo argue that “hallucination” is misleading in this context, because it imports a human mental-state metaphor into a statistical text-generation error. “Confabulation” may be closer: a plausible-sounding reconstruction that fills gaps. Still, it also comes from human cognition, so it can anthropomorphise the model too.

danielmunoz@maly.io

@orionkidder @grammargirl I think the deeper problem with “hallucination” is that it imports a human mental-state metaphor into a statistical text-generation error. That can make people expect obviously bizarre output, when the real danger is often confident, plausible-sounding falsehoods. “Confabulation” has a similar problem, though. But, I don’t know, it sounds better to me.

gotofritz@hachyderm.io

@RnDanger @AccordionBruce @orionkidder @grammargirl

It seems such a pointless, minor nuance that will make no difference whatsoever in practice

(yes I am aware talking about these kinds of minor nuances is your day job, but still, someone's gotta say it)

felichsdakatze@mastodon.social

@grammargirl
Some of the kookiest, genuinely bat-nuts-crazy people I've ever met spoke exceptionally well and logically connected ideas together. They could make exceptionally convincing arguments that were nonetheless wrong.

"Spoke eloquently" is a lower bar than some assume.

elfburgerman@mastodon.social

@grammargirl
I'm opposed to your use of 'AI'. An LLM is not an intelligence, even though that is what people call it.
Every word the industry likes for its own products probably helps to mislead the public.
Every form of anthropomorphisation of LLMs should be banned.

elfburgerman@mastodon.social

@gotofritz @RnDanger @AccordionBruce @orionkidder @grammargirl
Language can be used as one of the most dangerous tools we have because it shapes the way we think (and thus our future) mostly on a subconscious level. The more subtly a word misleads, the more difference it can make in practice.

shamhatt@mastodon.social

@grammargirl This is a great Wittgenstein conundrum, but to be honest I would leave it as is. The scientific community will find its own terms in publications; we are otherwise living in dangerous times, and the last thing we want is to split hairs and divert people from the very issue at hand. Personally I am decanting a good Côtes du Rhône for my hallucinations, and of course some squirt of AI.

orionkidder@writing.exchange

@danielmunoz @grammargirl This is why I refer to its "error rate." It's a machine that produces false answers to such a large degree that it shouldn't be trusted. It's simply faulty.

orionkidder@writing.exchange

@elfburgerman @gotofritz @RnDanger @AccordionBruce @grammargirl I think this is true. Like I said above, I have zero expectation that my language use is going to make a damn bit of difference at scale, but in individual conversations, refusing the metaphor of consciousness can help reframe.

It's just an error. The machine is faulty. It makes errors a lot.

rndanger@infosec.exchange

@elfburgerman @gotofritz @AccordionBruce @orionkidder @grammargirl
I agree.
"Hallucination" is a great marketing term to make people want to trust a machine, but it's a pretty poor choice of words to convey any understanding of what the machine does or how it does it

ohir@social.vivaldi.net

@grammargirl
> The word "hallucination" ... it's a widely used industry term

It is a widely used industry lie that regurgitators do not lie but somehow are slightly mistaken.

While technically it is a "less expected but still possible words rehashing output" or "imperfect probability glitch" or the like, the term "lie" has the accurate and precise definiens of what the output factually is. So that term should be used. I hope it will soon be obligatory for "the industry" to use it in the EU.

:))

orionkidder@writing.exchange

@RnDanger @elfburgerman @gotofritz @AccordionBruce @grammargirl Exactly. Making machines seem like magic, seem like they have no internal mechanism, is a common tactic. It's why we refer to external hard drives that we don't own or control as "the cloud."

clickhere@mastodon.ie

@grammargirl Definitely the latter, but with a slight addition:

"AI tells you things that are wrong in a way that sounds completely believable - which is the system functioning as designed. Confirm all facts!"

denofearth@mas.to

@grammargirl
I think of this as the nines imbalance.

In a datacenter there is talk of nines of uptime. Going from two nines (99%) uptime to three nines (99.9%) requires an order of magnitude investment. Another again for four nines (99.99%).

The AI nines imbalance is that
It is one nine accurate (90%)
but four nines eloquent (99.99%)
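
To put rough numbers on the nines, here is a minimal Python sketch. The function names are made up for illustration, and the 90% and 99.99% figures are just the rhetorical ones from this post, not measurements of any real system.

```python
MINUTES_PER_YEAR = 365 * 24 * 60  # 525,600, ignoring leap years


def nines_to_fraction(nines: int) -> float:
    """Turn a count of nines into a success fraction, e.g. 3 -> 0.999."""
    return 1 - 1 / 10 ** nines


def downtime_minutes_per_year(nines: int) -> float:
    """Minutes of downtime per year allowed at the given nines of uptime."""
    return MINUTES_PER_YEAR / 10 ** nines


def expected_failures(nines: int, total: int) -> float:
    """Expected number of failures out of `total` attempts at the given nines."""
    return total / 10 ** nines


if __name__ == "__main__":
    # Uptime: each extra nine cuts the allowed downtime by a factor of ten.
    for n in (2, 3, 4):
        print(f"{n} nines ({nines_to_fraction(n):.2%}): "
              f"{downtime_minutes_per_year(n):.2f} minutes down per year")

    # The imbalance, using the post's illustrative figures:
    # one nine of accuracy versus four nines of eloquence, per 1,000 answers.
    answers = 1000
    print(f"wrong answers: {expected_failures(1, answers):.1f}")
    print(f"answers that sound off: {expected_failures(4, answers):.1f}")
```

On those assumed figures, out of 1,000 answers about 100 would be wrong while only about 0.1 would sound wrong, which is the gap the post is pointing at.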

arh1@toot.cafe

@grammargirl I appreciate "bullshit" as a better term per this article: https://www.psypost.org/scholars-ai-isnt-hallucinating-its-bullshitting/