Machine translations are often brought up as a gotcha whenever I criticize LLMs.
-
@Gargron It is a technology that humanity has been seeking for a long time. At least since the 1950s, with Turing and his colleagues.
-
@grishka One problem with LLMs is that they tend to translate and summarise what’s likely to be in the source text, not what’s actually in the text.
This means that when translating/summarising a text that deviates from the usual content in a subject or genre, the LLM will push it towards the common.
Using the result to understand the original contents is therefore very risky. For example, when screening texts, ”incorrect” content might be ”corrected”, increasing the likelihood it will pass.
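The mechanism described above can be shown with a toy "translator" that scores candidate outputs by a language-model prior. This is a hedged illustration only: the phrases, frequencies, and scoring function are all invented for the example, and no real translation system works this simply.

```python
# Toy demo: a candidate output is scored by (fidelity to source) * (prior
# likelihood of the phrase in the training corpus). All numbers are made up.
prior = {  # invented corpus frequencies for each phrasing
    "the results were significant": 900,
    "the results were insignificant": 100,
}

def pick_translation(candidates, fidelity):
    # pick the candidate with the highest combined score
    return max(candidates, key=lambda c: fidelity[c] * prior[c])

# The source text actually says the results were INsignificant...
fidelity = {
    "the results were insignificant": 1.0,   # faithful to the source
    "the results were significant": 0.2,     # unfaithful to the source
}
print(pick_translation(list(prior), fidelity))
# → "the results were significant"
```

Because the common phrasing's prior (900 × 0.2 = 180) outweighs the faithful phrasing's score (100 × 1.0 = 100), the unusual-but-correct content is "corrected" toward the common, which is exactly the screening risk described above.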
-
Let me ask you this: It's your birthday.
5 of your friends met some days before and wrote a song for you. It's not really good, the text doesn't even rhyme...but they did this for you and they had fun.
They enjoyed the act of creating. 5 other friends wrote a prompt and pressed a button to generate a song.
Which song will you remember?
-
@Tekchip my walls are full of art by humans that some would call terrible... who the fuck cares? they have love and craft and pain and power from the hands and soul of a human creator. they are beautiful. i fucking love bad art.
slop generation is the nothingness.
just write your toot from your heart, fuck the machine. being human is fine.
-
@Gargron Technology is not inevitable. We've decided not to have asbestos in our walls, lead in our pipes, or carcinogenic chemicals in our food. (If you're going to argue that it's not everywhere, where would you rather live?) We could just not do LLMs. It's allowed.
There are failed technologies, like the Zeppelin.
-
@ClipHead @melioristicmarie @Gargron which is it?
"there is no value in the average."
or
"my walls are full of art by humans that some would call terrible... who the fuck cares?"
Can't have it both ways.
-
LLMs are Shannon 1948 as far as the theory goes (building on Markov, but adding computer technology), with some compression techniques.
But I think you're talking about something else entirely, not purely syntactical.
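The Shannon 1948 lineage mentioned above can be sketched as a toy bigram (Markov) text generator, in the spirit of Shannon's n-gram experiments. This is a minimal illustration of that classic idea, not a claim about how modern LLMs are implemented; the corpus and function names are made up for the example.

```python
import random
from collections import defaultdict

def build_bigram_model(text):
    """Count which words follow each word, as in Shannon's n-gram models."""
    words = text.split()
    model = defaultdict(list)
    for prev, nxt in zip(words, words[1:]):
        model[prev].append(nxt)  # duplicates encode frequency
    return model

def generate(model, start, length=8, seed=0):
    """Walk the Markov chain: each word depends only on the previous word."""
    random.seed(seed)
    out = [start]
    for _ in range(length):
        choices = model.get(out[-1])
        if not choices:
            break
        out.append(random.choice(choices))
    return " ".join(out)

corpus = "the cat sat on the mat and the cat ran to the dog"
model = build_bigram_model(corpus)
print(generate(model, "the"))
```

The output is locally plausible but globally meaningless, which is the purely syntactical sense in which the generation is "Shannon 1948".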
-
@Gargron would you know if you've seen a good outcome of an LLM? You'd somehow be able to identify when the LLM got it right?
I assure you, you've experienced good LLM output and don't even know it, because that's what good LLM output looks like: indistinguishable from human output.
Your examples are perhaps false equivalencies. Take asbestos: we didn't abolish insulation, we developed better, safer insulation. We didn't stop dyeing food, we just developed safer dyes, etc.
@Tekchip @Gargron the tiny potential for very rare good outcomes are not worth the constant poisoning of humanity's collective information corpus.
For every piece of "good" generated content there are tens of thousands of pieces of terrible slop that are difficult to separate from genuinely useful information or material when doing research, code reviews, etc.
Not to mention that these "good" outcomes are much costlier to humanity than creating by hand, with no benefit.
-
@Tekchip
so... is this a slop account? am i tooting with cheapgpt? are you a human playing with toys you do not comprehend?
dear dogs, may i have the confidence of a mediocre "white" man.
so... l.l.m.s tokenize english text... and then calculate an average.
humans making shitty art is qualitatively perfection in comparison to word salad from a calculator. when you enter this into wannabe deep seek... i will be waiting with bated breath for the token response. ; )
-
@Gargron where is the perceptron
-
@df No, this is marketing. OpenAI, Google, Anthropic & co. want you to believe that what they're doing is artificial intelligence. My professional opinion is that LLMs are a dead-end technology for creating actual intelligence. And if any of those companies did create actual intelligence for the purposes they pursue, it would be slavery, for which I cannot advocate.
@Gargron they'll never create intelligence, because intelligence requires will and they do not understand will. they don't even possess one of their own: their own behaviour is driven by feelings and shaped by a commercial playbook. there is zero chance they will ever create intelligence.
-
@Tekchip
There's no point in explaining if you don't get "this", tbh.
-
I have the impression that primarily anglophone people don't read as much translated literature, because so much good literature already exists in their language, so this issue may not be as familiar within that demographic. As someone who did not grow up anglophone, I can tell you there is a world of difference between a good and a bad translation even when done by humans. Machine translations are not even on the scale.
@Gargron there is a great essay on translation by Simon Leys
-
@Kiloku @Gargron the problem is you want to assume they are rare outcomes. I don't believe they are. Unfortunately, that's where we're at an impasse: it's literally impossible to measure the good outcomes.
I agree the environmental outcome is terrible. I don't like that part. What we can look forward to is the technology improving. General computers used to use WAY more power than they do now. The same is going to happen with LLM technology, hopefully sooner rather than later. Folks are working on it.
-
@Tekchip @Gargron (also, most of what "AI" boosters *think* is good generated content is actually laughably bad to anyone who knows the subject matter. I'm certain you've shared something you thought was indistinguishable from human-created content, and other people saw a bunch of problems as soon as they examined it beyond a cursory glance)
-
@melioristicmarie @ClipHead @Gargron lol are you an LLM, or do you just not care to review my profile? Shoot, even do a Google search. I'm easy to find. Seems like you've lost the plot.
-
True story: I wanted to read the novel "The Hunchback of Notre Dame" by Victor Hugo some years back, so I went to the bookstore and they had two translations. The first had a serious-looking cover and the other had a trashy-looking one, so naturally I bought the former. Started to read it. It was garbage! So I went back and exchanged for the trashy-looking book. A wonderful translation!
Moral of the story: you can't judge a book by its cover.
Also, translation is art.
@jawarajabbi @Gargron Similarly, I've read two different translations of Les Miserables and fragments of several others, and they're drastically different, despite all being professional human translators working from the same source text and translating it to the same language.
(The oldest ones are really awkward to read now. They're also old enough to be in the public domain, so every random set of Serious Classic Books is going to print one of the 1860s or 1880s versions instead of a more modern translation they'd have to pay royalties for.)
