Machine translations are often brought up as a gotcha whenever I criticize LLMs.
-
@grishka @Gargron Google Translate switched to using the same tech LLM uses. Actually, it's the opposite: the transformer model that LLM uses was created for translation first.
If you are going to compare both, since the tech is pretty much the same, the main change between then is how they are trained: people often use LLMs that are trained to behave chatbots for translation, it create biases that are not present in models that are only trained for translation, mainly, LLMs are prone to "ignore all instructions".
But the tech is pretty much the same: transformer models deal way better with context in comparison with older models.
-
I have the impression that primarily anglophone people don't read as much translated literature, because so much good literature already exists in their language, so this issue may not be as familiar within that demographic. As someone who did not grow up anglophone, I can tell you there is a world of difference between a good and a bad translation even when done by humans. Machine translations are not even on the scale.
@Gargron Preach!
-
Machine translations are often brought up as a gotcha whenever I criticize LLMs. It's worth pointing out two things: Machine translations existed decades before LLMs, and yes, machine translations are useful. However: I would never in my life read a machine translated book. Understanding what a social media post is talking about in rough terms? Sure. Literature? Absolutely not. Hell, have you ever seen machine translated subtitles? It's absolute garbage.
@Gargron deepl isnt that awful tbh
-
@Gargron Machine translated UIs are even worse a crime. LLMs don't have the slightest idea of the context of some random button, and (looking at Microsoft's German UI translations recently) seem to choose the worst possible word to drop into that.
@galaxis @Gargron To be fair, it happens with human translations too: I fixed some translations in open-source projects in which the translation interface only showed the text, but not the context, and the previous translator translated it wrongly. Example: Wikipedia had "Large (width)" (Largo) translated to "Huge" (Grande). If you check the edit history for this entry in Wikimedia, that's my name fixing this issue. But, sure, it's mostly common in machine translations, as I commented in some other toot in this thread.
-
I have the impression that primarily anglophone people don't read as much translated literature, because so much good literature already exists in their language, so this issue may not be as familiar within that demographic. As someone who did not grow up anglophone, I can tell you there is a world of difference between a good and a bad translation even when done by humans. Machine translations are not even on the scale.
@Gargron Douglas Hofstadter's 2018 assessment of the state of machine translation holds up remarkably well (he agreed with you):
-
Machine translations are often brought up as a gotcha whenever I criticize LLMs. It's worth pointing out two things: Machine translations existed decades before LLMs, and yes, machine translations are useful. However: I would never in my life read a machine translated book. Understanding what a social media post is talking about in rough terms? Sure. Literature? Absolutely not. Hell, have you ever seen machine translated subtitles? It's absolute garbage.
@Gargron
For many years, YouTube has been filled with tragic machine-translated captions, even before the new AI trend of the past few years. Not only machine translated ones, but also machine generated ones converted from spoken words.The problem with any machine translations or generated captions is that people aren't proofreading them. Viewers get garbage and the machines didn't get proper feedback.
-
@Gargron Many years ago, while on holiday in Amsterdam, I bought a Dutch translation of a book by one of my favourite authors, Terry Pratchett.
In it, there was an essay, in English, by Terry, about his struggles to find a translator for the book, which was only accomplished when he realised that it wasn't just a case of taking the text and replacing it with Dutch.
No, large sections would have to be entirely re-written by the translator, to use concepts that a Dutch audience would find familiar.
And not just in Dutch, but every language.
The example he gave was one character who was experiencing the feeling of being stuck in traffic on a busy road on a Sunday afternoon, and after miles of driving, finding that the cause of the tailback was a little old lady out for her weekly drive to church in her trusty old Morris Marina, never getting above 20 MPH becuase it felt too fast.
This is something that British people are well acquanted with, but the Dutch translator had to come up with a completely different way of explaining this, because it's not something particularly prevalant over there.
It's not just about translating the words, its translating the feelings, the emotions, to something readers in another place will understand.
And LLM's fail spectacularly at that. -
I have the impression that primarily anglophone people don't read as much translated literature, because so much good literature already exists in their language, so this issue may not be as familiar within that demographic. As someone who did not grow up anglophone, I can tell you there is a world of difference between a good and a bad translation even when done by humans. Machine translations are not even on the scale.
@Gargron I've translated two commercial games in my native language (French), and I've grown to appreciate games that have French translations made by actual humans (especially those with dialogues), because there's always a bunch of stuff that any machine translation algorithm is never going to pick up and, when done right, really makes it worth playing
-
Machine translations are often brought up as a gotcha whenever I criticize LLMs. It's worth pointing out two things: Machine translations existed decades before LLMs, and yes, machine translations are useful. However: I would never in my life read a machine translated book. Understanding what a social media post is talking about in rough terms? Sure. Literature? Absolutely not. Hell, have you ever seen machine translated subtitles? It's absolute garbage.
@Gargron This.
-
@virgilpierce @Gargron
There's an old joke from the 1960s about machine translation of the saying "the spirit is willing, but the flesh is weak" from English to Russian and then back again.
The result was "the vodka is good but the meat is rotten."I've heard that one in German with some equivalent of "but the steak is not quite done".
-
Machine translations are often brought up as a gotcha whenever I criticize LLMs. It's worth pointing out two things: Machine translations existed decades before LLMs, and yes, machine translations are useful. However: I would never in my life read a machine translated book. Understanding what a social media post is talking about in rough terms? Sure. Literature? Absolutely not. Hell, have you ever seen machine translated subtitles? It's absolute garbage.
@Gargron@mastodon.social I think it's actually perfectly fine to just accept they fill their niche of roughly guessing what something means despite the fact modern translation models work pretty much the same as actual large language models? People trying to gotcha over this are usually not acting in good faith anyway (or dangerously reducing/simplifying things)
-
@Gargron I think anglophones experience start difference between good and bad translations more often through video games
-
All your bases are belong to Us
@gabboman@gabboman.xyz @aeva@mastodon.gamedev.place @Gargron@mastodon.social tbf that's not translation, that's japanese speakers writing english
And IMO broken english in an old videogame is so much better than soulless LLM translation. Like yeah, it may be jibberish, but it's a part of the charm -
I have the impression that primarily anglophone people don't read as much translated literature, because so much good literature already exists in their language, so this issue may not be as familiar within that demographic. As someone who did not grow up anglophone, I can tell you there is a world of difference between a good and a bad translation even when done by humans. Machine translations are not even on the scale.
@Gargron Yes, the German version of Lord of the Rings has different translators. When I tried reading it as a kid, I felt so lost. It was boring as hell.
Decades later I heard that the first translation is considered a bad one.
-
I have the impression that primarily anglophone people don't read as much translated literature, because so much good literature already exists in their language, so this issue may not be as familiar within that demographic. As someone who did not grow up anglophone, I can tell you there is a world of difference between a good and a bad translation even when done by humans. Machine translations are not even on the scale.
From what I've observed, people who claim that LLMs can replace artists don't understand art, people who claim that they can replace musicians don't understand music, people who claim that they can replace writers don't understand literature, and people who claim they can replace translators don't rely on translations. If I had a button that would erase LLMs from the world but it would take machine translations away (which is a false dichotomy anyway), I would absolutely still press it.
-
Machine translations are often brought up as a gotcha whenever I criticize LLMs. It's worth pointing out two things: Machine translations existed decades before LLMs, and yes, machine translations are useful. However: I would never in my life read a machine translated book. Understanding what a social media post is talking about in rough terms? Sure. Literature? Absolutely not. Hell, have you ever seen machine translated subtitles? It's absolute garbage.
@Gargron I was read the Rig Veda, machine translated from Sanskrit to English. It was great for understanding the words that were used. And after I read a human translation I understood the text, in as far as one can understand a religious text
-
From what I've observed, people who claim that LLMs can replace artists don't understand art, people who claim that they can replace musicians don't understand music, people who claim that they can replace writers don't understand literature, and people who claim they can replace translators don't rely on translations. If I had a button that would erase LLMs from the world but it would take machine translations away (which is a false dichotomy anyway), I would absolutely still press it.
@Gargron But it seems that LLMs are here to stay. This time, it doesn't seem to be just a passing fad. There is a lot of investment involved.
-
From what I've observed, people who claim that LLMs can replace artists don't understand art, people who claim that they can replace musicians don't understand music, people who claim that they can replace writers don't understand literature, and people who claim they can replace translators don't rely on translations. If I had a button that would erase LLMs from the world but it would take machine translations away (which is a false dichotomy anyway), I would absolutely still press it.
Technology is not inevitable. We've decided not to have asbestos in our walls, lead in our pipes, or carginogenic chemicals in our food. (If you're going to argue that it's not everywhere, where would you rather live?) We could just not do LLMs. It's allowed.
-
Machine translations are often brought up as a gotcha whenever I criticize LLMs. It's worth pointing out two things: Machine translations existed decades before LLMs, and yes, machine translations are useful. However: I would never in my life read a machine translated book. Understanding what a social media post is talking about in rough terms? Sure. Literature? Absolutely not. Hell, have you ever seen machine translated subtitles? It's absolute garbage.
i do appreciate automatic subtitles extremely for hitting all my humor-chords. may they never evolve.
that being said:
i am lucky and able to read in several languages and read a lot of our bookclub books in original language. i can't count how many times i liked books that many of the others couldn't even finish their translated ones (assumedly) because the language was so poor. (and then we have those who listen to books and it totally depends on the person who was recorded. -
Technology is not inevitable. We've decided not to have asbestos in our walls, lead in our pipes, or carginogenic chemicals in our food. (If you're going to argue that it's not everywhere, where would you rather live?) We could just not do LLMs. It's allowed.
@Gargron It is a technology that humanity has been seeking for a long time. At least since the 1950s, with Turing and his colleagues.