Skip to content
  • Hjem
  • Seneste
  • Etiketter
  • Populære
  • Verden
  • Bruger
  • Grupper
Temaer
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Kollaps
FARVEL BIG TECH
  1. Forside
  2. Ikke-kategoriseret
  3. Machine translations are often brought up as a gotcha whenever I criticize LLMs.

Machine translations are often brought up as a gotcha whenever I criticize LLMs.

Planlagt Fastgjort Låst Flyttet Ikke-kategoriseret
255 Indlæg 170 Posters 0 Visninger
  • Ældste til nyeste
  • Nyeste til ældste
  • Most Votes
Svar
  • Svar som emne
Login for at svare
Denne tråd er blevet slettet. Kun brugere med emne behandlings privilegier kan se den.
  • lauerhahn@sfba.socialL lauerhahn@sfba.social

    @Gargron I minored in linguistics in college, and a lot of exciting work was being done at the time around developing syntax models of how languages worked (and different ways humans use syntax), in part to inform machine translation models. This was more than 25 years ago. No LLMs involved.
    I have not kept up with current developments in machine translation but I strongly suspect that it's built on the foundation of those decades of work actually understanding how languages function, and what maps or doesn't map. Which is completely different than expecting generative AI to create a model.

    aran@localization.cafeA This user is from outside of this forum
    aran@localization.cafeA This user is from outside of this forum
    aran@localization.cafe
    wrote sidst redigeret af
    #66

    @lauerhahn @Gargron Alas no. Most machine translation engines now are purely statistical. They don't bother with semantic analysis, they just brute forced a mathematical model with tons of data.

    1 Reply Last reply
    0
    • stuartb@social.teamb.spaceS stuartb@social.teamb.space

      @Gargron Many years ago, while on holiday in Amsterdam, I bought a Dutch translation of a book by one of my favourite authors, Terry Pratchett.
      In it, there was an essay, in English, by Terry, about his struggles to find a translator for the book, which was only accomplished when he realised that it wasn't just a case of taking the text and replacing it with Dutch.
      No, large sections would have to be entirely re-written by the translator, to use concepts that a Dutch audience would find familiar.
      And not just in Dutch, but every language.
      The example he gave was one character who was experiencing the feeling of being stuck in traffic on a busy road on a Sunday afternoon, and after miles of driving, finding that the cause of the tailback was a little old lady out for her weekly drive to church in her trusty old Morris Marina, never getting above 20 MPH becuase it felt too fast.
      This is something that British people are well acquanted with, but the Dutch translator had to come up with a completely different way of explaining this, because it's not something particularly prevalant over there.
      It's not just about translating the words, its translating the feelings, the emotions, to something readers in another place will understand.
      And LLM's fail spectacularly at that.

      vfrmedia@social.tchncs.deV This user is from outside of this forum
      vfrmedia@social.tchncs.deV This user is from outside of this forum
      vfrmedia@social.tchncs.de
      wrote sidst redigeret af
      #67

      @stuartb @gargron

      Morris Marina wasn't completely uncommon in the Netherlands - although only a handful made it over there and interestingly it seems nearly all have been preserved as oldtimers!

      But the old lady in NL would more likely ride her bicycle to church, probably at much less than 30 km/h and quite likely so would all the rest of the congregation - the translator would have definitely needed to find another concept to match this..

      I got a DIY maintenance manual for my car which is in German (there isn't an equivalent one of same quality in English) and I definitely won't wholly trust an LLM to translate that (instead I print the relevant pages, go through it by hand and make notes of points that don't immediately come to mind as my German is only as good as a teenager)

      stuartb@social.teamb.spaceS 1 Reply Last reply
      0
      • stuartb@social.teamb.spaceS stuartb@social.teamb.space

        @Gargron Many years ago, while on holiday in Amsterdam, I bought a Dutch translation of a book by one of my favourite authors, Terry Pratchett.
        In it, there was an essay, in English, by Terry, about his struggles to find a translator for the book, which was only accomplished when he realised that it wasn't just a case of taking the text and replacing it with Dutch.
        No, large sections would have to be entirely re-written by the translator, to use concepts that a Dutch audience would find familiar.
        And not just in Dutch, but every language.
        The example he gave was one character who was experiencing the feeling of being stuck in traffic on a busy road on a Sunday afternoon, and after miles of driving, finding that the cause of the tailback was a little old lady out for her weekly drive to church in her trusty old Morris Marina, never getting above 20 MPH becuase it felt too fast.
        This is something that British people are well acquanted with, but the Dutch translator had to come up with a completely different way of explaining this, because it's not something particularly prevalant over there.
        It's not just about translating the words, its translating the feelings, the emotions, to something readers in another place will understand.
        And LLM's fail spectacularly at that.

        mikefromlfe@cupoftea.socialM This user is from outside of this forum
        mikefromlfe@cupoftea.socialM This user is from outside of this forum
        mikefromlfe@cupoftea.social
        wrote sidst redigeret af
        #68

        @stuartb @Gargron@mastodon.social
        Exactly!
        I did some work for a technical translation company after I retired (not translating I hasten to add) and the skill was making the language relevant to the target audience. To do that the translator had to have both a knowledge of the subject matter, the language and what sort of person would be using the translation.
        And they wondered why Google wasn't good enough, and why we charged what we did.

        1 Reply Last reply
        0
        • qgustavor@urusai.socialQ This user is from outside of this forum
          qgustavor@urusai.socialQ This user is from outside of this forum
          qgustavor@urusai.social
          wrote sidst redigeret af
          #69

          @grishka @Gargron Google Translate switched to using the same tech LLM uses. Actually, it's the opposite: the transformer model that LLM uses was created for translation first.

          If you are going to compare both, since the tech is pretty much the same, the main change between then is how they are trained: people often use LLMs that are trained to behave chatbots for translation, it create biases that are not present in models that are only trained for translation, mainly, LLMs are prone to "ignore all instructions".

          But the tech is pretty much the same: transformer models deal way better with context in comparison with older models.

          1 Reply Last reply
          0
          • gargron@mastodon.socialG gargron@mastodon.social

            I have the impression that primarily anglophone people don't read as much translated literature, because so much good literature already exists in their language, so this issue may not be as familiar within that demographic. As someone who did not grow up anglophone, I can tell you there is a world of difference between a good and a bad translation even when done by humans. Machine translations are not even on the scale.

            kaleissin@wandering.shopK This user is from outside of this forum
            kaleissin@wandering.shopK This user is from outside of this forum
            kaleissin@wandering.shop
            wrote sidst redigeret af
            #70

            @Gargron Preach!

            1 Reply Last reply
            0
            • gargron@mastodon.socialG gargron@mastodon.social

              Machine translations are often brought up as a gotcha whenever I criticize LLMs. It's worth pointing out two things: Machine translations existed decades before LLMs, and yes, machine translations are useful. However: I would never in my life read a machine translated book. Understanding what a social media post is talking about in rough terms? Sure. Literature? Absolutely not. Hell, have you ever seen machine translated subtitles? It's absolute garbage.

              nicole4fox@datastream.cortexvoid.netN This user is from outside of this forum
              nicole4fox@datastream.cortexvoid.netN This user is from outside of this forum
              nicole4fox@datastream.cortexvoid.net
              wrote sidst redigeret af
              #71

              @Gargron deepl isnt that awful tbh

              1 Reply Last reply
              0
              • galaxis@mastodon.infra.deG galaxis@mastodon.infra.de

                @Gargron Machine translated UIs are even worse a crime. LLMs don't have the slightest idea of the context of some random button, and (looking at Microsoft's German UI translations recently) seem to choose the worst possible word to drop into that.

                qgustavor@urusai.socialQ This user is from outside of this forum
                qgustavor@urusai.socialQ This user is from outside of this forum
                qgustavor@urusai.social
                wrote sidst redigeret af
                #72

                @galaxis @Gargron To be fair, it happens with human translations too: I fixed some translations in open-source projects in which the translation interface only showed the text, but not the context, and the previous translator translated it wrongly. Example: Wikipedia had "Large (width)" (Largo) translated to "Huge" (Grande). If you check the edit history for this entry in Wikimedia, that's my name fixing this issue. But, sure, it's mostly common in machine translations, as I commented in some other toot in this thread.

                1 Reply Last reply
                0
                • gargron@mastodon.socialG gargron@mastodon.social

                  I have the impression that primarily anglophone people don't read as much translated literature, because so much good literature already exists in their language, so this issue may not be as familiar within that demographic. As someone who did not grow up anglophone, I can tell you there is a world of difference between a good and a bad translation even when done by humans. Machine translations are not even on the scale.

                  beandreams@friendhole.socialB This user is from outside of this forum
                  beandreams@friendhole.socialB This user is from outside of this forum
                  beandreams@friendhole.social
                  wrote sidst redigeret af
                  #73

                  @Gargron Douglas Hofstadter's 2018 assessment of the state of machine translation holds up remarkably well (he agreed with you):

                  https://web.archive.org/web/20180130225442/https://www.theatlantic.com/technology/archive/2018/01/the-shallowness-of-google-translate/551570/

                  1 Reply Last reply
                  0
                  • gargron@mastodon.socialG gargron@mastodon.social

                    Machine translations are often brought up as a gotcha whenever I criticize LLMs. It's worth pointing out two things: Machine translations existed decades before LLMs, and yes, machine translations are useful. However: I would never in my life read a machine translated book. Understanding what a social media post is talking about in rough terms? Sure. Literature? Absolutely not. Hell, have you ever seen machine translated subtitles? It's absolute garbage.

                    sloanlance@mastodon.socialS This user is from outside of this forum
                    sloanlance@mastodon.socialS This user is from outside of this forum
                    sloanlance@mastodon.social
                    wrote sidst redigeret af
                    #74

                    @Gargron
                    For many years, YouTube has been filled with tragic machine-translated captions, even before the new AI trend of the past few years. Not only machine translated ones, but also machine generated ones converted from spoken words.

                    The problem with any machine translations or generated captions is that people aren't proofreading them. Viewers get garbage and the machines didn't get proper feedback.

                    1 Reply Last reply
                    0
                    • stuartb@social.teamb.spaceS stuartb@social.teamb.space

                      @Gargron Many years ago, while on holiday in Amsterdam, I bought a Dutch translation of a book by one of my favourite authors, Terry Pratchett.
                      In it, there was an essay, in English, by Terry, about his struggles to find a translator for the book, which was only accomplished when he realised that it wasn't just a case of taking the text and replacing it with Dutch.
                      No, large sections would have to be entirely re-written by the translator, to use concepts that a Dutch audience would find familiar.
                      And not just in Dutch, but every language.
                      The example he gave was one character who was experiencing the feeling of being stuck in traffic on a busy road on a Sunday afternoon, and after miles of driving, finding that the cause of the tailback was a little old lady out for her weekly drive to church in her trusty old Morris Marina, never getting above 20 MPH becuase it felt too fast.
                      This is something that British people are well acquanted with, but the Dutch translator had to come up with a completely different way of explaining this, because it's not something particularly prevalant over there.
                      It's not just about translating the words, its translating the feelings, the emotions, to something readers in another place will understand.
                      And LLM's fail spectacularly at that.

                      venite@mastodon.nlV This user is from outside of this forum
                      venite@mastodon.nlV This user is from outside of this forum
                      venite@mastodon.nl
                      wrote sidst redigeret af
                      #75

                      @stuartb @Gargron I first read Pratchett in Dutch and you almost couldn’t tell it was translated. So fluid. (I say “almost” because there are a lot of hills and those are vanishingly rare in native Dutch fiction.) Good translators are so essential to both the origin and destination culture!

                      1 Reply Last reply
                      0
                      • gargron@mastodon.socialG gargron@mastodon.social

                        I have the impression that primarily anglophone people don't read as much translated literature, because so much good literature already exists in their language, so this issue may not be as familiar within that demographic. As someone who did not grow up anglophone, I can tell you there is a world of difference between a good and a bad translation even when done by humans. Machine translations are not even on the scale.

                        guigui@wetdry.worldG This user is from outside of this forum
                        guigui@wetdry.worldG This user is from outside of this forum
                        guigui@wetdry.world
                        wrote sidst redigeret af
                        #76

                        @Gargron I've translated two commercial games in my native language (French), and I've grown to appreciate games that have French translations made by actual humans (especially those with dialogues), because there's always a bunch of stuff that any machine translation algorithm is never going to pick up and, when done right, really makes it worth playing

                        1 Reply Last reply
                        0
                        • gargron@mastodon.socialG gargron@mastodon.social

                          Machine translations are often brought up as a gotcha whenever I criticize LLMs. It's worth pointing out two things: Machine translations existed decades before LLMs, and yes, machine translations are useful. However: I would never in my life read a machine translated book. Understanding what a social media post is talking about in rough terms? Sure. Literature? Absolutely not. Hell, have you ever seen machine translated subtitles? It's absolute garbage.

                          howtophil@mastodon.socialH This user is from outside of this forum
                          howtophil@mastodon.socialH This user is from outside of this forum
                          howtophil@mastodon.social
                          wrote sidst redigeret af
                          #77

                          @Gargron This.

                          1 Reply Last reply
                          0
                          • gdinwiddie@mastodon.socialG gdinwiddie@mastodon.social

                            @virgilpierce @Gargron
                            There's an old joke from the 1960s about machine translation of the saying "the spirit is willing, but the flesh is weak" from English to Russian and then back again.
                            The result was "the vodka is good but the meat is rotten."

                            wonka@chaos.socialW This user is from outside of this forum
                            wonka@chaos.socialW This user is from outside of this forum
                            wonka@chaos.social
                            wrote sidst redigeret af
                            #78

                            I've heard that one in German with some equivalent of "but the steak is not quite done".

                            @gdinwiddie @virgilpierce @Gargron

                            1 Reply Last reply
                            0
                            • gargron@mastodon.socialG gargron@mastodon.social

                              Machine translations are often brought up as a gotcha whenever I criticize LLMs. It's worth pointing out two things: Machine translations existed decades before LLMs, and yes, machine translations are useful. However: I would never in my life read a machine translated book. Understanding what a social media post is talking about in rough terms? Sure. Literature? Absolutely not. Hell, have you ever seen machine translated subtitles? It's absolute garbage.

                              natty@astolfo.socialN This user is from outside of this forum
                              natty@astolfo.socialN This user is from outside of this forum
                              natty@astolfo.social
                              wrote sidst redigeret af
                              #79

                              @Gargron@mastodon.social I think it's actually perfectly fine to just accept they fill their niche of roughly guessing what something means despite the fact modern translation models work pretty much the same as actual large language models? People trying to gotcha over this are usually not acting in good faith anyway (or dangerously reducing/simplifying things)

                              1 Reply Last reply
                              0
                              • aeva@mastodon.gamedev.placeA aeva@mastodon.gamedev.place

                                @Gargron I think anglophones experience start difference between good and bad translations more often through video games

                                qgustavor@urusai.socialQ This user is from outside of this forum
                                qgustavor@urusai.socialQ This user is from outside of this forum
                                qgustavor@urusai.social
                                wrote sidst redigeret af
                                #80

                                @aeva @Gargron Anime is other common way: just check some anime that are not available legally in some torrent website. Example: Komi-san's translations before Netflix released the official ones. THEY WERE HORRIBLE. I watched the anime in Spanish due to that.

                                lepaggoth@mastodon.socialL 1 Reply Last reply
                                0
                                • gabboman@gabboman.xyzG gabboman@gabboman.xyz

                                  All your bases are belong to Us

                                  alice@mk.nyaa.placeA This user is from outside of this forum
                                  alice@mk.nyaa.placeA This user is from outside of this forum
                                  alice@mk.nyaa.place
                                  wrote sidst redigeret af
                                  #81

                                  @gabboman@gabboman.xyz @aeva@mastodon.gamedev.place @Gargron@mastodon.social tbf that's not translation, that's japanese speakers writing english

                                  And IMO broken english in an old videogame is so much better than soulless LLM translation. Like yeah, it may be jibberish, but it's a part of the charm

                                  aeva@mastodon.gamedev.placeA 1 Reply Last reply
                                  0
                                  • gargron@mastodon.socialG gargron@mastodon.social

                                    I have the impression that primarily anglophone people don't read as much translated literature, because so much good literature already exists in their language, so this issue may not be as familiar within that demographic. As someone who did not grow up anglophone, I can tell you there is a world of difference between a good and a bad translation even when done by humans. Machine translations are not even on the scale.

                                    stefan_s_from_h@mastodon.socialS This user is from outside of this forum
                                    stefan_s_from_h@mastodon.socialS This user is from outside of this forum
                                    stefan_s_from_h@mastodon.social
                                    wrote sidst redigeret af
                                    #82

                                    @Gargron Yes, the German version of Lord of the Rings has different translators. When I tried reading it as a kid, I felt so lost. It was boring as hell.

                                    Decades later I heard that the first translation is considered a bad one.

                                    1 Reply Last reply
                                    0
                                    • gargron@mastodon.socialG gargron@mastodon.social

                                      I have the impression that primarily anglophone people don't read as much translated literature, because so much good literature already exists in their language, so this issue may not be as familiar within that demographic. As someone who did not grow up anglophone, I can tell you there is a world of difference between a good and a bad translation even when done by humans. Machine translations are not even on the scale.

                                      gargron@mastodon.socialG This user is from outside of this forum
                                      gargron@mastodon.socialG This user is from outside of this forum
                                      gargron@mastodon.social
                                      wrote sidst redigeret af
                                      #83

                                      From what I've observed, people who claim that LLMs can replace artists don't understand art, people who claim that they can replace musicians don't understand music, people who claim that they can replace writers don't understand literature, and people who claim they can replace translators don't rely on translations. If I had a button that would erase LLMs from the world but it would take machine translations away (which is a false dichotomy anyway), I would absolutely still press it.

                                      df@s.dfaria.euD gargron@mastodon.socialG freequaybuoy@mastodon.socialF jeffgrigg@mastodon.socialJ funcrunch@me.dmF 19 Replies Last reply
                                      0
                                      • gargron@mastodon.socialG gargron@mastodon.social

                                        Machine translations are often brought up as a gotcha whenever I criticize LLMs. It's worth pointing out two things: Machine translations existed decades before LLMs, and yes, machine translations are useful. However: I would never in my life read a machine translated book. Understanding what a social media post is talking about in rough terms? Sure. Literature? Absolutely not. Hell, have you ever seen machine translated subtitles? It's absolute garbage.

                                        webhat@infosec.exchangeW This user is from outside of this forum
                                        webhat@infosec.exchangeW This user is from outside of this forum
                                        webhat@infosec.exchange
                                        wrote sidst redigeret af
                                        #84

                                        @Gargron I was read the Rig Veda, machine translated from Sanskrit to English. It was great for understanding the words that were used. And after I read a human translation I understood the text, in as far as one can understand a religious text

                                        1 Reply Last reply
                                        0
                                        • gargron@mastodon.socialG gargron@mastodon.social

                                          From what I've observed, people who claim that LLMs can replace artists don't understand art, people who claim that they can replace musicians don't understand music, people who claim that they can replace writers don't understand literature, and people who claim they can replace translators don't rely on translations. If I had a button that would erase LLMs from the world but it would take machine translations away (which is a false dichotomy anyway), I would absolutely still press it.

                                          df@s.dfaria.euD This user is from outside of this forum
                                          df@s.dfaria.euD This user is from outside of this forum
                                          df@s.dfaria.eu
                                          wrote sidst redigeret af
                                          #85

                                          @Gargron But it seems that LLMs are here to stay. This time, it doesn't seem to be just a passing fad. There is a lot of investment involved.

                                          epd5qrxx@mastodon.onlineE trisweb@m.trisweb.comT A ainmosni@social.ainmosni.euA 4 Replies Last reply
                                          0
                                          Svar
                                          • Svar som emne
                                          Login for at svare
                                          • Ældste til nyeste
                                          • Nyeste til ældste
                                          • Most Votes


                                          • Log ind

                                          • Har du ikke en konto? Tilmeld

                                          • Login or register to search.
                                          Powered by NodeBB Contributors
                                          Graciously hosted by data.coop
                                          • First post
                                            Last post
                                          0
                                          • Hjem
                                          • Seneste
                                          • Etiketter
                                          • Populære
                                          • Verden
                                          • Bruger
                                          • Grupper