Skip to content
  • Hjem
  • Seneste
  • Etiketter
  • Populære
  • Verden
  • Bruger
  • Grupper
Temaer
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Kollaps
FARVEL BIG TECH
  1. Forside
  2. Ikke-kategoriseret
  3. Worth looking at both the quoted text here and •especially• the linked page, which is quite good.

Worth looking at both the quoted text here and •especially• the linked page, which is quite good.

Planlagt Fastgjort Låst Flyttet Ikke-kategoriseret
45 Indlæg 22 Posters 0 Visninger
  • Ældste til nyeste
  • Nyeste til ældste
  • Most Votes
Svar
  • Svar som emne
Login for at svare
Denne tråd er blevet slettet. Kun brugere med emne behandlings privilegier kan se den.
  • inthehands@hachyderm.ioI inthehands@hachyderm.io

    RE: https://unstable.systems/@jneen/116618931097778342

    Worth looking at both the quoted text here and •especially• the linked page, which is quite good.

    I’ll add another item of my own. The first screenshot mentions giving an LLM the task of “implementing an HTTP server in JavaScript from scratch” in 90 minutes. Sounds impressive, right? Until you remember that every open-source Javascript HTTP server in existence ••was in the training data••.

    1/

    inthehands@hachyderm.ioI This user is from outside of this forum
    inthehands@hachyderm.ioI This user is from outside of this forum
    inthehands@hachyderm.io
    wrote sidst redigeret af
    #2

    IT WAS IN THE TRAINING DATA. Your test gave the machine a pile of correct answers and free license to plagiarize.

    I remember people being wowed that Claude Code could implement a complete C compiler. But somehow it doesn’t sound quite as impressive when you phrase it as “given every existing C compiler as input, the LLM can produce a C compiler as output.”

    2/

    galbinuscaeli@spacey.spaceG inthehands@hachyderm.ioI wildeyedboyfromfreecloud@masto.aiW lily_and_frog@mastodon.artL stevewfolds@mastodon.worldS 9 Replies Last reply
    0
    • inthehands@hachyderm.ioI inthehands@hachyderm.io

      IT WAS IN THE TRAINING DATA. Your test gave the machine a pile of correct answers and free license to plagiarize.

      I remember people being wowed that Claude Code could implement a complete C compiler. But somehow it doesn’t sound quite as impressive when you phrase it as “given every existing C compiler as input, the LLM can produce a C compiler as output.”

      2/

      galbinuscaeli@spacey.spaceG This user is from outside of this forum
      galbinuscaeli@spacey.spaceG This user is from outside of this forum
      galbinuscaeli@spacey.space
      wrote sidst redigeret af
      #3

      @inthehands Given pictures of a giraffe, a rhinoceros, an elephant, and a squirrel, find the squirrel. Feel free to reference dictionaries, encyclopedias, nature documentaries and previous responses to this same question.

      inthehands@hachyderm.ioI 1 Reply Last reply
      0
      • inthehands@hachyderm.ioI inthehands@hachyderm.io

        RE: https://unstable.systems/@jneen/116618931097778342

        Worth looking at both the quoted text here and •especially• the linked page, which is quite good.

        I’ll add another item of my own. The first screenshot mentions giving an LLM the task of “implementing an HTTP server in JavaScript from scratch” in 90 minutes. Sounds impressive, right? Until you remember that every open-source Javascript HTTP server in existence ••was in the training data••.

        1/

        michael_w_busch@mastodon.onlineM This user is from outside of this forum
        michael_w_busch@mastodon.onlineM This user is from outside of this forum
        michael_w_busch@mastodon.online
        wrote sidst redigeret af
        #4

        @inthehands

        Back in 2023, OpenAI was hyping ChatGPT by claiming "it can pass the GRE".

        When all it was doing was autocomplete of the answer keys that had been used as input.

        inthehands@hachyderm.ioI 1 Reply Last reply
        0
        • inthehands@hachyderm.ioI inthehands@hachyderm.io

          RE: https://unstable.systems/@jneen/116618931097778342

          Worth looking at both the quoted text here and •especially• the linked page, which is quite good.

          I’ll add another item of my own. The first screenshot mentions giving an LLM the task of “implementing an HTTP server in JavaScript from scratch” in 90 minutes. Sounds impressive, right? Until you remember that every open-source Javascript HTTP server in existence ••was in the training data••.

          1/

          walrus@toot.walesW This user is from outside of this forum
          walrus@toot.walesW This user is from outside of this forum
          walrus@toot.wales
          wrote sidst redigeret af
          #5

          @inthehands

          Once upon a time, a boss came and tried to hurry me into giving him some code I was working on.

          Me: Do you want it now, or do you want it right?

          Him: Good point! When it's ready.

          1 Reply Last reply
          0
          • inthehands@hachyderm.ioI inthehands@hachyderm.io

            IT WAS IN THE TRAINING DATA. Your test gave the machine a pile of correct answers and free license to plagiarize.

            I remember people being wowed that Claude Code could implement a complete C compiler. But somehow it doesn’t sound quite as impressive when you phrase it as “given every existing C compiler as input, the LLM can produce a C compiler as output.”

            2/

            inthehands@hachyderm.ioI This user is from outside of this forum
            inthehands@hachyderm.ioI This user is from outside of this forum
            inthehands@hachyderm.io
            wrote sidst redigeret af
            #6

            It’s just •astonishing• how many eye-popping stories of LLMs doing amazing omg-verge-of-magical-superintelligence things turn out to be just unvarnished plagiarism.

            In the first months after ChatGPT’s release, I remember a French dept colleague being amazed that GPT could translate and summarize a passage of Le Petit Prince.

            It was a lot less impressive when I dug up the 2 or 3 online passages which it had copied almost verbatim and stitched together (sprinkling in a couple of extra words that made it less accurate).

            3/

            inthehands@hachyderm.ioI datarama@hachyderm.ioD 2 Replies Last reply
            0
            • inthehands@hachyderm.ioI inthehands@hachyderm.io

              It’s just •astonishing• how many eye-popping stories of LLMs doing amazing omg-verge-of-magical-superintelligence things turn out to be just unvarnished plagiarism.

              In the first months after ChatGPT’s release, I remember a French dept colleague being amazed that GPT could translate and summarize a passage of Le Petit Prince.

              It was a lot less impressive when I dug up the 2 or 3 online passages which it had copied almost verbatim and stitched together (sprinkling in a couple of extra words that made it less accurate).

              3/

              inthehands@hachyderm.ioI This user is from outside of this forum
              inthehands@hachyderm.ioI This user is from outside of this forum
              inthehands@hachyderm.io
              wrote sidst redigeret af
              #7

              To be fair, competent plagiarism is a nontrivial task. It takes skill for a human to do: finding relevant fragments from an input dataset and stitching those fragments together in a well-formed way is not nothing! It’s impressive and interesting that people have figured out how to make machines do it.

              But •that• is not the promise on which investors are valuing the AI megajuggernaut at trillions of dollars.

              4/

              inthehands@hachyderm.ioI 1 Reply Last reply
              0
              • inthehands@hachyderm.ioI inthehands@hachyderm.io

                IT WAS IN THE TRAINING DATA. Your test gave the machine a pile of correct answers and free license to plagiarize.

                I remember people being wowed that Claude Code could implement a complete C compiler. But somehow it doesn’t sound quite as impressive when you phrase it as “given every existing C compiler as input, the LLM can produce a C compiler as output.”

                2/

                wildeyedboyfromfreecloud@masto.aiW This user is from outside of this forum
                wildeyedboyfromfreecloud@masto.aiW This user is from outside of this forum
                wildeyedboyfromfreecloud@masto.ai
                wrote sidst redigeret af
                #8

                @inthehands I couldn't love this perspective anymore if I tried!

                1 Reply Last reply
                0
                • inthehands@hachyderm.ioI inthehands@hachyderm.io

                  IT WAS IN THE TRAINING DATA. Your test gave the machine a pile of correct answers and free license to plagiarize.

                  I remember people being wowed that Claude Code could implement a complete C compiler. But somehow it doesn’t sound quite as impressive when you phrase it as “given every existing C compiler as input, the LLM can produce a C compiler as output.”

                  2/

                  lily_and_frog@mastodon.artL This user is from outside of this forum
                  lily_and_frog@mastodon.artL This user is from outside of this forum
                  lily_and_frog@mastodon.art
                  wrote sidst redigeret af
                  #9

                  @inthehands

                  Many years ago, the following idea spread in the art community: a 15 minutes doodle is the result of years of practice and study. In other words, the 15 minutes doodle took your entire life to make.

                  The same goes to LLMs: it did not take 90 minutes. It took the thousands of hours all the other softwares took to make, plus the years of study of all the programmers making them. And the thousands of hours manually labeling source codes...

                  shiitaketoast@beige.partyS 1 Reply Last reply
                  0
                  • inthehands@hachyderm.ioI inthehands@hachyderm.io

                    It’s just •astonishing• how many eye-popping stories of LLMs doing amazing omg-verge-of-magical-superintelligence things turn out to be just unvarnished plagiarism.

                    In the first months after ChatGPT’s release, I remember a French dept colleague being amazed that GPT could translate and summarize a passage of Le Petit Prince.

                    It was a lot less impressive when I dug up the 2 or 3 online passages which it had copied almost verbatim and stitched together (sprinkling in a couple of extra words that made it less accurate).

                    3/

                    datarama@hachyderm.ioD This user is from outside of this forum
                    datarama@hachyderm.ioD This user is from outside of this forum
                    datarama@hachyderm.io
                    wrote sidst redigeret af
                    #10

                    @inthehands (Completely aside: I sometimes feel like I ought to learn French *just* to be able to read that one in its original language. It is very special to me; it was the first book I ever read myself.

                    Even more completely aside: Someone ought to give Hayao Miyazaki the rights to create an animated interpretation of it, and supply him with however much coffee and pure adrenochrome needed to keep him alive, active and in good spirits for however long that takes. And reward him with whatever he wanted - for example, the exclusive right to hunt Sam Altman for sport.)

                    linza@kamu.socialL temptoetiam@eldritch.cafeT inthehands@hachyderm.ioI 3 Replies Last reply
                    0
                    • inthehands@hachyderm.ioI inthehands@hachyderm.io

                      To be fair, competent plagiarism is a nontrivial task. It takes skill for a human to do: finding relevant fragments from an input dataset and stitching those fragments together in a well-formed way is not nothing! It’s impressive and interesting that people have figured out how to make machines do it.

                      But •that• is not the promise on which investors are valuing the AI megajuggernaut at trillions of dollars.

                      4/

                      inthehands@hachyderm.ioI This user is from outside of this forum
                      inthehands@hachyderm.ioI This user is from outside of this forum
                      inthehands@hachyderm.io
                      wrote sidst redigeret af
                      #11

                      The reasons we consider plagiarism to be cheating in school and malpractice in professional contexts are many of the same reasons that LLMs are not going to replace all knowledge-based human labor. (Details left as an exercise for the reader.)

                      And yes, a whole lot of what LLMs do •would• count as plagiarism if one of my students did it manually, and •should• count as plagiarism just the same if they use a machine to do it — not just in a “that’s cheating!!” sense, but more importantly in a “that’s not really doing the work” sense.

                      5/

                      inthehands@hachyderm.ioI 1 Reply Last reply
                      0
                      • lily_and_frog@mastodon.artL lily_and_frog@mastodon.art

                        @inthehands

                        Many years ago, the following idea spread in the art community: a 15 minutes doodle is the result of years of practice and study. In other words, the 15 minutes doodle took your entire life to make.

                        The same goes to LLMs: it did not take 90 minutes. It took the thousands of hours all the other softwares took to make, plus the years of study of all the programmers making them. And the thousands of hours manually labeling source codes...

                        shiitaketoast@beige.partyS This user is from outside of this forum
                        shiitaketoast@beige.partyS This user is from outside of this forum
                        shiitaketoast@beige.party
                        wrote sidst redigeret af
                        #12

                        @Lily_and_frog @inthehands I think that has been said for plumbing—$1 for the part, $45 for the five minutes it took to install the part, $400 for the years of experience it takes to know which part and how to install it.

                        lily_and_frog@mastodon.artL 1 Reply Last reply
                        0
                        • inthehands@hachyderm.ioI inthehands@hachyderm.io

                          RE: https://unstable.systems/@jneen/116618931097778342

                          Worth looking at both the quoted text here and •especially• the linked page, which is quite good.

                          I’ll add another item of my own. The first screenshot mentions giving an LLM the task of “implementing an HTTP server in JavaScript from scratch” in 90 minutes. Sounds impressive, right? Until you remember that every open-source Javascript HTTP server in existence ••was in the training data••.

                          1/

                          ansuz@gts.cryptography.dogA This user is from outside of this forum
                          ansuz@gts.cryptography.dogA This user is from outside of this forum
                          ansuz@gts.cryptography.dog
                          wrote sidst redigeret af
                          #13

                          @inthehands it annoys me so much when people cite AI benchmarks for exactly this reason.

                          Either the logic of the benchmarks are private (and thus not really fit for judgement) or they're public in a way that effectively makes the supposed challenge of that benchmark into an open-book test.

                          1 Reply Last reply
                          0
                          • inthehands@hachyderm.ioI inthehands@hachyderm.io

                            RE: https://unstable.systems/@jneen/116618931097778342

                            Worth looking at both the quoted text here and •especially• the linked page, which is quite good.

                            I’ll add another item of my own. The first screenshot mentions giving an LLM the task of “implementing an HTTP server in JavaScript from scratch” in 90 minutes. Sounds impressive, right? Until you remember that every open-source Javascript HTTP server in existence ••was in the training data••.

                            1/

                            pgcd@mastodon.onlineP This user is from outside of this forum
                            pgcd@mastodon.onlineP This user is from outside of this forum
                            pgcd@mastodon.online
                            wrote sidst redigeret af
                            #14

                            @inthehands My company houses a number of pro-AI engineers. I asked them to evaluate Claude (which I understand is their favorite) by asking it to build something broken in a very specific, novel way.
                            My suggestion wasn't taken seriously, unfortunately, but I suppose it might be interesting.

                            1 Reply Last reply
                            0
                            • inthehands@hachyderm.ioI inthehands@hachyderm.io

                              IT WAS IN THE TRAINING DATA. Your test gave the machine a pile of correct answers and free license to plagiarize.

                              I remember people being wowed that Claude Code could implement a complete C compiler. But somehow it doesn’t sound quite as impressive when you phrase it as “given every existing C compiler as input, the LLM can produce a C compiler as output.”

                              2/

                              stevewfolds@mastodon.worldS This user is from outside of this forum
                              stevewfolds@mastodon.worldS This user is from outside of this forum
                              stevewfolds@mastodon.world
                              wrote sidst redigeret af
                              #15

                              @inthehands
                              Grep could do it faster. 😉

                              1 Reply Last reply
                              0
                              • datarama@hachyderm.ioD datarama@hachyderm.io

                                @inthehands (Completely aside: I sometimes feel like I ought to learn French *just* to be able to read that one in its original language. It is very special to me; it was the first book I ever read myself.

                                Even more completely aside: Someone ought to give Hayao Miyazaki the rights to create an animated interpretation of it, and supply him with however much coffee and pure adrenochrome needed to keep him alive, active and in good spirits for however long that takes. And reward him with whatever he wanted - for example, the exclusive right to hunt Sam Altman for sport.)

                                linza@kamu.socialL This user is from outside of this forum
                                linza@kamu.socialL This user is from outside of this forum
                                linza@kamu.social
                                wrote sidst redigeret af
                                #16

                                @datarama @inthehands Miyazaki's headed for retirement (again), may I propose Guillermo del Toro?

                                datarama@hachyderm.ioD 1 Reply Last reply
                                0
                                • inthehands@hachyderm.ioI inthehands@hachyderm.io

                                  RE: https://unstable.systems/@jneen/116618931097778342

                                  Worth looking at both the quoted text here and •especially• the linked page, which is quite good.

                                  I’ll add another item of my own. The first screenshot mentions giving an LLM the task of “implementing an HTTP server in JavaScript from scratch” in 90 minutes. Sounds impressive, right? Until you remember that every open-source Javascript HTTP server in existence ••was in the training data••.

                                  1/

                                  jbayes@sfba.socialJ This user is from outside of this forum
                                  jbayes@sfba.socialJ This user is from outside of this forum
                                  jbayes@sfba.social
                                  wrote sidst redigeret af
                                  #17

                                  @inthehands Additionally, why would I want a tool to write me an http server? I already have lots of http servers.

                                  What's next? Are you going to invent a tool that will mail me thousands of free AOL CD's?

                                  1 Reply Last reply
                                  0
                                  • inthehands@hachyderm.ioI inthehands@hachyderm.io

                                    The reasons we consider plagiarism to be cheating in school and malpractice in professional contexts are many of the same reasons that LLMs are not going to replace all knowledge-based human labor. (Details left as an exercise for the reader.)

                                    And yes, a whole lot of what LLMs do •would• count as plagiarism if one of my students did it manually, and •should• count as plagiarism just the same if they use a machine to do it — not just in a “that’s cheating!!” sense, but more importantly in a “that’s not really doing the work” sense.

                                    5/

                                    inthehands@hachyderm.ioI This user is from outside of this forum
                                    inthehands@hachyderm.ioI This user is from outside of this forum
                                    inthehands@hachyderm.io
                                    wrote sidst redigeret af
                                    #18

                                    One last example:

                                    The first LLM code example that really made my eyes pop was early after the release of GPT, when somebody got it to combine Breakout with Conway’s Game of Life (a truly delightful idea). It worked!

                                    Funny thing: the Breakout code and the Life code had a •completely• different style and flavor. Red flag. In about 15 minutes of web searching, I was able to find one of the projects (can’t remember if it was the Breakout or the Life half) which it had copied wholesale, with just a few variable renames. And the other half? It was in Python, but it used dictionaries where it really should have used objects — tons of `thing["prop"]` where it should have said `thing.prop`, and lots of other un-Pythonic stuff besides. It was a machine translate of code from another language, very likely Javascript.

                                    The entire thing was a plagiarized Breakout and a plagiarized Game of Life, one transpiled, and all stuck together in a single run loop. To be fair, figuring out how to (1) run both halves of the logic from a single loop and (2) count the Life cells as Breakout bricks is work I'd cheer on from a second-semester intro CS student! It's not, however, quite what's being sold by these companies.

                                    6/

                                    inthehands@hachyderm.ioI joe@f.duriansoftware.comJ 2 Replies Last reply
                                    0
                                    • inthehands@hachyderm.ioI inthehands@hachyderm.io

                                      IT WAS IN THE TRAINING DATA. Your test gave the machine a pile of correct answers and free license to plagiarize.

                                      I remember people being wowed that Claude Code could implement a complete C compiler. But somehow it doesn’t sound quite as impressive when you phrase it as “given every existing C compiler as input, the LLM can produce a C compiler as output.”

                                      2/

                                      andymoose@fedi.aiga.rocksA This user is from outside of this forum
                                      andymoose@fedi.aiga.rocksA This user is from outside of this forum
                                      andymoose@fedi.aiga.rocks
                                      wrote sidst redigeret af
                                      #19

                                      @inthehands @rayckeith Same with Mythos - the benchmark tests are in the training data. Cybergym tracks their performance and their website has a link to their github which refers to all the evaluation data being in huggingface (with the git repo link to it). It stands to reason that every model has been trained on the very thing they are then evaluated on.

                                      1 Reply Last reply
                                      0
                                      • linza@kamu.socialL linza@kamu.social

                                        @datarama @inthehands Miyazaki's headed for retirement (again), may I propose Guillermo del Toro?

                                        datarama@hachyderm.ioD This user is from outside of this forum
                                        datarama@hachyderm.ioD This user is from outside of this forum
                                        datarama@hachyderm.io
                                        wrote sidst redigeret af
                                        #20

                                        @Linza @inthehands Miyazaki was born to make that movie, and it is a cosmic injustice of the highest order if he doesn't get to do it.

                                        inthehands@hachyderm.ioI 1 Reply Last reply
                                        0
                                        • shiitaketoast@beige.partyS shiitaketoast@beige.party

                                          @Lily_and_frog @inthehands I think that has been said for plumbing—$1 for the part, $45 for the five minutes it took to install the part, $400 for the years of experience it takes to know which part and how to install it.

                                          lily_and_frog@mastodon.artL This user is from outside of this forum
                                          lily_and_frog@mastodon.artL This user is from outside of this forum
                                          lily_and_frog@mastodon.art
                                          wrote sidst redigeret af
                                          #21

                                          @ShiitakeToast @inthehands

                                          Yep!
                                          But in the case of LLMs, it's thousands and thousands of years of experience + hundreds and hundreds of years of just... labelling the source material!

                                          1 Reply Last reply
                                          0
                                          Svar
                                          • Svar som emne
                                          Login for at svare
                                          • Ældste til nyeste
                                          • Nyeste til ældste
                                          • Most Votes


                                          • Log ind

                                          • Har du ikke en konto? Tilmeld

                                          • Login or register to search.
                                          Powered by NodeBB Contributors
                                          Graciously hosted by data.coop
                                          • First post
                                            Last post
                                          0
                                          • Hjem
                                          • Seneste
                                          • Etiketter
                                          • Populære
                                          • Verden
                                          • Bruger
                                          • Grupper