Skip to content
  • Hjem
  • Seneste
  • Etiketter
  • Populære
  • Verden
  • Bruger
  • Grupper
Temaer
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Kollaps
FARVEL BIG TECH
  1. Forside
  2. Ikke-kategoriseret
  3. Worth looking at both the quoted text here and •especially• the linked page, which is quite good.

Worth looking at both the quoted text here and •especially• the linked page, which is quite good.

Planlagt Fastgjort Låst Flyttet Ikke-kategoriseret
45 Indlæg 22 Posters 0 Visninger
  • Ældste til nyeste
  • Nyeste til ældste
  • Most Votes
Svar
  • Svar som emne
Login for at svare
Denne tråd er blevet slettet. Kun brugere med emne behandlings privilegier kan se den.
  • inthehands@hachyderm.ioI inthehands@hachyderm.io

    RE: https://unstable.systems/@jneen/116618931097778342

    Worth looking at both the quoted text here and •especially• the linked page, which is quite good.

    I’ll add another item of my own. The first screenshot mentions giving an LLM the task of “implementing an HTTP server in JavaScript from scratch” in 90 minutes. Sounds impressive, right? Until you remember that every open-source Javascript HTTP server in existence ••was in the training data••.

    1/

    jbayes@sfba.socialJ This user is from outside of this forum
    jbayes@sfba.socialJ This user is from outside of this forum
    jbayes@sfba.social
    wrote sidst redigeret af
    #17

    @inthehands Additionally, why would I want a tool to write me an http server? I already have lots of http servers.

    What's next? Are you going to invent a tool that will mail me thousands of free AOL CD's?

    1 Reply Last reply
    0
    • inthehands@hachyderm.ioI inthehands@hachyderm.io

      The reasons we consider plagiarism to be cheating in school and malpractice in professional contexts are many of the same reasons that LLMs are not going to replace all knowledge-based human labor. (Details left as an exercise for the reader.)

      And yes, a whole lot of what LLMs do •would• count as plagiarism if one of my students did it manually, and •should• count as plagiarism just the same if they use a machine to do it — not just in a “that’s cheating!!” sense, but more importantly in a “that’s not really doing the work” sense.

      5/

      inthehands@hachyderm.ioI This user is from outside of this forum
      inthehands@hachyderm.ioI This user is from outside of this forum
      inthehands@hachyderm.io
      wrote sidst redigeret af
      #18

      One last example:

      The first LLM code example that really made my eyes pop was early after the release of GPT, when somebody got it to combine Breakout with Conway’s Game of Life (a truly delightful idea). It worked!

      Funny thing: the Breakout code and the Life code had a •completely• different style and flavor. Red flag. In about 15 minutes of web searching, I was able to find one of the projects (can’t remember if it was the Breakout or the Life half) which it had copied wholesale, with just a few variable renames. And the other half? It was in Python, but it used dictionaries where it really should have used objects — tons of `thing["prop"]` where it should have said `thing.prop`, and lots of other un-Pythonic stuff besides. It was a machine translate of code from another language, very likely Javascript.

      The entire thing was a plagiarized Breakout and a plagiarized Game of Life, one transpiled, and all stuck together in a single run loop. To be fair, figuring out how to (1) run both halves of the logic from a single loop and (2) count the Life cells as Breakout bricks is work I'd cheer on from a second-semester intro CS student! It's not, however, quite what's being sold by these companies.

      6/

      inthehands@hachyderm.ioI joe@f.duriansoftware.comJ 2 Replies Last reply
      0
      • inthehands@hachyderm.ioI inthehands@hachyderm.io

        IT WAS IN THE TRAINING DATA. Your test gave the machine a pile of correct answers and free license to plagiarize.

        I remember people being wowed that Claude Code could implement a complete C compiler. But somehow it doesn’t sound quite as impressive when you phrase it as “given every existing C compiler as input, the LLM can produce a C compiler as output.”

        2/

        andymoose@fedi.aiga.rocksA This user is from outside of this forum
        andymoose@fedi.aiga.rocksA This user is from outside of this forum
        andymoose@fedi.aiga.rocks
        wrote sidst redigeret af
        #19

        @inthehands @rayckeith Same with Mythos - the benchmark tests are in the training data. Cybergym tracks their performance and their website has a link to their github which refers to all the evaluation data being in huggingface (with the git repo link to it). It stands to reason that every model has been trained on the very thing they are then evaluated on.

        1 Reply Last reply
        0
        • linza@kamu.socialL linza@kamu.social

          @datarama @inthehands Miyazaki's headed for retirement (again), may I propose Guillermo del Toro?

          datarama@hachyderm.ioD This user is from outside of this forum
          datarama@hachyderm.ioD This user is from outside of this forum
          datarama@hachyderm.io
          wrote sidst redigeret af
          #20

          @Linza @inthehands Miyazaki was born to make that movie, and it is a cosmic injustice of the highest order if he doesn't get to do it.

          inthehands@hachyderm.ioI 1 Reply Last reply
          0
          • shiitaketoast@beige.partyS shiitaketoast@beige.party

            @Lily_and_frog @inthehands I think that has been said for plumbing—$1 for the part, $45 for the five minutes it took to install the part, $400 for the years of experience it takes to know which part and how to install it.

            lily_and_frog@mastodon.artL This user is from outside of this forum
            lily_and_frog@mastodon.artL This user is from outside of this forum
            lily_and_frog@mastodon.art
            wrote sidst redigeret af
            #21

            @ShiitakeToast @inthehands

            Yep!
            But in the case of LLMs, it's thousands and thousands of years of experience + hundreds and hundreds of years of just... labelling the source material!

            1 Reply Last reply
            0
            • datarama@hachyderm.ioD datarama@hachyderm.io

              @inthehands (Completely aside: I sometimes feel like I ought to learn French *just* to be able to read that one in its original language. It is very special to me; it was the first book I ever read myself.

              Even more completely aside: Someone ought to give Hayao Miyazaki the rights to create an animated interpretation of it, and supply him with however much coffee and pure adrenochrome needed to keep him alive, active and in good spirits for however long that takes. And reward him with whatever he wanted - for example, the exclusive right to hunt Sam Altman for sport.)

              temptoetiam@eldritch.cafeT This user is from outside of this forum
              temptoetiam@eldritch.cafeT This user is from outside of this forum
              temptoetiam@eldritch.cafe
              wrote sidst redigeret af
              #22

              @datarama @inthehands That is a reasonably achievable goal since it is pretty short and written in a very accessible language.
              So feel encouraged to try if you wish to!
              (As a French person, I never had to learn it the hard way, and admire anyone who does)

              datarama@hachyderm.ioD 1 Reply Last reply
              0
              • inthehands@hachyderm.ioI inthehands@hachyderm.io

                IT WAS IN THE TRAINING DATA. Your test gave the machine a pile of correct answers and free license to plagiarize.

                I remember people being wowed that Claude Code could implement a complete C compiler. But somehow it doesn’t sound quite as impressive when you phrase it as “given every existing C compiler as input, the LLM can produce a C compiler as output.”

                2/

                mhoye@cosocial.caM This user is from outside of this forum
                mhoye@cosocial.caM This user is from outside of this forum
                mhoye@cosocial.ca
                wrote sidst redigeret af
                #23

                @inthehands

                We need to consider the possibility that soon cp(1) will become sentient.

                https://exple.tive.org/blarg/2026/02/12/the-pride-of-subject-hometown-here/

                https://exple.tive.org/blarg/2026/02/07/on-the-crank-spectrum/

                1 Reply Last reply
                0
                • temptoetiam@eldritch.cafeT temptoetiam@eldritch.cafe

                  @datarama @inthehands That is a reasonably achievable goal since it is pretty short and written in a very accessible language.
                  So feel encouraged to try if you wish to!
                  (As a French person, I never had to learn it the hard way, and admire anyone who does)

                  datarama@hachyderm.ioD This user is from outside of this forum
                  datarama@hachyderm.ioD This user is from outside of this forum
                  datarama@hachyderm.io
                  wrote sidst redigeret af
                  #24

                  @temptoetiam @inthehands I can read Borges (slowly and with embarrassingly frequent dictionary breaks) in the original Spanish! That's actually one of the ways I maintained being able to at least read the language (though I struggle with understanding spoken Spanish, if it's spoken at a natural pace) since back when I took Spanish in high school.

                  (I can also read Russian children's literature - *very* far from my goal of being able to read the Strugatsky brothers' science fiction in the original language. 🙂 )

                  temptoetiam@eldritch.cafeT 1 Reply Last reply
                  0
                  • inthehands@hachyderm.ioI inthehands@hachyderm.io

                    One last example:

                    The first LLM code example that really made my eyes pop was early after the release of GPT, when somebody got it to combine Breakout with Conway’s Game of Life (a truly delightful idea). It worked!

                    Funny thing: the Breakout code and the Life code had a •completely• different style and flavor. Red flag. In about 15 minutes of web searching, I was able to find one of the projects (can’t remember if it was the Breakout or the Life half) which it had copied wholesale, with just a few variable renames. And the other half? It was in Python, but it used dictionaries where it really should have used objects — tons of `thing["prop"]` where it should have said `thing.prop`, and lots of other un-Pythonic stuff besides. It was a machine translate of code from another language, very likely Javascript.

                    The entire thing was a plagiarized Breakout and a plagiarized Game of Life, one transpiled, and all stuck together in a single run loop. To be fair, figuring out how to (1) run both halves of the logic from a single loop and (2) count the Life cells as Breakout bricks is work I'd cheer on from a second-semester intro CS student! It's not, however, quite what's being sold by these companies.

                    6/

                    inthehands@hachyderm.ioI This user is from outside of this forum
                    inthehands@hachyderm.ioI This user is from outside of this forum
                    inthehands@hachyderm.io
                    wrote sidst redigeret af
                    #25

                    As per my posts, I have the luxury of not having LLM vendors shoved down my throat, and I generally avoid them for ethical reasons:

                    https://hachyderm.io/@inthehands/116581463138461199

                    But because this all these questions about the usage and limits of these tools keep crashing through my doors, all of our doors, whatever we think of the ethical showstoppers, well…

                    …fight off amazing percentages of LLM overhype with this one weird question.

                    /end

                    gildilinie@beige.partyG bifouba@kolektiva.socialB 2 Replies Last reply
                    1
                    0
                    • inthehands@hachyderm.ioI inthehands@hachyderm.io

                      One last example:

                      The first LLM code example that really made my eyes pop was early after the release of GPT, when somebody got it to combine Breakout with Conway’s Game of Life (a truly delightful idea). It worked!

                      Funny thing: the Breakout code and the Life code had a •completely• different style and flavor. Red flag. In about 15 minutes of web searching, I was able to find one of the projects (can’t remember if it was the Breakout or the Life half) which it had copied wholesale, with just a few variable renames. And the other half? It was in Python, but it used dictionaries where it really should have used objects — tons of `thing["prop"]` where it should have said `thing.prop`, and lots of other un-Pythonic stuff besides. It was a machine translate of code from another language, very likely Javascript.

                      The entire thing was a plagiarized Breakout and a plagiarized Game of Life, one transpiled, and all stuck together in a single run loop. To be fair, figuring out how to (1) run both halves of the logic from a single loop and (2) count the Life cells as Breakout bricks is work I'd cheer on from a second-semester intro CS student! It's not, however, quite what's being sold by these companies.

                      6/

                      joe@f.duriansoftware.comJ This user is from outside of this forum
                      joe@f.duriansoftware.comJ This user is from outside of this forum
                      joe@f.duriansoftware.com
                      wrote sidst redigeret af
                      #26

                      @inthehands i've noticed a trend in anecdotes recently where people are finding it harder to trace their novel-seeming LLM outputs back to inputs. i wonder if this is a result of them atomizing their inputs more finely, or being "better" at swapping the tokens around to make output look original. (an AI bro might argue that at some point human creativity is doing the same thing…)

                      inthehands@hachyderm.ioI 1 Reply Last reply
                      0
                      • inthehands@hachyderm.ioI inthehands@hachyderm.io

                        As per my posts, I have the luxury of not having LLM vendors shoved down my throat, and I generally avoid them for ethical reasons:

                        https://hachyderm.io/@inthehands/116581463138461199

                        But because this all these questions about the usage and limits of these tools keep crashing through my doors, all of our doors, whatever we think of the ethical showstoppers, well…

                        …fight off amazing percentages of LLM overhype with this one weird question.

                        /end

                        gildilinie@beige.partyG This user is from outside of this forum
                        gildilinie@beige.partyG This user is from outside of this forum
                        gildilinie@beige.party
                        wrote sidst redigeret af
                        #27

                        @inthehands Going further: if you could google the source for a HTTP server in JavaScript in 2015, the LLM should be able to output one in 0 (zero) minutes or it's failed.

                        1 Reply Last reply
                        0
                        • datarama@hachyderm.ioD datarama@hachyderm.io

                          @inthehands (Completely aside: I sometimes feel like I ought to learn French *just* to be able to read that one in its original language. It is very special to me; it was the first book I ever read myself.

                          Even more completely aside: Someone ought to give Hayao Miyazaki the rights to create an animated interpretation of it, and supply him with however much coffee and pure adrenochrome needed to keep him alive, active and in good spirits for however long that takes. And reward him with whatever he wanted - for example, the exclusive right to hunt Sam Altman for sport.)

                          inthehands@hachyderm.ioI This user is from outside of this forum
                          inthehands@hachyderm.ioI This user is from outside of this forum
                          inthehands@hachyderm.io
                          wrote sidst redigeret af
                          #28

                          @datarama
                          I know just enough French to have read Le Petit Prince in the original language (with some struggle), and…it really is beautiful in French in a way that translations don't capture. “On ne voit bien qu'avec le coeur” can translate into English quite directly as “One does not see well but with the heart,” but it just doesn't have the same poetry and magic at all.

                          datarama@hachyderm.ioD janeishly@beige.partyJ 2 Replies Last reply
                          0
                          • datarama@hachyderm.ioD datarama@hachyderm.io

                            @temptoetiam @inthehands I can read Borges (slowly and with embarrassingly frequent dictionary breaks) in the original Spanish! That's actually one of the ways I maintained being able to at least read the language (though I struggle with understanding spoken Spanish, if it's spoken at a natural pace) since back when I took Spanish in high school.

                            (I can also read Russian children's literature - *very* far from my goal of being able to read the Strugatsky brothers' science fiction in the original language. 🙂 )

                            temptoetiam@eldritch.cafeT This user is from outside of this forum
                            temptoetiam@eldritch.cafeT This user is from outside of this forum
                            temptoetiam@eldritch.cafe
                            wrote sidst redigeret af
                            #29

                            @datarama @inthehands knowing Spanish is a great stepping stone to learn any other romance languages!
                            Bon courage à toi 🙂

                            1 Reply Last reply
                            0
                            • inthehands@hachyderm.ioI inthehands@hachyderm.io

                              @datarama
                              I know just enough French to have read Le Petit Prince in the original language (with some struggle), and…it really is beautiful in French in a way that translations don't capture. “On ne voit bien qu'avec le coeur” can translate into English quite directly as “One does not see well but with the heart,” but it just doesn't have the same poetry and magic at all.

                              datarama@hachyderm.ioD This user is from outside of this forum
                              datarama@hachyderm.ioD This user is from outside of this forum
                              datarama@hachyderm.io
                              wrote sidst redigeret af
                              #30

                              @inthehands I've read it in Danish and English. I personally like the Danish translation best.

                              1 Reply Last reply
                              0
                              • galbinuscaeli@spacey.spaceG galbinuscaeli@spacey.space

                                @inthehands Given pictures of a giraffe, a rhinoceros, an elephant, and a squirrel, find the squirrel. Feel free to reference dictionaries, encyclopedias, nature documentaries and previous responses to this same question.

                                inthehands@hachyderm.ioI This user is from outside of this forum
                                inthehands@hachyderm.ioI This user is from outside of this forum
                                inthehands@hachyderm.io
                                wrote sidst redigeret af
                                #31

                                @GalbinusCaeli
                                To be fair, this is an algorithmically difficult problem that was still largely an open question 10-15 years ago! Scale down your expectations by 2 or 3 orders of magnitude, and modern machine learning is truly impressive.

                                Not a $10 trillion industry. But it's impressive in a “cool research” sense, and also in a “oo, that may pose serious societal danger” sense.

                                1 Reply Last reply
                                0
                                • michael_w_busch@mastodon.onlineM michael_w_busch@mastodon.online

                                  @inthehands

                                  Back in 2023, OpenAI was hyping ChatGPT by claiming "it can pass the GRE".

                                  When all it was doing was autocomplete of the answer keys that had been used as input.

                                  inthehands@hachyderm.ioI This user is from outside of this forum
                                  inthehands@hachyderm.ioI This user is from outside of this forum
                                  inthehands@hachyderm.io
                                  wrote sidst redigeret af
                                  #32

                                  @michael_w_busch
                                  Yup. Same with passing the bar.

                                  1 Reply Last reply
                                  0
                                  • inthehands@hachyderm.ioI inthehands@hachyderm.io

                                    As per my posts, I have the luxury of not having LLM vendors shoved down my throat, and I generally avoid them for ethical reasons:

                                    https://hachyderm.io/@inthehands/116581463138461199

                                    But because this all these questions about the usage and limits of these tools keep crashing through my doors, all of our doors, whatever we think of the ethical showstoppers, well…

                                    …fight off amazing percentages of LLM overhype with this one weird question.

                                    /end

                                    bifouba@kolektiva.socialB This user is from outside of this forum
                                    bifouba@kolektiva.socialB This user is from outside of this forum
                                    bifouba@kolektiva.social
                                    wrote sidst redigeret af
                                    #33

                                    @inthehands

                                    I'm as anti-"AI" as they come, but this is a much stronger argument against these systems being intelligent, or about to achieve a breakthrough, than it is against the claim that they are useful. The ability even to quickly retrieve a known right answer needle from a haystack of less useful answers (as opposed to coming up with a new right answer from first principles) would potentially be a valuable service, if it were reliable (and less inefficient, ecologically suicidal, etc.).

                                    inthehands@hachyderm.ioI 1 Reply Last reply
                                    0
                                    • datarama@hachyderm.ioD datarama@hachyderm.io

                                      @Linza @inthehands Miyazaki was born to make that movie, and it is a cosmic injustice of the highest order if he doesn't get to do it.

                                      inthehands@hachyderm.ioI This user is from outside of this forum
                                      inthehands@hachyderm.ioI This user is from outside of this forum
                                      inthehands@hachyderm.io
                                      wrote sidst redigeret af
                                      #34

                                      @datarama @Linza
                                      My own position on this is that the book is perfect, and should not be adapted.

                                      I'm pretty sure Miyazaki understands this — and if he •were• making an adaptation, it would be because he's actually writing a dramatically different story that is largely new material and profoundly different in its scope and arc, as he did with both Kiki and Howl.

                                      datarama@hachyderm.ioD 1 Reply Last reply
                                      0
                                      • inthehands@hachyderm.ioI inthehands@hachyderm.io

                                        @datarama @Linza
                                        My own position on this is that the book is perfect, and should not be adapted.

                                        I'm pretty sure Miyazaki understands this — and if he •were• making an adaptation, it would be because he's actually writing a dramatically different story that is largely new material and profoundly different in its scope and arc, as he did with both Kiki and Howl.

                                        datarama@hachyderm.ioD This user is from outside of this forum
                                        datarama@hachyderm.ioD This user is from outside of this forum
                                        datarama@hachyderm.io
                                        wrote sidst redigeret af
                                        #35

                                        @inthehands @Linza Hence, "interpretation". 🙂 That is kinda the thing he does.

                                        (BTW, I seem to recall him mentioning that it is his favourite book.)

                                        datarama@hachyderm.ioD 1 Reply Last reply
                                        0
                                        • joe@f.duriansoftware.comJ joe@f.duriansoftware.com

                                          @inthehands i've noticed a trend in anecdotes recently where people are finding it harder to trace their novel-seeming LLM outputs back to inputs. i wonder if this is a result of them atomizing their inputs more finely, or being "better" at swapping the tokens around to make output look original. (an AI bro might argue that at some point human creativity is doing the same thing…)

                                          inthehands@hachyderm.ioI This user is from outside of this forum
                                          inthehands@hachyderm.ioI This user is from outside of this forum
                                          inthehands@hachyderm.io
                                          wrote sidst redigeret af
                                          #36

                                          @joe
                                          Yeah, people were having that same argument about humans and creativity on the more academic side of my circles back in 2023. It would be an interesting one if it didn't have all this investment money weighing it down! (Human learning, both technical and artistic, almost always starts with imitation and repetition; clearly it's a building block of this messy constellation of things that we call “intelligence.”)

                                          I do think the models are getting better at atomizing, as you put it, and I'm disappointed that there's not more research on this family of reverse-mapping problems. One question I've wondered about: can we quantify how much the output depended on a given input? e.g. how would the probability of given output have changed if the model were trained without <pattern> in its training data?

                                          1 Reply Last reply
                                          0
                                          Svar
                                          • Svar som emne
                                          Login for at svare
                                          • Ældste til nyeste
                                          • Nyeste til ældste
                                          • Most Votes


                                          • Log ind

                                          • Har du ikke en konto? Tilmeld

                                          • Login or register to search.
                                          Powered by NodeBB Contributors
                                          Graciously hosted by data.coop
                                          • First post
                                            Last post
                                          0
                                          • Hjem
                                          • Seneste
                                          • Etiketter
                                          • Populære
                                          • Verden
                                          • Bruger
                                          • Grupper