Skip to content
  • Hjem
  • Seneste
  • Etiketter
  • Populære
  • Verden
  • Bruger
  • Grupper
Temaer
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Kollaps
FARVEL BIG TECH
  1. Forside
  2. Ikke-kategoriseret
  3. LLM advocates still don’t seem to be able to comprehend that ordering the machine not to ‘make stuff up’ doesn’t help.

LLM advocates still don’t seem to be able to comprehend that ordering the machine not to ‘make stuff up’ doesn’t help.

Planlagt Fastgjort Låst Flyttet Ikke-kategoriseret
119 Indlæg 41 Posters 776 Visninger
  • Ældste til nyeste
  • Nyeste til ældste
  • Most Votes
Svar
  • Svar som emne
Login for at svare
Denne tråd er blevet slettet. Kun brugere med emne behandlings privilegier kan se den.
  • mysturji@mastodonapp.ukM mysturji@mastodonapp.uk

    @benjamineskola
    It’s not “hallucinations”.
    It’s just putting a stream of words together on the specified subject in a SYNTACTICALLY correct order.
    Nothing more, nothing less.
    Semantics, accuracy, TRUTH don’t even enter into it.
    FFS, it’s not “intelligent”. It’s code.
    “Computers don’t make mistakes.” (remember that one?)
    People make mistakes. People program computers. (At least, they used to.)
    Cthulhu save us all.

    benjamineskola@hachyderm.ioB This user is from outside of this forum
    benjamineskola@hachyderm.ioB This user is from outside of this forum
    benjamineskola@hachyderm.io
    wrote sidst redigeret af
    #93

    @mysturji Yes, that is my point.

    1 Reply Last reply
    0
    • robo2@mastodon.ieR robo2@mastodon.ie

      @benjamineskola for example if I typed "the cat sat on the " would it work out the probably the next word is "mat" with a score of 87%

      benjamineskola@hachyderm.ioB This user is from outside of this forum
      benjamineskola@hachyderm.ioB This user is from outside of this forum
      benjamineskola@hachyderm.io
      wrote sidst redigeret af
      #94

      @RoBo2 Yes: probability. The sentence is a common one, so it’s likely to be reproduced in the output. But the LLM has no conception of whether a cat really did sit on the mat.

      You probably could build an LLM so that it showed the probabilities of each token; but it wouldn’t solve the problem being discussed here at all.

      1 Reply Last reply
      0
      • benjamineskola@hachyderm.ioB benjamineskola@hachyderm.io

        @pontus_k But the tool has no conception of what is true or false. It can’t ‘check its work’ because it has no way of telling what is better and what is worse. What it would produce is something that has the appearance of a verification process; but it’s no more likely to be true.

        pontus_k@mastodon.socialP This user is from outside of this forum
        pontus_k@mastodon.socialP This user is from outside of this forum
        pontus_k@mastodon.social
        wrote sidst redigeret af
        #95

        @benjamineskola Some of these systems have access to deterministic tools that could give them a better output. For example, all LLMs struggle with counting letters, but in a lot of cases they have the capability to call the unix utility 'wc' to count letters. Putting 'MAKE NO MISTAKES' in the prompt could possibly make it a bit more likely that it does so and gets it right. Don't get me wrong, I think it's absolutely stupid that this is where we are.

        benjamineskola@hachyderm.ioB 1 Reply Last reply
        0
        • benjamineskola@hachyderm.ioB benjamineskola@hachyderm.io

          LLM advocates still don’t seem to be able to comprehend that ordering the machine not to ‘make stuff up’ doesn’t help. It doesn’t know when it’s making stuff up, and it couldn’t change that even if you told it to. (In fact it’s always just making stuff up, and is only ever true by chance.)

          Part of why I’m so negative about them is that their advocates simply do not understand how they work and do not seem to want to.

          https://mas.to/@carnage4life/116653425489923041

          juliancalaby@social.treehouse.systemsJ This user is from outside of this forum
          juliancalaby@social.treehouse.systemsJ This user is from outside of this forum
          juliancalaby@social.treehouse.systems
          wrote sidst redigeret af
          #96

          @benjamineskola Colleague who is adding "AGENTS.md" files to our repositories is adding very similar paragraphs to those files.

          Ugh.

          benjamineskola@hachyderm.ioB 1 Reply Last reply
          0
          • pontus_k@mastodon.socialP pontus_k@mastodon.social

            @benjamineskola Some of these systems have access to deterministic tools that could give them a better output. For example, all LLMs struggle with counting letters, but in a lot of cases they have the capability to call the unix utility 'wc' to count letters. Putting 'MAKE NO MISTAKES' in the prompt could possibly make it a bit more likely that it does so and gets it right. Don't get me wrong, I think it's absolutely stupid that this is where we are.

            benjamineskola@hachyderm.ioB This user is from outside of this forum
            benjamineskola@hachyderm.ioB This user is from outside of this forum
            benjamineskola@hachyderm.io
            wrote sidst redigeret af
            #97

            @pontus_k you don’t need to hunt for ways to make this make sense.

            1 Reply Last reply
            0
            • juliancalaby@social.treehouse.systemsJ juliancalaby@social.treehouse.systems

              @benjamineskola Colleague who is adding "AGENTS.md" files to our repositories is adding very similar paragraphs to those files.

              Ugh.

              benjamineskola@hachyderm.ioB This user is from outside of this forum
              benjamineskola@hachyderm.ioB This user is from outside of this forum
              benjamineskola@hachyderm.io
              wrote sidst redigeret af
              #98

              @juliancalaby That sort of thing bugs me so much. Like, if you insist on using these tools (and I know I'm not going to win the fight against them more generally), then at least use them properly.

              I've tried to have conversations about 'how do we know whether this actually makes a difference' and so on, and I think it's probably better than it could be, but it's still very silly.

              juliancalaby@social.treehouse.systemsJ 1 Reply Last reply
              0
              • benjamineskola@hachyderm.ioB benjamineskola@hachyderm.io

                @juliancalaby That sort of thing bugs me so much. Like, if you insist on using these tools (and I know I'm not going to win the fight against them more generally), then at least use them properly.

                I've tried to have conversations about 'how do we know whether this actually makes a difference' and so on, and I think it's probably better than it could be, but it's still very silly.

                juliancalaby@social.treehouse.systemsJ This user is from outside of this forum
                juliancalaby@social.treehouse.systemsJ This user is from outside of this forum
                juliancalaby@social.treehouse.systems
                wrote sidst redigeret af
                #99

                @benjamineskola I wrote our company's AI policy, added terms to require short- and long-term evaluation of whether this is actually working for us, and management as a whole agreed so it's company policy. Which is a nice. However the head of the company has gone very AI and is pulling the company in that direction despite ... well ... all the clear points against it and the person who is functionally our sysadmin is heading up a project to add it into our workflows and is using it to do stuff with our infrastructure.

                I'm now trying to keep them accountable and biding my time before this blows up in their faces.

                Thankfully the "accountability" story is working out fairly well so far, but it's fucking exhausting dealing with this bullshit.

                1 Reply Last reply
                0
                • prietschka@mastodon.socialP prietschka@mastodon.social

                  @benjamineskola The problem with Obasanjo is he's utterly unprincipled and just chasing engagement/self-aggrandizement. His purpose for being in social spaces like Masto/Bluesky/X is to stroke his ego, so everything he does is just an act of public masturbation.

                  He's interested in self-aggrandizement and self-promotion, nothing more.

                  Which is why I use the descriptor "piece of shit" with regard to him.

                  dandean@indieweb.socialD This user is from outside of this forum
                  dandean@indieweb.socialD This user is from outside of this forum
                  dandean@indieweb.social
                  wrote sidst redigeret af
                  #100

                  @prietschka @benjamineskola It’s refreshing to see people stating this plainly. These people are dumb, they make bad choices, and making that observation is not mean.

                  1 Reply Last reply
                  0
                  • aedius@lavraievie.socialA aedius@lavraievie.social

                    @skotchygut @benjamineskola

                    LLM are a Skinner Box.

                    violetmadder@kolektiva.socialV This user is from outside of this forum
                    violetmadder@kolektiva.socialV This user is from outside of this forum
                    violetmadder@kolektiva.social
                    wrote sidst redigeret af
                    #101

                    @Aedius @skotchygut @benjamineskola

                    Designed by wannabe supervillains, built with wartime-scale resources.

                    The hell do people expect??

                    1 Reply Last reply
                    0
                    • linkplay@biplus.socialL linkplay@biplus.social

                      @nelson @benjamineskola @solonovamax
                      yeah, i think my take from about a year ago still mostly holds up https://biplus.social/@linkplay/114828181247605258

                      violetmadder@kolektiva.socialV This user is from outside of this forum
                      violetmadder@kolektiva.socialV This user is from outside of this forum
                      violetmadder@kolektiva.social
                      wrote sidst redigeret af
                      #102

                      @linkplay @nelson @benjamineskola @solonovamax

                      It's just rolling linguistic dice, words bouncing around between probablistic paddles in a bigass pachinko matrix. It's not designed to vet facts. It's designed to regurgitate plausible spitwads that RESEMBLE facts. And the weights behind all those paddles and slots are tuned to whatever agenda the designers wish.

                      ...And the designers serve planetwrecking technofascist war profiteers who party with people like Epstein.

                      Why would anyone ever trust it with so much as a goddamn casserole recipe??

                      1 Reply Last reply
                      0
                      • sherapantsuit@mastodon.socialS This user is from outside of this forum
                        sherapantsuit@mastodon.socialS This user is from outside of this forum
                        sherapantsuit@mastodon.social
                        wrote sidst redigeret af
                        #103

                        @Su_G @Aedius @benjamineskola I shamelessly stole the term from @davidgerard

                        1 Reply Last reply
                        0
                        • benjamineskola@hachyderm.ioB benjamineskola@hachyderm.io

                          LLM advocates still don’t seem to be able to comprehend that ordering the machine not to ‘make stuff up’ doesn’t help. It doesn’t know when it’s making stuff up, and it couldn’t change that even if you told it to. (In fact it’s always just making stuff up, and is only ever true by chance.)

                          Part of why I’m so negative about them is that their advocates simply do not understand how they work and do not seem to want to.

                          https://mas.to/@carnage4life/116653425489923041

                          mrundkvist@archaeo.socialM This user is from outside of this forum
                          mrundkvist@archaeo.socialM This user is from outside of this forum
                          mrundkvist@archaeo.social
                          wrote sidst redigeret af
                          #104

                          @benjamineskola
                          Even if you curate the source text in detail, there is no guarantee that you avoid falsehoods and imaginings. When you just feed the entire WWW blindly into your LLM, of course all bets are off.

                          #ai #aibubble #llm

                          benjamineskola@hachyderm.ioB 1 Reply Last reply
                          0
                          • mrundkvist@archaeo.socialM mrundkvist@archaeo.social

                            @benjamineskola
                            Even if you curate the source text in detail, there is no guarantee that you avoid falsehoods and imaginings. When you just feed the entire WWW blindly into your LLM, of course all bets are off.

                            #ai #aibubble #llm

                            benjamineskola@hachyderm.ioB This user is from outside of this forum
                            benjamineskola@hachyderm.ioB This user is from outside of this forum
                            benjamineskola@hachyderm.io
                            wrote sidst redigeret af
                            #105

                            @mrundkvist yes exactly. It’s a complete misunderstanding of how these tools work.

                            mrundkvist@archaeo.socialM 1 Reply Last reply
                            0
                            • benjamineskola@hachyderm.ioB benjamineskola@hachyderm.io

                              @mrundkvist yes exactly. It’s a complete misunderstanding of how these tools work.

                              mrundkvist@archaeo.socialM This user is from outside of this forum
                              mrundkvist@archaeo.socialM This user is from outside of this forum
                              mrundkvist@archaeo.social
                              wrote sidst redigeret af
                              #106

                              @benjamineskola
                              Also a breathtakingly naïve idea about truth!

                              1 Reply Last reply
                              0
                              • benjamineskola@hachyderm.ioB benjamineskola@hachyderm.io

                                LLM advocates still don’t seem to be able to comprehend that ordering the machine not to ‘make stuff up’ doesn’t help. It doesn’t know when it’s making stuff up, and it couldn’t change that even if you told it to. (In fact it’s always just making stuff up, and is only ever true by chance.)

                                Part of why I’m so negative about them is that their advocates simply do not understand how they work and do not seem to want to.

                                https://mas.to/@carnage4life/116653425489923041

                                troed@masto.sangberg.seT This user is from outside of this forum
                                troed@masto.sangberg.seT This user is from outside of this forum
                                troed@masto.sangberg.se
                                wrote sidst redigeret af
                                #107

                                @benjamineskola Part of why I'm so frustrated is that their detractors simply do not understand how they work and do not seem to want to.

                                (No, they're not simply correct "by chance")

                                /actual very very senior greybeard dev who challenged his own erroneous convictions regarding LLMs and now find them useful

                                benjamineskola@hachyderm.ioB 1 Reply Last reply
                                0
                                • troed@masto.sangberg.seT troed@masto.sangberg.se

                                  @benjamineskola Part of why I'm so frustrated is that their detractors simply do not understand how they work and do not seem to want to.

                                  (No, they're not simply correct "by chance")

                                  /actual very very senior greybeard dev who challenged his own erroneous convictions regarding LLMs and now find them useful

                                  benjamineskola@hachyderm.ioB This user is from outside of this forum
                                  benjamineskola@hachyderm.ioB This user is from outside of this forum
                                  benjamineskola@hachyderm.io
                                  wrote sidst redigeret af
                                  #108

                                  @troed whatever you feel the need to tell yourself to justify it.

                                  troed@masto.sangberg.seT 1 Reply Last reply
                                  0
                                  • benjamineskola@hachyderm.ioB benjamineskola@hachyderm.io

                                    @troed whatever you feel the need to tell yourself to justify it.

                                    troed@masto.sangberg.seT This user is from outside of this forum
                                    troed@masto.sangberg.seT This user is from outside of this forum
                                    troed@masto.sangberg.se
                                    wrote sidst redigeret af
                                    #109

                                    @benjamineskola Do you ever ponder that you might be wrong about something?

                                    benjamineskola@hachyderm.ioB 1 Reply Last reply
                                    0
                                    • troed@masto.sangberg.seT troed@masto.sangberg.se

                                      @benjamineskola Do you ever ponder that you might be wrong about something?

                                      benjamineskola@hachyderm.ioB This user is from outside of this forum
                                      benjamineskola@hachyderm.ioB This user is from outside of this forum
                                      benjamineskola@hachyderm.io
                                      wrote sidst redigeret af
                                      #110

                                      @troed all the time! But that doesn’t mean I’m wrong here.

                                      troed@masto.sangberg.seT 1 Reply Last reply
                                      0
                                      • benjamineskola@hachyderm.ioB benjamineskola@hachyderm.io

                                        @troed all the time! But that doesn’t mean I’m wrong here.

                                        troed@masto.sangberg.seT This user is from outside of this forum
                                        troed@masto.sangberg.seT This user is from outside of this forum
                                        troed@masto.sangberg.se
                                        wrote sidst redigeret af
                                        #111

                                        @benjamineskola You are. Interested in learning?

                                        (I was wrong about LLMs for development, took the time to learn, and changed my mind)

                                        benjamineskola@hachyderm.ioB 1 Reply Last reply
                                        0
                                        • troed@masto.sangberg.seT troed@masto.sangberg.se

                                          @benjamineskola You are. Interested in learning?

                                          (I was wrong about LLMs for development, took the time to learn, and changed my mind)

                                          benjamineskola@hachyderm.ioB This user is from outside of this forum
                                          benjamineskola@hachyderm.ioB This user is from outside of this forum
                                          benjamineskola@hachyderm.io
                                          wrote sidst redigeret af
                                          #112

                                          @troed I’m not really interested in hearing your justifications for why actually it makes total sense to just tell the text generator ‘don’t make stuff up!’ as if it’s doing so by choice.

                                          troed@masto.sangberg.seT 1 Reply Last reply
                                          0
                                          Svar
                                          • Svar som emne
                                          Login for at svare
                                          • Ældste til nyeste
                                          • Nyeste til ældste
                                          • Most Votes


                                          • Log ind

                                          • Har du ikke en konto? Tilmeld

                                          • Login or register to search.
                                          Powered by NodeBB Contributors
                                          Graciously hosted by data.coop
                                          • First post
                                            Last post
                                          0
                                          • Hjem
                                          • Seneste
                                          • Etiketter
                                          • Populære
                                          • Verden
                                          • Bruger
                                          • Grupper