Skip to content
  • Hjem
  • Seneste
  • Etiketter
  • Populære
  • Verden
  • Bruger
  • Grupper
Temaer
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Kollaps
FARVEL BIG TECH
  1. Forside
  2. Ikke-kategoriseret
  3. LLM advocates still don’t seem to be able to comprehend that ordering the machine not to ‘make stuff up’ doesn’t help.

LLM advocates still don’t seem to be able to comprehend that ordering the machine not to ‘make stuff up’ doesn’t help.

Planlagt Fastgjort Låst Flyttet Ikke-kategoriseret
119 Indlæg 41 Posters 776 Visninger
  • Ældste til nyeste
  • Nyeste til ældste
  • Most Votes
Svar
  • Svar som emne
Login for at svare
Denne tråd er blevet slettet. Kun brugere med emne behandlings privilegier kan se den.
  • missconstrue@mefi.socialM missconstrue@mefi.social

    @nelson @eestileib @solonovamax @benjamineskola @complexmath

    I think you're on to something there. A lot of AI hype can be seen as a mass pareidoliac hallucination. We're seeing dragons in the clouds.

    eestileib@tech.lgbtE This user is from outside of this forum
    eestileib@tech.lgbtE This user is from outside of this forum
    eestileib@tech.lgbt
    wrote sidst redigeret af
    #78

    @MissConstrue @nelson @solonovamax @benjamineskola @complexmath

    I don't blame random non techy people or people who don't claim to know any philosophy for thinking chatbots are intelligent when all of their social sources of proof (rich people, relatives, the people on tv and YouTube) say it is and it seems like it is.

    I rely on social proof to pick what food I eat all the time, it's not such a bad reasoning method for stuff you can't research yourself.

    But people with CS or math or history or philosophy degrees (including all PhDs) should be ashamed of themselves if they tell other people that chatbots "think" or are "alive" or "apologize" or "feel bad".

    That is a failure to use their intellectual training, and it is fucking over people who use their social status to form opinions on these matters.

    Generative Textual Functionalism is just yet another extractive religion.

    1 Reply Last reply
    0
    • nelson@wetdry.worldN nelson@wetdry.world

      @benjamineskola "hey convincing-bullshit-a-tron2000, i want you to stop making up bullshit when you answer to me"
      "ok"

      bright_helpings@mspsocial.netB This user is from outside of this forum
      bright_helpings@mspsocial.netB This user is from outside of this forum
      bright_helpings@mspsocial.net
      wrote sidst redigeret af
      #79

      @nelson @benjamineskola "hey machine that's been trained to put together the kind of words people use if they're being deferential, do what I tell you"
      "ok"

      1 Reply Last reply
      0
      • benjamineskola@hachyderm.ioB benjamineskola@hachyderm.io

        LLM advocates still don’t seem to be able to comprehend that ordering the machine not to ‘make stuff up’ doesn’t help. It doesn’t know when it’s making stuff up, and it couldn’t change that even if you told it to. (In fact it’s always just making stuff up, and is only ever true by chance.)

        Part of why I’m so negative about them is that their advocates simply do not understand how they work and do not seem to want to.

        https://mas.to/@carnage4life/116653425489923041

        robo2@mastodon.ieR This user is from outside of this forum
        robo2@mastodon.ieR This user is from outside of this forum
        robo2@mastodon.ie
        wrote sidst redigeret af
        #80

        @benjamineskola genuine question, could the LLM share it's confidence for what it says is true.. It's a probability machine so could it say "I'm 90% sure I'm correct"?

        benjamineskola@hachyderm.ioB 1 Reply Last reply
        0
        • robo2@mastodon.ieR robo2@mastodon.ie

          @benjamineskola genuine question, could the LLM share it's confidence for what it says is true.. It's a probability machine so could it say "I'm 90% sure I'm correct"?

          benjamineskola@hachyderm.ioB This user is from outside of this forum
          benjamineskola@hachyderm.ioB This user is from outside of this forum
          benjamineskola@hachyderm.io
          wrote sidst redigeret af
          #81

          @RoBo2 No. The probability of it generating a particular output is based on frequency not correctness.

          robo2@mastodon.ieR 1 Reply Last reply
          0
          • benjamineskola@hachyderm.ioB benjamineskola@hachyderm.io

            LLM advocates still don’t seem to be able to comprehend that ordering the machine not to ‘make stuff up’ doesn’t help. It doesn’t know when it’s making stuff up, and it couldn’t change that even if you told it to. (In fact it’s always just making stuff up, and is only ever true by chance.)

            Part of why I’m so negative about them is that their advocates simply do not understand how they work and do not seem to want to.

            https://mas.to/@carnage4life/116653425489923041

            pontus_k@mastodon.socialP This user is from outside of this forum
            pontus_k@mastodon.socialP This user is from outside of this forum
            pontus_k@mastodon.social
            wrote sidst redigeret af
            #82

            @benjamineskola I wouldn't be surprised if it works. The LLMs have been trained to have a certain level of confidence when replying and to make quick guesses for "trivial" questions. Nudging them to be more thorough could cause them to check their work with deterministic tool calls more frequently. It's sort of part of the problem though that the actual correct way to use the technology is to repeat some superstitious incantation or to talk to it like a cave man.

            benjamineskola@hachyderm.ioB 1 Reply Last reply
            0
            • pontus_k@mastodon.socialP pontus_k@mastodon.social

              @benjamineskola I wouldn't be surprised if it works. The LLMs have been trained to have a certain level of confidence when replying and to make quick guesses for "trivial" questions. Nudging them to be more thorough could cause them to check their work with deterministic tool calls more frequently. It's sort of part of the problem though that the actual correct way to use the technology is to repeat some superstitious incantation or to talk to it like a cave man.

              benjamineskola@hachyderm.ioB This user is from outside of this forum
              benjamineskola@hachyderm.ioB This user is from outside of this forum
              benjamineskola@hachyderm.io
              wrote sidst redigeret af
              #83

              @pontus_k But the tool has no conception of what is true or false. It can’t ‘check its work’ because it has no way of telling what is better and what is worse. What it would produce is something that has the appearance of a verification process; but it’s no more likely to be true.

              pontus_k@mastodon.socialP 1 Reply Last reply
              0
              • benjamineskola@hachyderm.ioB benjamineskola@hachyderm.io

                @prietschka I do recall that a few weeks back he was complaining that LLM advocates get made to feel unwelcome on the fediverse. (OK? I don’t care. It’s nobody’s job to make people feel good about their bad opinions.)

                And then just a couple of days ago he was posting something critical, and like … yes this is what we’ve been saying all along.

                patrick_h_lauke@mastodon.socialP This user is from outside of this forum
                patrick_h_lauke@mastodon.socialP This user is from outside of this forum
                patrick_h_lauke@mastodon.social
                wrote sidst redigeret af
                #84

                @benjamineskola @prietschka yup, which prompted one of the folks who works on mastodon to go on a weird "we want journalists to come to fedi, right? how can we entice them to come? they're not coming because fedi is a monoculture, and you all don't engage with their views..." good times

                benjamineskola@hachyderm.ioB 1 Reply Last reply
                0
                • patrick_h_lauke@mastodon.socialP patrick_h_lauke@mastodon.social

                  @benjamineskola @prietschka yup, which prompted one of the folks who works on mastodon to go on a weird "we want journalists to come to fedi, right? how can we entice them to come? they're not coming because fedi is a monoculture, and you all don't engage with their views..." good times

                  benjamineskola@hachyderm.ioB This user is from outside of this forum
                  benjamineskola@hachyderm.ioB This user is from outside of this forum
                  benjamineskola@hachyderm.io
                  wrote sidst redigeret af
                  #85

                  @patrick_h_lauke that’s the one. i’m happy for antisocial views to remain unwelcome tbh.

                  patrick_h_lauke@mastodon.socialP 1 Reply Last reply
                  0
                  • benjamineskola@hachyderm.ioB benjamineskola@hachyderm.io

                    @patrick_h_lauke that’s the one. i’m happy for antisocial views to remain unwelcome tbh.

                    patrick_h_lauke@mastodon.socialP This user is from outside of this forum
                    patrick_h_lauke@mastodon.socialP This user is from outside of this forum
                    patrick_h_lauke@mastodon.social
                    wrote sidst redigeret af
                    #86

                    @benjamineskola happy for them to post, but then don't complain when nobody likes/subscribes/hits the bell button/follows them/whatever other made-up number-go-up metric they see as engagement

                    patrick_h_lauke@mastodon.socialP benjamineskola@hachyderm.ioB 2 Replies Last reply
                    0
                    • patrick_h_lauke@mastodon.socialP patrick_h_lauke@mastodon.social

                      @benjamineskola happy for them to post, but then don't complain when nobody likes/subscribes/hits the bell button/follows them/whatever other made-up number-go-up metric they see as engagement

                      patrick_h_lauke@mastodon.socialP This user is from outside of this forum
                      patrick_h_lauke@mastodon.socialP This user is from outside of this forum
                      patrick_h_lauke@mastodon.social
                      wrote sidst redigeret af
                      #87

                      @benjamineskola https://www.youtube.com/watch?v=Ek23bXq23pY

                      1 Reply Last reply
                      0
                      • patrick_h_lauke@mastodon.socialP patrick_h_lauke@mastodon.social

                        @benjamineskola happy for them to post, but then don't complain when nobody likes/subscribes/hits the bell button/follows them/whatever other made-up number-go-up metric they see as engagement

                        benjamineskola@hachyderm.ioB This user is from outside of this forum
                        benjamineskola@hachyderm.ioB This user is from outside of this forum
                        benjamineskola@hachyderm.io
                        wrote sidst redigeret af
                        #88

                        @patrick_h_lauke yes, true, that’s the problem; they want not only to be allowed to share their bad opinions but to be rewarded for doing so (with internet points).

                        patrick_h_lauke@mastodon.socialP 1 Reply Last reply
                        0
                        • benjamineskola@hachyderm.ioB benjamineskola@hachyderm.io

                          @patrick_h_lauke yes, true, that’s the problem; they want not only to be allowed to share their bad opinions but to be rewarded for doing so (with internet points).

                          patrick_h_lauke@mastodon.socialP This user is from outside of this forum
                          patrick_h_lauke@mastodon.socialP This user is from outside of this forum
                          patrick_h_lauke@mastodon.social
                          wrote sidst redigeret af
                          #89

                          @benjamineskola it's the "i used to have 2 million followers on twitter...don't you realise who i am?" mentality

                          1 Reply Last reply
                          0
                          • patrick_h_lauke@mastodon.socialP patrick_h_lauke@mastodon.social

                            @benjamineskola but it's ok, i have a second instance of a different LLM tasked with checking the output of the first LLM is CORRECT...

                            m0khr@mastodon.radioM This user is from outside of this forum
                            m0khr@mastodon.radioM This user is from outside of this forum
                            m0khr@mastodon.radio
                            wrote sidst redigeret af
                            #90

                            @patrick_h_lauke @benjamineskola
                            No no, as one commenter in the original thread mentioned. have THREE different passes... /s

                            1 Reply Last reply
                            0
                            • benjamineskola@hachyderm.ioB benjamineskola@hachyderm.io

                              @RoBo2 No. The probability of it generating a particular output is based on frequency not correctness.

                              robo2@mastodon.ieR This user is from outside of this forum
                              robo2@mastodon.ieR This user is from outside of this forum
                              robo2@mastodon.ie
                              wrote sidst redigeret af
                              #91

                              @benjamineskola for example if I typed "the cat sat on the " would it work out the probably the next word is "mat" with a score of 87%

                              benjamineskola@hachyderm.ioB 1 Reply Last reply
                              0
                              • benjamineskola@hachyderm.ioB benjamineskola@hachyderm.io

                                LLM advocates still don’t seem to be able to comprehend that ordering the machine not to ‘make stuff up’ doesn’t help. It doesn’t know when it’s making stuff up, and it couldn’t change that even if you told it to. (In fact it’s always just making stuff up, and is only ever true by chance.)

                                Part of why I’m so negative about them is that their advocates simply do not understand how they work and do not seem to want to.

                                https://mas.to/@carnage4life/116653425489923041

                                mysturji@mastodonapp.ukM This user is from outside of this forum
                                mysturji@mastodonapp.ukM This user is from outside of this forum
                                mysturji@mastodonapp.uk
                                wrote sidst redigeret af
                                #92

                                @benjamineskola
                                It’s not “hallucinations”.
                                It’s just putting a stream of words together on the specified subject in a SYNTACTICALLY correct order.
                                Nothing more, nothing less.
                                Semantics, accuracy, TRUTH don’t even enter into it.
                                FFS, it’s not “intelligent”. It’s code.
                                “Computers don’t make mistakes.” (remember that one?)
                                People make mistakes. People program computers. (At least, they used to.)
                                Cthulhu save us all.

                                benjamineskola@hachyderm.ioB 1 Reply Last reply
                                0
                                • mysturji@mastodonapp.ukM mysturji@mastodonapp.uk

                                  @benjamineskola
                                  It’s not “hallucinations”.
                                  It’s just putting a stream of words together on the specified subject in a SYNTACTICALLY correct order.
                                  Nothing more, nothing less.
                                  Semantics, accuracy, TRUTH don’t even enter into it.
                                  FFS, it’s not “intelligent”. It’s code.
                                  “Computers don’t make mistakes.” (remember that one?)
                                  People make mistakes. People program computers. (At least, they used to.)
                                  Cthulhu save us all.

                                  benjamineskola@hachyderm.ioB This user is from outside of this forum
                                  benjamineskola@hachyderm.ioB This user is from outside of this forum
                                  benjamineskola@hachyderm.io
                                  wrote sidst redigeret af
                                  #93

                                  @mysturji Yes, that is my point.

                                  1 Reply Last reply
                                  0
                                  • robo2@mastodon.ieR robo2@mastodon.ie

                                    @benjamineskola for example if I typed "the cat sat on the " would it work out the probably the next word is "mat" with a score of 87%

                                    benjamineskola@hachyderm.ioB This user is from outside of this forum
                                    benjamineskola@hachyderm.ioB This user is from outside of this forum
                                    benjamineskola@hachyderm.io
                                    wrote sidst redigeret af
                                    #94

                                    @RoBo2 Yes: probability. The sentence is a common one, so it’s likely to be reproduced in the output. But the LLM has no conception of whether a cat really did sit on the mat.

                                    You probably could build an LLM so that it showed the probabilities of each token; but it wouldn’t solve the problem being discussed here at all.

                                    1 Reply Last reply
                                    0
                                    • benjamineskola@hachyderm.ioB benjamineskola@hachyderm.io

                                      @pontus_k But the tool has no conception of what is true or false. It can’t ‘check its work’ because it has no way of telling what is better and what is worse. What it would produce is something that has the appearance of a verification process; but it’s no more likely to be true.

                                      pontus_k@mastodon.socialP This user is from outside of this forum
                                      pontus_k@mastodon.socialP This user is from outside of this forum
                                      pontus_k@mastodon.social
                                      wrote sidst redigeret af
                                      #95

                                      @benjamineskola Some of these systems have access to deterministic tools that could give them a better output. For example, all LLMs struggle with counting letters, but in a lot of cases they have the capability to call the unix utility 'wc' to count letters. Putting 'MAKE NO MISTAKES' in the prompt could possibly make it a bit more likely that it does so and gets it right. Don't get me wrong, I think it's absolutely stupid that this is where we are.

                                      benjamineskola@hachyderm.ioB 1 Reply Last reply
                                      0
                                      • benjamineskola@hachyderm.ioB benjamineskola@hachyderm.io

                                        LLM advocates still don’t seem to be able to comprehend that ordering the machine not to ‘make stuff up’ doesn’t help. It doesn’t know when it’s making stuff up, and it couldn’t change that even if you told it to. (In fact it’s always just making stuff up, and is only ever true by chance.)

                                        Part of why I’m so negative about them is that their advocates simply do not understand how they work and do not seem to want to.

                                        https://mas.to/@carnage4life/116653425489923041

                                        juliancalaby@social.treehouse.systemsJ This user is from outside of this forum
                                        juliancalaby@social.treehouse.systemsJ This user is from outside of this forum
                                        juliancalaby@social.treehouse.systems
                                        wrote sidst redigeret af
                                        #96

                                        @benjamineskola Colleague who is adding "AGENTS.md" files to our repositories is adding very similar paragraphs to those files.

                                        Ugh.

                                        benjamineskola@hachyderm.ioB 1 Reply Last reply
                                        0
                                        • pontus_k@mastodon.socialP pontus_k@mastodon.social

                                          @benjamineskola Some of these systems have access to deterministic tools that could give them a better output. For example, all LLMs struggle with counting letters, but in a lot of cases they have the capability to call the unix utility 'wc' to count letters. Putting 'MAKE NO MISTAKES' in the prompt could possibly make it a bit more likely that it does so and gets it right. Don't get me wrong, I think it's absolutely stupid that this is where we are.

                                          benjamineskola@hachyderm.ioB This user is from outside of this forum
                                          benjamineskola@hachyderm.ioB This user is from outside of this forum
                                          benjamineskola@hachyderm.io
                                          wrote sidst redigeret af
                                          #97

                                          @pontus_k you don’t need to hunt for ways to make this make sense.

                                          1 Reply Last reply
                                          0
                                          Svar
                                          • Svar som emne
                                          Login for at svare
                                          • Ældste til nyeste
                                          • Nyeste til ældste
                                          • Most Votes


                                          • Log ind

                                          • Har du ikke en konto? Tilmeld

                                          • Login or register to search.
                                          Powered by NodeBB Contributors
                                          Graciously hosted by data.coop
                                          • First post
                                            Last post
                                          0
                                          • Hjem
                                          • Seneste
                                          • Etiketter
                                          • Populære
                                          • Verden
                                          • Bruger
                                          • Grupper