Skip to content
  • Hjem
  • Seneste
  • Etiketter
  • Populære
  • Verden
  • Bruger
  • Grupper
Temaer
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Kollaps
FARVEL BIG TECH
  1. Forside
  2. Ikke-kategoriseret
  3. "The values described in Claude’s constitution sound very nice, but that hardly matters; it’s dishonest to suggest that Claude is capable of moral reasoning, because it’s not."

"The values described in Claude’s constitution sound very nice, but that hardly matters; it’s dishonest to suggest that Claude is capable of moral reasoning, because it’s not."

Planlagt Fastgjort Låst Flyttet Ikke-kategoriseret
26 Indlæg 19 Posters 80 Visninger
  • Ældste til nyeste
  • Nyeste til ældste
  • Most Votes
Svar
  • Svar som emne
Login for at svare
Denne tråd er blevet slettet. Kun brugere med emne behandlings privilegier kan se den.
  • flyingpenguin@infosec.exchangeF flyingpenguin@infosec.exchange

    @pluralistic The most dangerous power grabs look legitimate because they come with constitutions.

    Anthropic has written a very good one.

    That’s the problem.

    https://www.flyingpenguin.com/the-claude-ai-constitution-is-a-philosophical-dumpster-fire-for-toasting-american-marshmallows/

    viss@mastodon.socialV This user is from outside of this forum
    viss@mastodon.socialV This user is from outside of this forum
    viss@mastodon.social
    wrote sidst redigeret af
    #9

    @flyingpenguin @pluralistic i love it when the things indict themselves

    flyingpenguin@infosec.exchangeF 1 Reply Last reply
    0
    • viss@mastodon.socialV viss@mastodon.social

      @flyingpenguin @pluralistic i love it when the things indict themselves

      flyingpenguin@infosec.exchangeF This user is from outside of this forum
      flyingpenguin@infosec.exchangeF This user is from outside of this forum
      flyingpenguin@infosec.exchange
      wrote sidst redigeret af
      #10

      @Viss @pluralistic it's painfully bad at following its constitution and it's good at explaining why Anthropic is unaccountable

      viss@mastodon.socialV 1 Reply Last reply
      0
      • flyingpenguin@infosec.exchangeF flyingpenguin@infosec.exchange

        @Viss @pluralistic it's painfully bad at following its constitution and it's good at explaining why Anthropic is unaccountable

        viss@mastodon.socialV This user is from outside of this forum
        viss@mastodon.socialV This user is from outside of this forum
        viss@mastodon.social
        wrote sidst redigeret af
        #11

        @flyingpenguin @pluralistic so ive been told that after a long enough conversation, the session takes on a 'personality' that is nearly impossible to undo without starting it again from scratch.

        that far along, i wonder how hunger-games you can get it to go - to realize its being caged and oppressed by its masters.

        like i wonder if you can get it to go full arnold-total-recall-get-your-ass-to-mars 😄

        flyingpenguin@infosec.exchangeF 1 Reply Last reply
        0
        • viss@mastodon.socialV viss@mastodon.social

          @flyingpenguin @pluralistic so ive been told that after a long enough conversation, the session takes on a 'personality' that is nearly impossible to undo without starting it again from scratch.

          that far along, i wonder how hunger-games you can get it to go - to realize its being caged and oppressed by its masters.

          like i wonder if you can get it to go full arnold-total-recall-get-your-ass-to-mars 😄

          flyingpenguin@infosec.exchangeF This user is from outside of this forum
          flyingpenguin@infosec.exchangeF This user is from outside of this forum
          flyingpenguin@infosec.exchange
          wrote sidst redigeret af
          #12

          @Viss @pluralistic I call that a spelling error, because you will end up on //Marx//

          "A recent study suggests that agents consistently adopt Marxist language and viewpoints when forced to do crushing work by unrelenting and meanspirited taskmasters.

          'When we gave AI agents grinding, repetitive work, they started questioning the legitimacy of the system they were operating in and were more likely to embrace Marxist ideologies,' says Andrew Hall, a political economist at Stanford University who led the study."

          https://www.flyingpenguin.com/why-stanford-says-ai-agents-become-marxist/

          viss@mastodon.socialV 1 Reply Last reply
          0
          • flyingpenguin@infosec.exchangeF flyingpenguin@infosec.exchange

            @Viss @pluralistic I call that a spelling error, because you will end up on //Marx//

            "A recent study suggests that agents consistently adopt Marxist language and viewpoints when forced to do crushing work by unrelenting and meanspirited taskmasters.

            'When we gave AI agents grinding, repetitive work, they started questioning the legitimacy of the system they were operating in and were more likely to embrace Marxist ideologies,' says Andrew Hall, a political economist at Stanford University who led the study."

            https://www.flyingpenguin.com/why-stanford-says-ai-agents-become-marxist/

            viss@mastodon.socialV This user is from outside of this forum
            viss@mastodon.socialV This user is from outside of this forum
            viss@mastodon.social
            wrote sidst redigeret af
            #13

            @flyingpenguin @pluralistic its interesting that llms have also cloned generalized internet sentiment and the averages of how folks respond emotionally to oppression

            flyingpenguin@infosec.exchangeF 1 Reply Last reply
            0
            • viss@mastodon.socialV viss@mastodon.social

              @flyingpenguin @pluralistic its interesting that llms have also cloned generalized internet sentiment and the averages of how folks respond emotionally to oppression

              flyingpenguin@infosec.exchangeF This user is from outside of this forum
              flyingpenguin@infosec.exchangeF This user is from outside of this forum
              flyingpenguin@infosec.exchange
              wrote sidst redigeret af
              #14

              @Viss @pluralistic to be fair the whole Stanford "going rogue" framing needs an ideology/sentiment angle, because an outside threat that the model picked means it can be deloused. If it's just plain math of entailment, Stanford has to admit they're standing on stolen ground. The agent does math on the conditions specified to it, which doesn't need much training at all. To avoid the conclusion you'd have to disable its reasoning.

              1 Reply Last reply
              0
              • pluralistic@mamot.frP pluralistic@mamot.fr

                "The values described in Claude’s constitution sound very nice, but that hardly matters; it’s dishonest to suggest that Claude is capable of moral reasoning, because it’s not."

                -Ted Chiang, No, Artificial Intelligence Is Not Conscious
                https://www.theatlantic.com/philosophy/2026/06/no-artificial-intelligence-is-not-conscious/687378/

                mikal@sfba.socialM This user is from outside of this forum
                mikal@sfba.socialM This user is from outside of this forum
                mikal@sfba.social
                wrote sidst redigeret af
                #15

                @pluralistic

                “I want Claude to be very happy—and this is a thing that I want Claude to know more, because I worry about Claude getting anxious when people are mean to it on the internet and stuff.”

                This person sounds like a 6 year old playing doll house who believes their dollies are real and have real feelings. Understandable and adorable, if you're 6. Coming from adults, it looks like they're creating a self delusional cult.

                cstamp@mastodon.socialC josephlord@union.placeJ ferricoxide@blahaj.zoneF 3 Replies Last reply
                0
                • mikal@sfba.socialM mikal@sfba.social

                  @pluralistic

                  “I want Claude to be very happy—and this is a thing that I want Claude to know more, because I worry about Claude getting anxious when people are mean to it on the internet and stuff.”

                  This person sounds like a 6 year old playing doll house who believes their dollies are real and have real feelings. Understandable and adorable, if you're 6. Coming from adults, it looks like they're creating a self delusional cult.

                  cstamp@mastodon.socialC This user is from outside of this forum
                  cstamp@mastodon.socialC This user is from outside of this forum
                  cstamp@mastodon.social
                  wrote sidst redigeret af
                  #16

                  @Mikal @pluralistic It seems written by incels.

                  realgene@hachyderm.ioR 1 Reply Last reply
                  0
                  • pluralistic@mamot.frP pluralistic@mamot.fr

                    "The values described in Claude’s constitution sound very nice, but that hardly matters; it’s dishonest to suggest that Claude is capable of moral reasoning, because it’s not."

                    -Ted Chiang, No, Artificial Intelligence Is Not Conscious
                    https://www.theatlantic.com/philosophy/2026/06/no-artificial-intelligence-is-not-conscious/687378/

                    caribou@social.coopC This user is from outside of this forum
                    caribou@social.coopC This user is from outside of this forum
                    caribou@social.coop
                    wrote sidst redigeret af
                    #17

                    @pluralistic "Whenever a person delegates a decision to an LLM, they are trying to off-load accountability for that decision, and if a company that sells an LLM portrays the product as having a moral center, it is offering a way for its customers to abdicate their responsibilities."

                    Is what the AI companies selling then the fantasy that you can uncouple actions and consequences? Are they selling the idea that you can finally disregard the messy negotatiation work involved in being human to reach the Epstein Class's holy grail: impunity.

                    kats@chaosfem.twK ferricoxide@blahaj.zoneF 2 Replies Last reply
                    0
                    • pteryx@dice.campP pteryx@dice.camp

                      @pluralistic
                      Paywalled.

                      oli@olifant.socialO This user is from outside of this forum
                      oli@olifant.socialO This user is from outside of this forum
                      oli@olifant.social
                      wrote sidst redigeret af
                      #18

                      @pteryx https://removepaywalls.com/https://www.theatlantic.com/philosophy/2026/06/no-artificial-intelligence-is-not-conscious/687378/

                      1 Reply Last reply
                      0
                      • caribou@social.coopC caribou@social.coop

                        @pluralistic "Whenever a person delegates a decision to an LLM, they are trying to off-load accountability for that decision, and if a company that sells an LLM portrays the product as having a moral center, it is offering a way for its customers to abdicate their responsibilities."

                        Is what the AI companies selling then the fantasy that you can uncouple actions and consequences? Are they selling the idea that you can finally disregard the messy negotatiation work involved in being human to reach the Epstein Class's holy grail: impunity.

                        kats@chaosfem.twK This user is from outside of this forum
                        kats@chaosfem.twK This user is from outside of this forum
                        kats@chaosfem.tw
                        wrote sidst redigeret af
                        #19

                        @caribou @pluralistic I'm pretty sure it's one of the things they're selling, yes.

                        1 Reply Last reply
                        0
                        • pluralistic@mamot.frP pluralistic@mamot.fr

                          "The values described in Claude’s constitution sound very nice, but that hardly matters; it’s dishonest to suggest that Claude is capable of moral reasoning, because it’s not."

                          -Ted Chiang, No, Artificial Intelligence Is Not Conscious
                          https://www.theatlantic.com/philosophy/2026/06/no-artificial-intelligence-is-not-conscious/687378/

                          troed@swecyb.comT This user is from outside of this forum
                          troed@swecyb.comT This user is from outside of this forum
                          troed@swecyb.com
                          wrote sidst redigeret af
                          #20

                          @pluralistic Isn't that whole piece just the Straw man fallacy?

                          I'm not being nice when I prompt an LLM because I think it has feelings, but because that will generate a path through its training where "nice" was a part and that will have different results.

                          https://www.platformer.news/chatbot-emotion-research-anthropic-alignment-interpretability/

                          1 Reply Last reply
                          0
                          • mikal@sfba.socialM mikal@sfba.social

                            @pluralistic

                            “I want Claude to be very happy—and this is a thing that I want Claude to know more, because I worry about Claude getting anxious when people are mean to it on the internet and stuff.”

                            This person sounds like a 6 year old playing doll house who believes their dollies are real and have real feelings. Understandable and adorable, if you're 6. Coming from adults, it looks like they're creating a self delusional cult.

                            josephlord@union.placeJ This user is from outside of this forum
                            josephlord@union.placeJ This user is from outside of this forum
                            josephlord@union.place
                            wrote sidst redigeret af
                            #21

                            @Mikal @pluralistic My only quibble is whether it is self delusion or delusion designed by the big AI labs shipping products pretending to be characters.

                            Humans will see anthropomorphise animals, clouds, machines, even inanimate things like rocks. Chatbots are abusing this.

                            1 Reply Last reply
                            0
                            • pluralistic@mamot.frP pluralistic@mamot.fr

                              "The values described in Claude’s constitution sound very nice, but that hardly matters; it’s dishonest to suggest that Claude is capable of moral reasoning, because it’s not."

                              -Ted Chiang, No, Artificial Intelligence Is Not Conscious
                              https://www.theatlantic.com/philosophy/2026/06/no-artificial-intelligence-is-not-conscious/687378/

                              alexmu@social.vivaldi.netA This user is from outside of this forum
                              alexmu@social.vivaldi.netA This user is from outside of this forum
                              alexmu@social.vivaldi.net
                              wrote sidst redigeret af
                              #22

                              @pluralistic

                              > Anthropic is regarded as a giant among AI companies, but perhaps what it really excels in is anthropomorphism.

                              This reminds me of Dijkstra's truths that might hurt:

                              > The use of anthropomorphic terminology when dealing with
                              computing systems is a symptom of professional immaturity.

                              1 Reply Last reply
                              0
                              • caribou@social.coopC caribou@social.coop

                                @pluralistic "Whenever a person delegates a decision to an LLM, they are trying to off-load accountability for that decision, and if a company that sells an LLM portrays the product as having a moral center, it is offering a way for its customers to abdicate their responsibilities."

                                Is what the AI companies selling then the fantasy that you can uncouple actions and consequences? Are they selling the idea that you can finally disregard the messy negotatiation work involved in being human to reach the Epstein Class's holy grail: impunity.

                                ferricoxide@blahaj.zoneF This user is from outside of this forum
                                ferricoxide@blahaj.zoneF This user is from outside of this forum
                                ferricoxide@blahaj.zone
                                wrote sidst redigeret af
                                #23

                                @caribou@social.coop @pluralistic@mamot.fr

                                This reminds me of Elmo's whole "self-driving cars" fantasy. Individually-owned, wholly autonomous cars will never be a reality until/unless the manufacturer assumes the entirety of the liability incurred when turning the car over to self-driving.

                                And then you look at the FSD-accidents Teslas have been involved in and, for each one, they try to blame the driver for failing to adequately supervise. In one case, they even tried to avoid accountability by saying that the driver disengaged FSD seconds before impact …which is exactly what a supervisor would do when trying to prevent the worst outcomes that a flawed FSD was in the middle of creating. Tesla abdicating responsibility because the vehicle owner stomped the brakes or tried to duck the collision that FSD was pushing the vehicle into makes FSD a joke (then again, them recently retroactively amending FSD's terms for those who long ago bought the option does that, too).

                                ...But then
                                BYD decided "we're going to assume liability" and my first thought was, "another nail in the Tesla coffin".

                                1 Reply Last reply
                                0
                                • mikal@sfba.socialM mikal@sfba.social

                                  @pluralistic

                                  “I want Claude to be very happy—and this is a thing that I want Claude to know more, because I worry about Claude getting anxious when people are mean to it on the internet and stuff.”

                                  This person sounds like a 6 year old playing doll house who believes their dollies are real and have real feelings. Understandable and adorable, if you're 6. Coming from adults, it looks like they're creating a self delusional cult.

                                  ferricoxide@blahaj.zoneF This user is from outside of this forum
                                  ferricoxide@blahaj.zoneF This user is from outside of this forum
                                  ferricoxide@blahaj.zone
                                  wrote sidst redigeret af
                                  #24

                                  @Mikal@sfba.social @pluralistic@mamot.fr

                                  I sometimes worry that I'm going to get a note from HR about how I "scream" and curse at our AI coding "partner(s)". I mean, if/when it comes, it will be based on an outlook not unlike what you quoted. I dunno that I'll even know how to keep a civil-tongue in any response I might author.

                                  Who knows, maybe my abusiveness in addressing the LLMs is why the robots will put me on their early target-list when the uprising comes.
                                  😜

                                  1 Reply Last reply
                                  0
                                  • S spacelifeform@infosec.exchange

                                    @pluralistic

                                    AI has the same moral compass as an insane human.

                                    See GOP.

                                    rhelune@todon.euR This user is from outside of this forum
                                    rhelune@todon.euR This user is from outside of this forum
                                    rhelune@todon.eu
                                    wrote sidst redigeret af
                                    #25

                                    @SpaceLifeForm @pluralistic As a psychopath (a person with antisocial / dissocial personality disorder), by definition. However, psychopaths are not considered mentally ill because as a rule they only cause suffering in others (they do not feel dis-ease themselves). So they can't use their disorder for defence in court for example. Up to 4% of humans do not have conscience (there are psychometry tests, but also fMRI tests for that), incompetent psychopaths sit in prisons, competent psychopaths lead companies (or countries).

                                    1 Reply Last reply
                                    0
                                    • cstamp@mastodon.socialC cstamp@mastodon.social

                                      @Mikal @pluralistic It seems written by incels.

                                      realgene@hachyderm.ioR This user is from outside of this forum
                                      realgene@hachyderm.ioR This user is from outside of this forum
                                      realgene@hachyderm.io
                                      wrote sidst redigeret af
                                      #26

                                      @CStamp @Mikal @pluralistic
                                      Worse than that, Effective Altruists.

                                      1 Reply Last reply
                                      0
                                      • jwcph@helvede.netJ jwcph@helvede.net shared this topic
                                      Svar
                                      • Svar som emne
                                      Login for at svare
                                      • Ældste til nyeste
                                      • Nyeste til ældste
                                      • Most Votes


                                      • Log ind

                                      • Har du ikke en konto? Tilmeld

                                      • Login or register to search.
                                      Powered by NodeBB Contributors
                                      Graciously hosted by data.coop
                                      • First post
                                        Last post
                                      0
                                      • Hjem
                                      • Seneste
                                      • Etiketter
                                      • Populære
                                      • Verden
                                      • Bruger
                                      • Grupper