Skip to content
  • Hjem
  • Seneste
  • Etiketter
  • Populære
  • Verden
  • Bruger
  • Grupper
Temaer
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Kollaps
FARVEL BIG TECH
  1. Forside
  2. Ikke-kategoriseret
  3. all the criticism has been said, all the takes been had.

all the criticism has been said, all the takes been had.

Planlagt Fastgjort Låst Flyttet Ikke-kategoriseret
110 Indlæg 60 Posters 384 Visninger
  • Ældste til nyeste
  • Nyeste til ældste
  • Most Votes
Svar
  • Svar som emne
Login for at svare
Denne tråd er blevet slettet. Kun brugere med emne behandlings privilegier kan se den.
  • kalenxi@mastodon.socialK kalenxi@mastodon.social

    @ainmosni
    I also believe we're currently in an unsustainable bubble where the true cost of all of this inference is subsidized by people who have been allowed to accrue too much capital. And I feel like in fairly short order people are going to have to start reckoning with whether all this is worth it when venture capital isn't covering 90% of the real cost for them.

    kalenxi@mastodon.socialK This user is from outside of this forum
    kalenxi@mastodon.socialK This user is from outside of this forum
    kalenxi@mastodon.social
    wrote sidst redigeret af
    #28

    @ainmosni I'll also admit that I find it very difficult to truly understand what people who have an emotional relationship with AI are going through. Because my reaction to a software program feigning any kind of emotional understanding or empathy is to be extremely repulsed.

    kalenxi@mastodon.socialK 1 Reply Last reply
    0
    • kalenxi@mastodon.socialK kalenxi@mastodon.social

      @ainmosni I'll also admit that I find it very difficult to truly understand what people who have an emotional relationship with AI are going through. Because my reaction to a software program feigning any kind of emotional understanding or empathy is to be extremely repulsed.

      kalenxi@mastodon.socialK This user is from outside of this forum
      kalenxi@mastodon.socialK This user is from outside of this forum
      kalenxi@mastodon.social
      wrote sidst redigeret af
      #29

      @ainmosni And I do worry that the people who work for the AI companies seem to be imo somewhat delusional. Like asking a model what it wants to do in "retirement" and giving it a blog to express itself on rather than just turning it off. When they more than anyone should know it's just an algorithm and is therefore incapable of having feelings or opinions.

      1 Reply Last reply
      0
      • jonny@neuromatch.socialJ jonny@neuromatch.social

        RE: https://hails.org/@hailey/116657391001259044

        all the criticism has been said, all the takes been had. the only metaphor i have been finding consistently useful for understanding what is happening with people and "AI" is addiction, and specifically gambling addiction.

        fnrd@toots.nuF This user is from outside of this forum
        fnrd@toots.nuF This user is from outside of this forum
        fnrd@toots.nu
        wrote sidst redigeret af
        #30

        @jonny As well... there are a few people I was amazed to talk to, especially science fiction loving guys higher up at tech companies (who are like, everywhere) which have an almost animistic belief that GAI currently exists. Got called small-minded, etc, which is fine. Sources and research didn't phase them.

        They sincerely believe in it, and you know you can't replace a belief with data and information. My anthropology courses have never been better applied than recently with the cultish behaviour around genAI. I'm interested in how it feels for people who get that gambling rush about genAI, and how it ties into beliefs, could it maybe strengthen the affect?

        1 Reply Last reply
        0
        • jonny@neuromatch.socialJ jonny@neuromatch.social

          RE: https://hails.org/@hailey/116657391001259044

          all the criticism has been said, all the takes been had. the only metaphor i have been finding consistently useful for understanding what is happening with people and "AI" is addiction, and specifically gambling addiction.

          shafik@hachyderm.ioS This user is from outside of this forum
          shafik@hachyderm.ioS This user is from outside of this forum
          shafik@hachyderm.io
          wrote sidst redigeret af
          #31

          @jonny

          mandatory

          "LLMs are slot-machines": https://doctorow.medium.com/https-pluralistic-net-2025-08-16-jackpot-salience-bias-2a696501bba7

          and

          https://pony.social/@cadey/115089518168821130

          1 Reply Last reply
          0
          • jonny@neuromatch.socialJ jonny@neuromatch.social

            RE: https://hails.org/@hailey/116657391001259044

            all the criticism has been said, all the takes been had. the only metaphor i have been finding consistently useful for understanding what is happening with people and "AI" is addiction, and specifically gambling addiction.

            louka@mementomori.socialL This user is from outside of this forum
            louka@mementomori.socialL This user is from outside of this forum
            louka@mementomori.social
            wrote sidst redigeret af
            #32

            @jonny I believe addiction treatment centres are seeing the same as you. Is that peer reviewed hard research or them capitalizing on what people feel like they need treatment for? I don't know.
            https://www.naadac.org/treating-internet-addiction-ai-pornography-social-media-online-gambling-gaming

            1 Reply Last reply
            0
            • jonny@neuromatch.socialJ jonny@neuromatch.social

              RE: https://hails.org/@hailey/116657391001259044

              all the criticism has been said, all the takes been had. the only metaphor i have been finding consistently useful for understanding what is happening with people and "AI" is addiction, and specifically gambling addiction.

              option8@oldbytes.spaceO This user is from outside of this forum
              option8@oldbytes.spaceO This user is from outside of this forum
              option8@oldbytes.space
              wrote sidst redigeret af
              #33

              @jonny Just one more commit. This time will be different.

              1 Reply Last reply
              0
              • prema@hachyderm.ioP prema@hachyderm.io

                @jonny exactly, when i talk to people who maybe just use ai a little bit, their reaponses quite align with things you’d hear from drug addict, some excuses, avoidance, bit of shame.
                And i agree that the patern is finally similar to gaming/gambling addiction

                rhelune@todon.euR This user is from outside of this forum
                rhelune@todon.euR This user is from outside of this forum
                rhelune@todon.eu
                wrote sidst redigeret af
                #34

                @prema @jonny "I can quit anytime, I just choose not to."
                "I am only a social slopper. I never consume slop when alone at home."
                "I only use chatbots with my morning coffee. Coffee doesn't taste good without a chatbot."
                "Without a chatbot I wouldn't know what to do with my hands."
                "I would have quit, but I do not want to get fat."
                "All my colleagues use slop, I also need to use slop for the sake of networking."
                "If I quit nobody is going to invite me out anymore because I won't be fun to be around."

                1 Reply Last reply
                0
                • jonny@neuromatch.socialJ jonny@neuromatch.social

                  @ainmosni
                  You have to feel the rush to understand it

                  glassresistor@sfba.socialG This user is from outside of this forum
                  glassresistor@sfba.socialG This user is from outside of this forum
                  glassresistor@sfba.social
                  wrote sidst redigeret af
                  #35

                  @jonny @ainmosni exactly. winning early and being on the dopamine chasing side of nuerospicy helps kick start it.

                  i tried AI a bit and omg when its works SHOT TO THE VIEN. it takes the things you hate feeling and makes it gone but moving through bad feelings is like exercise you cant keep sick gains without daily effort

                  1 Reply Last reply
                  0
                  • ainmosni@social.ainmosni.euA ainmosni@social.ainmosni.eu

                    @jonny Yeah, I never got that rush, I think my brain is just too stuck in "cynical arsehole" mode for that to work on me.

                    datarama@hachyderm.ioD This user is from outside of this forum
                    datarama@hachyderm.ioD This user is from outside of this forum
                    datarama@hachyderm.io
                    wrote sidst redigeret af
                    #36

                    @ainmosni @jonny I don't get the rush, but I *do* get sucked into loops I can't easily break out of (I have OCD). I get sucked into "completionist mode" in games, which means that if I play a game where there are collectibles you can buy with actual money, I'm quite likely to make very stupid decisions.

                    I've learned my lesson, I avoid such games scrupulously, and I steer clear of gambling.

                    And I am very distraught that my workplace now wants me to use AI.

                    1 Reply Last reply
                    0
                    • jonny@neuromatch.socialJ jonny@neuromatch.social

                      i love gambling. i have used "AI" extensively. it feels the same.

                      tempusfelix@wehavecookies.socialT This user is from outside of this forum
                      tempusfelix@wehavecookies.socialT This user is from outside of this forum
                      tempusfelix@wehavecookies.social
                      wrote sidst redigeret af
                      #37

                      @jonny

                      Agreed. I don’t gamble because I know I’m vulnerable. And the use of Ai really is the just one more sensation. It’s too damned easy to get a plausible output. And it’s hard to stop.

                      1 Reply Last reply
                      0
                      • concretedog@mastodon.socialC concretedog@mastodon.social

                        @jonny @ainmosni gosh this convo hits... I am not a gambler and am pretty anti LLM ... really interesting.

                        catch56@kolektiva.socialC This user is from outside of this forum
                        catch56@kolektiva.socialC This user is from outside of this forum
                        catch56@kolektiva.social
                        wrote sidst redigeret af
                        #38

                        @concretedog @jonny @ainmosni also don't gamble but I think I am closer to the avoidance due to not wanting to fall into a hole than the complete lack of interest, I internally predict election results and other stuff just don't put actual money on it.

                        Don't smoke because I know it would be hard to stop and rarely play computer games for the same reason. Don't use LLMs and a big part of that is seeing other people get sucked in.

                        1 Reply Last reply
                        0
                        • jonny@neuromatch.socialJ jonny@neuromatch.social

                          RE: https://hails.org/@hailey/116657391001259044

                          all the criticism has been said, all the takes been had. the only metaphor i have been finding consistently useful for understanding what is happening with people and "AI" is addiction, and specifically gambling addiction.

                          glassresistor@sfba.socialG This user is from outside of this forum
                          glassresistor@sfba.socialG This user is from outside of this forum
                          glassresistor@sfba.social
                          wrote sidst redigeret af
                          #39

                          @jonny so this thread could be a case study

                          https://social.losno.co/@chris/116655930139554496

                          jonny@neuromatch.socialJ 1 Reply Last reply
                          0
                          • glassresistor@sfba.socialG glassresistor@sfba.social

                            @jonny so this thread could be a case study

                            https://social.losno.co/@chris/116655930139554496

                            jonny@neuromatch.socialJ This user is from outside of this forum
                            jonny@neuromatch.socialJ This user is from outside of this forum
                            jonny@neuromatch.social
                            wrote sidst redigeret af
                            #40

                            @glassresistor
                            Yeah I saw that and got sad

                            1 Reply Last reply
                            0
                            • jonny@neuromatch.socialJ jonny@neuromatch.social

                              i love gambling. i have used "AI" extensively. it feels the same.

                              jonny@neuromatch.socialJ This user is from outside of this forum
                              jonny@neuromatch.socialJ This user is from outside of this forum
                              jonny@neuromatch.social
                              wrote sidst redigeret af
                              #41

                              So, look. One shot rewriting the whole test suite in another language is probably not great to do, but what happened here is so much worse than you are expecting.

                              https://github.com/RsyncProject/rsync/pull/903/

                              This does not "translate tests into pytest" or a unit testing framework, it writes its own testing framework where tests are whole python scripts that redefine basic test functions in every script. Surely there would be a single way to "run rsync and get the results" - nope, well, there is, but then every test file will randomly redefine its own _run_and_capture function. So like now rsync needs a test suite for its test suite.

                              If instead of telling an LLM to "rewrite the tests in python" you just searched "python testing" you would find the pytest docs. And then you would find examples. And then you could write fixtures to deduplicate all the prior shell script setup and teardown stuff, and so on. But since it was just "rewrite the tests in python" its now worse than before, and the odds of the rewrite actually being a 100% faithful translation are close to 0.

                              arrjay@tacobelllabs.netA ricci@discuss.systemsR jonny@neuromatch.socialJ aud@fire.asta.lgbtA eliocamp@mastodon.socialE 6 Replies Last reply
                              0
                              • jonny@neuromatch.socialJ jonny@neuromatch.social

                                So, look. One shot rewriting the whole test suite in another language is probably not great to do, but what happened here is so much worse than you are expecting.

                                https://github.com/RsyncProject/rsync/pull/903/

                                This does not "translate tests into pytest" or a unit testing framework, it writes its own testing framework where tests are whole python scripts that redefine basic test functions in every script. Surely there would be a single way to "run rsync and get the results" - nope, well, there is, but then every test file will randomly redefine its own _run_and_capture function. So like now rsync needs a test suite for its test suite.

                                If instead of telling an LLM to "rewrite the tests in python" you just searched "python testing" you would find the pytest docs. And then you would find examples. And then you could write fixtures to deduplicate all the prior shell script setup and teardown stuff, and so on. But since it was just "rewrite the tests in python" its now worse than before, and the odds of the rewrite actually being a 100% faithful translation are close to 0.

                                arrjay@tacobelllabs.netA This user is from outside of this forum
                                arrjay@tacobelllabs.netA This user is from outside of this forum
                                arrjay@tacobelllabs.net
                                wrote sidst redigeret af
                                #42

                                @jonny *screaming intensifies*

                                1 Reply Last reply
                                0
                                • jonny@neuromatch.socialJ jonny@neuromatch.social

                                  So, look. One shot rewriting the whole test suite in another language is probably not great to do, but what happened here is so much worse than you are expecting.

                                  https://github.com/RsyncProject/rsync/pull/903/

                                  This does not "translate tests into pytest" or a unit testing framework, it writes its own testing framework where tests are whole python scripts that redefine basic test functions in every script. Surely there would be a single way to "run rsync and get the results" - nope, well, there is, but then every test file will randomly redefine its own _run_and_capture function. So like now rsync needs a test suite for its test suite.

                                  If instead of telling an LLM to "rewrite the tests in python" you just searched "python testing" you would find the pytest docs. And then you would find examples. And then you could write fixtures to deduplicate all the prior shell script setup and teardown stuff, and so on. But since it was just "rewrite the tests in python" its now worse than before, and the odds of the rewrite actually being a 100% faithful translation are close to 0.

                                  ricci@discuss.systemsR This user is from outside of this forum
                                  ricci@discuss.systemsR This user is from outside of this forum
                                  ricci@discuss.systems
                                  wrote sidst redigeret af
                                  #43

                                  @jonny 🎰

                                  1 Reply Last reply
                                  0
                                  • jonny@neuromatch.socialJ jonny@neuromatch.social

                                    So, look. One shot rewriting the whole test suite in another language is probably not great to do, but what happened here is so much worse than you are expecting.

                                    https://github.com/RsyncProject/rsync/pull/903/

                                    This does not "translate tests into pytest" or a unit testing framework, it writes its own testing framework where tests are whole python scripts that redefine basic test functions in every script. Surely there would be a single way to "run rsync and get the results" - nope, well, there is, but then every test file will randomly redefine its own _run_and_capture function. So like now rsync needs a test suite for its test suite.

                                    If instead of telling an LLM to "rewrite the tests in python" you just searched "python testing" you would find the pytest docs. And then you would find examples. And then you could write fixtures to deduplicate all the prior shell script setup and teardown stuff, and so on. But since it was just "rewrite the tests in python" its now worse than before, and the odds of the rewrite actually being a 100% faithful translation are close to 0.

                                    jonny@neuromatch.socialJ This user is from outside of this forum
                                    jonny@neuromatch.socialJ This user is from outside of this forum
                                    jonny@neuromatch.social
                                    wrote sidst redigeret af
                                    #44

                                    I think the modal situation here is that the people are reading none or very little of what is being generated by the LLM, so the tests have a special role: Tests function as the pull arm on the slot machine, you just generate until tests pass, and that's a jackpot. Obviously that's meaningless when the tests are meaningless, so tests take on a very different meaning and role in slot machine coding.

                                    Previously we would write careful test conditions that were based off some real problem or an understanding of what the code under test did, and had a specific thing they were intended to protect against. Tests move slow and are designed to protect us against the things we know can go wrong. When we learn of a new wrong thing, we add a test.

                                    LLM tests have the form of tests but don't do the same thing. They often test nothing, and are just expressions of truisms that the probabilistic text space explored while generating. They have strongly worded names but end up actually asserting that basic language features work as expected. Because it is not us writing tests for ourselves, where we only harm ourselves by making them weak, they function instead as a passively obfuscated justification for the code that the LLM generates. The user wants the tests to pass. The LLM provides.

                                    The tests are theater: they are the play field for the slot machine. They are mild, surmountable, need to fail a few times to be plausible, but must eventually pass within the expected generation loop window to deliver the payout.

                                    peterrenshaw@ioc.exchangeP jonny@neuromatch.socialJ jens@social.finkhaeuser.deJ dahukanna@mastodon.socialD synlogic4242@social.vivaldi.netS 5 Replies Last reply
                                    0
                                    • jonny@neuromatch.socialJ jonny@neuromatch.social

                                      So, look. One shot rewriting the whole test suite in another language is probably not great to do, but what happened here is so much worse than you are expecting.

                                      https://github.com/RsyncProject/rsync/pull/903/

                                      This does not "translate tests into pytest" or a unit testing framework, it writes its own testing framework where tests are whole python scripts that redefine basic test functions in every script. Surely there would be a single way to "run rsync and get the results" - nope, well, there is, but then every test file will randomly redefine its own _run_and_capture function. So like now rsync needs a test suite for its test suite.

                                      If instead of telling an LLM to "rewrite the tests in python" you just searched "python testing" you would find the pytest docs. And then you would find examples. And then you could write fixtures to deduplicate all the prior shell script setup and teardown stuff, and so on. But since it was just "rewrite the tests in python" its now worse than before, and the odds of the rewrite actually being a 100% faithful translation are close to 0.

                                      aud@fire.asta.lgbtA This user is from outside of this forum
                                      aud@fire.asta.lgbtA This user is from outside of this forum
                                      aud@fire.asta.lgbt
                                      wrote sidst redigeret af
                                      #45

                                      @jonny@neuromatch.social oh holy FUCK

                                      it is so bad

                                      1 Reply Last reply
                                      0
                                      • jonny@neuromatch.socialJ jonny@neuromatch.social

                                        I think the modal situation here is that the people are reading none or very little of what is being generated by the LLM, so the tests have a special role: Tests function as the pull arm on the slot machine, you just generate until tests pass, and that's a jackpot. Obviously that's meaningless when the tests are meaningless, so tests take on a very different meaning and role in slot machine coding.

                                        Previously we would write careful test conditions that were based off some real problem or an understanding of what the code under test did, and had a specific thing they were intended to protect against. Tests move slow and are designed to protect us against the things we know can go wrong. When we learn of a new wrong thing, we add a test.

                                        LLM tests have the form of tests but don't do the same thing. They often test nothing, and are just expressions of truisms that the probabilistic text space explored while generating. They have strongly worded names but end up actually asserting that basic language features work as expected. Because it is not us writing tests for ourselves, where we only harm ourselves by making them weak, they function instead as a passively obfuscated justification for the code that the LLM generates. The user wants the tests to pass. The LLM provides.

                                        The tests are theater: they are the play field for the slot machine. They are mild, surmountable, need to fail a few times to be plausible, but must eventually pass within the expected generation loop window to deliver the payout.

                                        peterrenshaw@ioc.exchangeP This user is from outside of this forum
                                        peterrenshaw@ioc.exchangeP This user is from outside of this forum
                                        peterrenshaw@ioc.exchange
                                        wrote sidst redigeret af
                                        #46

                                        @jonny “tests have a special role” ☺️

                                        1 Reply Last reply
                                        0
                                        • jonny@neuromatch.socialJ jonny@neuromatch.social

                                          I think the modal situation here is that the people are reading none or very little of what is being generated by the LLM, so the tests have a special role: Tests function as the pull arm on the slot machine, you just generate until tests pass, and that's a jackpot. Obviously that's meaningless when the tests are meaningless, so tests take on a very different meaning and role in slot machine coding.

                                          Previously we would write careful test conditions that were based off some real problem or an understanding of what the code under test did, and had a specific thing they were intended to protect against. Tests move slow and are designed to protect us against the things we know can go wrong. When we learn of a new wrong thing, we add a test.

                                          LLM tests have the form of tests but don't do the same thing. They often test nothing, and are just expressions of truisms that the probabilistic text space explored while generating. They have strongly worded names but end up actually asserting that basic language features work as expected. Because it is not us writing tests for ourselves, where we only harm ourselves by making them weak, they function instead as a passively obfuscated justification for the code that the LLM generates. The user wants the tests to pass. The LLM provides.

                                          The tests are theater: they are the play field for the slot machine. They are mild, surmountable, need to fail a few times to be plausible, but must eventually pass within the expected generation loop window to deliver the payout.

                                          jonny@neuromatch.socialJ This user is from outside of this forum
                                          jonny@neuromatch.socialJ This user is from outside of this forum
                                          jonny@neuromatch.social
                                          wrote sidst redigeret af
                                          #47

                                          Here's an example from some code that was thrust at me this week. The rest of the tests try a bit harder to look like tests, but this one is perplexing.

                                          What does it test? The function name suggests its a smoke test. LLMs love to call things smoke tests. That would suggest this would be an early-run test that fails loudly if some basic precondition - like having ffmpeg - fails. Or, I guess we are smoke testing the ensure_ffmpeg function? Anyway who knows. However we first check if ffmpeg or ffprobe are present, which is exactly what ensure_ffmpeg does. If they aren't present, a warning tells us that ffmpeg/ffprobe are required for the video tests, which makes it seem like this should be a parameterizing test that controls which tests are run, which of course it does not do.

                                          So the test literally does nothing and cannot possibly fail, but says it does at least two things, because to an LLM something saying it does something is the same thing as it actually doing that thing.

                                          bms48@mastodon.socialB jonny@neuromatch.socialJ bstacey@icosahedron.websiteB gunchleoc@mastodon.scotG henryk@chaos.socialH 7 Replies Last reply
                                          0
                                          Svar
                                          • Svar som emne
                                          Login for at svare
                                          • Ældste til nyeste
                                          • Nyeste til ældste
                                          • Most Votes


                                          • Log ind

                                          • Har du ikke en konto? Tilmeld

                                          • Login or register to search.
                                          Powered by NodeBB Contributors
                                          Graciously hosted by data.coop
                                          • First post
                                            Last post
                                          0
                                          • Hjem
                                          • Seneste
                                          • Etiketter
                                          • Populære
                                          • Verden
                                          • Bruger
                                          • Grupper