Skip to content
  • Hjem
  • Seneste
  • Etiketter
  • Populære
  • Verden
  • Bruger
  • Grupper
Temaer
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Kollaps
FARVEL BIG TECH
  1. Forside
  2. Ikke-kategoriseret
  3. We'll see how I feel in the morning, but for now i seem to have convinced myself to actually read that fuckin anthropic paper

We'll see how I feel in the morning, but for now i seem to have convinced myself to actually read that fuckin anthropic paper

Planlagt Fastgjort Låst Flyttet Ikke-kategoriseret
92 Indlæg 29 Posters 13 Visninger
  • Ældste til nyeste
  • Nyeste til ældste
  • Most Votes
Svar
  • Svar som emne
Login for at svare
Denne tråd er blevet slettet. Kun brugere med emne behandlings privilegier kan se den.
  • jenniferplusplus@hachyderm.ioJ jenniferplusplus@hachyderm.io

    > We find that using AI assistance to complete
    tasks that involve this new library resulted in a reduction in the evaluation score by 17% or two grade
    points (Cohen’s d = 0.738, p = 0.010). Meanwhile, we did not find a statistically significant acceleration in
    completion time with AI assistance.

    I mean, that's an enormous effect. I'm very interested in the methods section, now.

    > Through an in-depth qualitative analysis where we watch the screen recordings of every participant in our
    main study, we explain the lack of AI productivity improvement through the additional time some participants
    invested in interacting with the AI assistant.

    ...

    Is this about learning, or is it about productivity!? God.

    > We attribute the gains in skill development of the control group to the process of encountering and subsequently resolving errors independently

    Hm. Learning with instruction is generally more effective than learning through struggle. A surface level read would suggest that the stochastic chatbot actually has a counter-instructional effect. But again, we'll see what the methods actually are.

    Edit: I should say, doing things with feedback from an instructor generally has better learning outcomes than doing things in isolation. I phrased that badly.

    grimalkina@mastodon.socialG This user is from outside of this forum
    grimalkina@mastodon.socialG This user is from outside of this forum
    grimalkina@mastodon.social
    wrote sidst redigeret af
    #39

    @jenniferplusplus "Learning with instruction is generally more effective than learning through struggle"

    I'm not sure I agree! Desirable difficulties literature and metacognition lit both agree short term failures can lead to better long term retention (people's lack of belief in this is often pointed to as a reason we engage in inefficient problem solving). That is one reason project based learning can sometimes beat sage on a stage lectures

    Eg classic lit here: https://bjorklab.psych.ucla.edu/wp-content/uploads/sites/13/2016/04/EBjork_RBjork_2011.pdf

    jenniferplusplus@hachyderm.ioJ 1 Reply Last reply
    0
    • jenniferplusplus@hachyderm.ioJ jenniferplusplus@hachyderm.io

      And then we switch back to background context. We get a 11 sentences of AI = productivity. Then 3 sentences on "cognitive offloading". 4 sentences on skill retention. And 4 on "over reliance". So, fully 50% of the background section of the "AI Impacts on Skill Formation" paper is about productivity.

      jenniferplusplus@hachyderm.ioJ This user is from outside of this forum
      jenniferplusplus@hachyderm.ioJ This user is from outside of this forum
      jenniferplusplus@hachyderm.io
      wrote sidst redigeret af
      #40

      Chapter 3. Framework.

      Finally.

      Paraphrasing a little: "the learning by doing" philosphy connects completing real world tasks with learning new concepts and developing new skills. Experiental learning has also been explored to mimic solving real world problems. We focus on settings where workers must acquire new skills to complete tasks. We seek to understand both the impact of AI on productivity
      and skill formation. We ask whether AI assistance presents a tradeoff between immediate productivity and longer-term skill development or if AI assistance presents a shortcut to enhance both.

      Right. There it is again: productivity. Even within this framing, there are at least 3 more possibilities. That AI does not actually increase productivity; that AI has no effect at all; or that AI improves learning only. I think it's very telling that the authors don't even conceive of these options. Particularly the last one.

      But I'm becoming more and more convinced that the framing of productivity as an essential factor to measure and judge by is itself the whole purpose of this paper. And, specifically, productivity as defined by production output. But maybe I'm getting ahead of myself.

      jenniferplusplus@hachyderm.ioJ 1 Reply Last reply
      0
      • jenniferplusplus@hachyderm.ioJ jenniferplusplus@hachyderm.io

        And then we switch back to background context. We get a 11 sentences of AI = productivity. Then 3 sentences on "cognitive offloading". 4 sentences on skill retention. And 4 on "over reliance". So, fully 50% of the background section of the "AI Impacts on Skill Formation" paper is about productivity.

        wakame@tech.lgbtW This user is from outside of this forum
        wakame@tech.lgbtW This user is from outside of this forum
        wakame@tech.lgbt
        wrote sidst redigeret af
        #41

        @jenniferplusplus

        I love how the introduction already frames the industrial revolution all wrong. Product lines were all about de-skilling people, by turning artisans into factory workers.

        Then comparing factory work with software development. When software development is the exact opposite.

        Maybe those authors should visit a real factory and talk to some real engineers.

        And I find it absolutely fascinating how they talk about "professional domains", when all they mean is "software development".

        (Not even through the first page and already feeling the need to vent, sorry.)

        1 Reply Last reply
        0
        • jenniferplusplus@hachyderm.ioJ jenniferplusplus@hachyderm.io

          Chapter 3. Framework.

          Finally.

          Paraphrasing a little: "the learning by doing" philosphy connects completing real world tasks with learning new concepts and developing new skills. Experiental learning has also been explored to mimic solving real world problems. We focus on settings where workers must acquire new skills to complete tasks. We seek to understand both the impact of AI on productivity
          and skill formation. We ask whether AI assistance presents a tradeoff between immediate productivity and longer-term skill development or if AI assistance presents a shortcut to enhance both.

          Right. There it is again: productivity. Even within this framing, there are at least 3 more possibilities. That AI does not actually increase productivity; that AI has no effect at all; or that AI improves learning only. I think it's very telling that the authors don't even conceive of these options. Particularly the last one.

          But I'm becoming more and more convinced that the framing of productivity as an essential factor to measure and judge by is itself the whole purpose of this paper. And, specifically, productivity as defined by production output. But maybe I'm getting ahead of myself.

          jenniferplusplus@hachyderm.ioJ This user is from outside of this forum
          jenniferplusplus@hachyderm.ioJ This user is from outside of this forum
          jenniferplusplus@hachyderm.io
          wrote sidst redigeret af
          #42

          And now we have actual research questions! It feels like it shouldn't take this long to get these, but w/e

          1. Does AI assistance improve task completion productivity when new skills are required?
          2. How does using AI assistance affect the development of these new skills?

          We'll learn how the authors propose to answer these questions in the next chapter: Methods.

          But first, there is a 6 year old in here demanding I play minecraft, and I'd rather do that.

          To be continued... probbaly

          weekend_editor@mathstodon.xyzW jenniferplusplus@hachyderm.ioJ 2 Replies Last reply
          0
          • r343l@freeradical.zoneR r343l@freeradical.zone

            @inthehands @jenniferplusplus One of my personal hesitance to use the LLM tools much (despite incredible professional pressure to do so) is that my use of it (again, under professional necessity) has re-enforced my pre-existing belief that struggling through a problem, debugging and digging through source and so on has been CRITICAL to my skill development. It is something I have for (uh) 15+ years told less experienced software developers is critical to getting better / faster!

            dahukanna@mastodon.socialD This user is from outside of this forum
            dahukanna@mastodon.socialD This user is from outside of this forum
            dahukanna@mastodon.social
            wrote sidst redigeret af
            #43

            @r343l @inthehands @jenniferplusplus
            “struggling through a problem, debugging & digging through source & so on has been CRITICAL to my skill development” … because the “cognitive struggle” is like doing physical exercise or activity to get your body and brain better + faster doing it.
            Making a request & waiting for the output result is like ordering a meal from a restaurant menu & somehow expecting that action to make you an expert Chef. At most, you become an expert at ordering off a menu.

            1 Reply Last reply
            0
            • grimalkina@mastodon.socialG grimalkina@mastodon.social

              @jenniferplusplus "Learning with instruction is generally more effective than learning through struggle"

              I'm not sure I agree! Desirable difficulties literature and metacognition lit both agree short term failures can lead to better long term retention (people's lack of belief in this is often pointed to as a reason we engage in inefficient problem solving). That is one reason project based learning can sometimes beat sage on a stage lectures

              Eg classic lit here: https://bjorklab.psych.ucla.edu/wp-content/uploads/sites/13/2016/04/EBjork_RBjork_2011.pdf

              jenniferplusplus@hachyderm.ioJ This user is from outside of this forum
              jenniferplusplus@hachyderm.ioJ This user is from outside of this forum
              jenniferplusplus@hachyderm.io
              wrote sidst redigeret af
              #44

              @grimalkina I think I phrased that badly. I'm aware and agree that doing a thing, mistakes and all, is very often has better learning outcomes than lectures from experts.

              What I meant was doing a thing with guidance and feedback from an expert has better outcomes than doing it in isolation.

              1 Reply Last reply
              0
              • jenniferplusplus@hachyderm.ioJ jenniferplusplus@hachyderm.io

                @dalias Honestly, yes. I suspect the purpose of this paper is to reinforce that production is a correct and necessary factor to consider when making decisions about AI.

                And secondarily, I suspect it's establishing justification for blaming workers for undesirable outcomes; it's our fault for choosing to learn badly.

                dalias@hachyderm.ioD This user is from outside of this forum
                dalias@hachyderm.ioD This user is from outside of this forum
                dalias@hachyderm.io
                wrote sidst redigeret af
                #45

                @jenniferplusplus 🤔 The purpose of a paper is the assumptions it makes.

                jenniferplusplus@hachyderm.ioJ lispi314@udongein.xyzL 2 Replies Last reply
                0
                • jenniferplusplus@hachyderm.ioJ jenniferplusplus@hachyderm.io

                  > We find that using AI assistance to complete
                  tasks that involve this new library resulted in a reduction in the evaluation score by 17% or two grade
                  points (Cohen’s d = 0.738, p = 0.010). Meanwhile, we did not find a statistically significant acceleration in
                  completion time with AI assistance.

                  I mean, that's an enormous effect. I'm very interested in the methods section, now.

                  > Through an in-depth qualitative analysis where we watch the screen recordings of every participant in our
                  main study, we explain the lack of AI productivity improvement through the additional time some participants
                  invested in interacting with the AI assistant.

                  ...

                  Is this about learning, or is it about productivity!? God.

                  > We attribute the gains in skill development of the control group to the process of encountering and subsequently resolving errors independently

                  Hm. Learning with instruction is generally more effective than learning through struggle. A surface level read would suggest that the stochastic chatbot actually has a counter-instructional effect. But again, we'll see what the methods actually are.

                  Edit: I should say, doing things with feedback from an instructor generally has better learning outcomes than doing things in isolation. I phrased that badly.

                  catch56@kolektiva.socialC This user is from outside of this forum
                  catch56@kolektiva.socialC This user is from outside of this forum
                  catch56@kolektiva.social
                  wrote sidst redigeret af
                  #46

                  @jenniferplusplus I think the 'control group' here didn't use AI at all. At least that's how I read it. And they completed the task in more or less the same time and two grades better results.

                  1 Reply Last reply
                  0
                  • jenniferplusplus@hachyderm.ioJ jenniferplusplus@hachyderm.io

                    And now we have actual research questions! It feels like it shouldn't take this long to get these, but w/e

                    1. Does AI assistance improve task completion productivity when new skills are required?
                    2. How does using AI assistance affect the development of these new skills?

                    We'll learn how the authors propose to answer these questions in the next chapter: Methods.

                    But first, there is a 6 year old in here demanding I play minecraft, and I'd rather do that.

                    To be continued... probbaly

                    weekend_editor@mathstodon.xyzW This user is from outside of this forum
                    weekend_editor@mathstodon.xyzW This user is from outside of this forum
                    weekend_editor@mathstodon.xyz
                    wrote sidst redigeret af
                    #47

                    @jenniferplusplus

                    There's a whole series of recent studies from MIT, CMU, Boston Consulting Group, BBC, and Oxford Economics arguing that AI/LLM assistants do NOT improve productivity.

                    Walk-through here:

                    https://www.someweekendreading.blog/ai-update-2026/

                    jenniferplusplus@hachyderm.ioJ 1 Reply Last reply
                    0
                    • dalias@hachyderm.ioD dalias@hachyderm.io

                      @jenniferplusplus 🤔 The purpose of a paper is the assumptions it makes.

                      jenniferplusplus@hachyderm.ioJ This user is from outside of this forum
                      jenniferplusplus@hachyderm.ioJ This user is from outside of this forum
                      jenniferplusplus@hachyderm.io
                      wrote sidst redigeret af
                      #48

                      @dalias Not all the time. But if it's research conducted and published by the in-house research team of Anthropic? Yeah, probably

                      dalias@hachyderm.ioD 1 Reply Last reply
                      0
                      • jenniferplusplus@hachyderm.ioJ jenniferplusplus@hachyderm.io

                        @dalias Not all the time. But if it's research conducted and published by the in-house research team of Anthropic? Yeah, probably

                        dalias@hachyderm.ioD This user is from outside of this forum
                        dalias@hachyderm.ioD This user is from outside of this forum
                        dalias@hachyderm.io
                        wrote sidst redigeret af
                        #49

                        @jenniferplusplus Yeah. Or if there are conflicts of interest in the funding, or if the researchers are just aspiring to getting hired into the industry or getting VC for their own ideas.

                        1 Reply Last reply
                        0
                        • weekend_editor@mathstodon.xyzW weekend_editor@mathstodon.xyz

                          @jenniferplusplus

                          There's a whole series of recent studies from MIT, CMU, Boston Consulting Group, BBC, and Oxford Economics arguing that AI/LLM assistants do NOT improve productivity.

                          Walk-through here:

                          https://www.someweekendreading.blog/ai-update-2026/

                          jenniferplusplus@hachyderm.ioJ This user is from outside of this forum
                          jenniferplusplus@hachyderm.ioJ This user is from outside of this forum
                          jenniferplusplus@hachyderm.io
                          wrote sidst redigeret af
                          #50

                          @weekend_editor 👀 🔖

                          1 Reply Last reply
                          0
                          • jenniferplusplus@hachyderm.ioJ jenniferplusplus@hachyderm.io

                            And now we have actual research questions! It feels like it shouldn't take this long to get these, but w/e

                            1. Does AI assistance improve task completion productivity when new skills are required?
                            2. How does using AI assistance affect the development of these new skills?

                            We'll learn how the authors propose to answer these questions in the next chapter: Methods.

                            But first, there is a 6 year old in here demanding I play minecraft, and I'd rather do that.

                            To be continued... probbaly

                            jenniferplusplus@hachyderm.ioJ This user is from outside of this forum
                            jenniferplusplus@hachyderm.ioJ This user is from outside of this forum
                            jenniferplusplus@hachyderm.io
                            wrote sidst redigeret af
                            #51

                            Chapter 4. Methods.

                            Let's go

                            First, the task. It's uh. It's basically a shitty whiteboard coding interview. The assignment is to build a couple of demo projects for an async python library. One is a non-blocking ticker. The other is some I/O ("record retrieval", not clear if this is the local filesystem or what, but probably the local fs) with handling for missing files.

                            Both are implemented in a literal white board coding interview tool. The test group gets an AI chatbot button, and encouragement to use it. The control group doesn't.

                            /sigh

                            I just. Come on. If you were serious about this, it would be pocket change to do an actual study

                            glyph@mastodon.socialG jenniferplusplus@hachyderm.ioJ sci_photos@troet.cafeS 3 Replies Last reply
                            0
                            • jenniferplusplus@hachyderm.ioJ jenniferplusplus@hachyderm.io

                              Chapter 4. Methods.

                              Let's go

                              First, the task. It's uh. It's basically a shitty whiteboard coding interview. The assignment is to build a couple of demo projects for an async python library. One is a non-blocking ticker. The other is some I/O ("record retrieval", not clear if this is the local filesystem or what, but probably the local fs) with handling for missing files.

                              Both are implemented in a literal white board coding interview tool. The test group gets an AI chatbot button, and encouragement to use it. The control group doesn't.

                              /sigh

                              I just. Come on. If you were serious about this, it would be pocket change to do an actual study

                              glyph@mastodon.socialG This user is from outside of this forum
                              glyph@mastodon.socialG This user is from outside of this forum
                              glyph@mastodon.social
                              wrote sidst redigeret af
                              #52

                              @jenniferplusplus thank you so much for doing this. I skimmed and just couldn’t bring myself to read it all, and it’s nice to see someone doing a much deeper read but coming to largely the same conclusions.

                              jenniferplusplus@hachyderm.ioJ 1 Reply Last reply
                              0
                              • glyph@mastodon.socialG glyph@mastodon.social

                                @jenniferplusplus thank you so much for doing this. I skimmed and just couldn’t bring myself to read it all, and it’s nice to see someone doing a much deeper read but coming to largely the same conclusions.

                                jenniferplusplus@hachyderm.ioJ This user is from outside of this forum
                                jenniferplusplus@hachyderm.ioJ This user is from outside of this forum
                                jenniferplusplus@hachyderm.io
                                wrote sidst redigeret af
                                #53

                                @glyph i would do this more, but the format of academic papers is so cumbersome. The time I actually have available for it is on the couch, after the kid's in bed. But reading these things on a phone is basically impossible

                                r343l@freeradical.zoneR glyph@mastodon.socialG enigma@norden.socialE 3 Replies Last reply
                                0
                                • jenniferplusplus@hachyderm.ioJ jenniferplusplus@hachyderm.io

                                  @glyph i would do this more, but the format of academic papers is so cumbersome. The time I actually have available for it is on the couch, after the kid's in bed. But reading these things on a phone is basically impossible

                                  r343l@freeradical.zoneR This user is from outside of this forum
                                  r343l@freeradical.zoneR This user is from outside of this forum
                                  r343l@freeradical.zone
                                  wrote sidst redigeret af
                                  #54

                                  @jenniferplusplus @glyph I had only read the anthropic summary. I was struck by how even if all their methods and study design were great (& a good sample etc) the results seemed to very much indicate LLM use isn't as transformative as the hype with major risks of deskilling impacts. I was surprised they published it just reading their own summary. I guess they had to make lemonade from lemons??

                                  glyph@mastodon.socialG jenniferplusplus@hachyderm.ioJ 2 Replies Last reply
                                  0
                                  • jenniferplusplus@hachyderm.ioJ jenniferplusplus@hachyderm.io

                                    @glyph i would do this more, but the format of academic papers is so cumbersome. The time I actually have available for it is on the couch, after the kid's in bed. But reading these things on a phone is basically impossible

                                    glyph@mastodon.socialG This user is from outside of this forum
                                    glyph@mastodon.socialG This user is from outside of this forum
                                    glyph@mastodon.social
                                    wrote sidst redigeret af
                                    #55

                                    @jenniferplusplus all the more reason I appreciate you putting the effort in!

                                    1 Reply Last reply
                                    0
                                    • r343l@freeradical.zoneR r343l@freeradical.zone

                                      @jenniferplusplus @glyph I had only read the anthropic summary. I was struck by how even if all their methods and study design were great (& a good sample etc) the results seemed to very much indicate LLM use isn't as transformative as the hype with major risks of deskilling impacts. I was surprised they published it just reading their own summary. I guess they had to make lemonade from lemons??

                                      glyph@mastodon.socialG This user is from outside of this forum
                                      glyph@mastodon.socialG This user is from outside of this forum
                                      glyph@mastodon.social
                                      wrote sidst redigeret af
                                      #56

                                      @r343l @jenniferplusplus as I put it earlier today: https://mastodon.social/@glyph/115992279951399934

                                      1 Reply Last reply
                                      0
                                      • jenniferplusplus@hachyderm.ioJ jenniferplusplus@hachyderm.io

                                        @glyph i would do this more, but the format of academic papers is so cumbersome. The time I actually have available for it is on the couch, after the kid's in bed. But reading these things on a phone is basically impossible

                                        enigma@norden.socialE This user is from outside of this forum
                                        enigma@norden.socialE This user is from outside of this forum
                                        enigma@norden.social
                                        wrote sidst redigeret af
                                        #57

                                        @jenniferplusplus @glyph The industrial state today is a progressing milestone . But it has a history of 60 years. Turing test and Joseph Weizenbaum’s “Eliza” (same Test as Turing) are passed easily on any machine. But the myth of the ancient days about AI didnot change for many people.

                                        enigma@norden.socialE 1 Reply Last reply
                                        0
                                        • enigma@norden.socialE enigma@norden.social

                                          @jenniferplusplus @glyph The industrial state today is a progressing milestone . But it has a history of 60 years. Turing test and Joseph Weizenbaum’s “Eliza” (same Test as Turing) are passed easily on any machine. But the myth of the ancient days about AI didnot change for many people.

                                          enigma@norden.socialE This user is from outside of this forum
                                          enigma@norden.socialE This user is from outside of this forum
                                          enigma@norden.social
                                          wrote sidst redigeret af
                                          #58

                                          @jenniferplusplus @glyph I , older too, compare AI often with the moon landing of the 1960ies when AI also started professionally at the MIT, USA. The most confusing inquiry about Apollo’s success was : Now that we reached this goal that millions dreamed about what do we want there ? And what is our next stepping stone ?

                                          enigma@norden.socialE 1 Reply Last reply
                                          0
                                          Svar
                                          • Svar som emne
                                          Login for at svare
                                          • Ældste til nyeste
                                          • Nyeste til ældste
                                          • Most Votes


                                          • Log ind

                                          • Har du ikke en konto? Tilmeld

                                          • Login or register to search.
                                          Powered by NodeBB Contributors
                                          Graciously hosted by data.coop
                                          • First post
                                            Last post
                                          0
                                          • Hjem
                                          • Seneste
                                          • Etiketter
                                          • Populære
                                          • Verden
                                          • Bruger
                                          • Grupper