We'll see how I feel in the morning, but for now I seem to have convinced myself to actually read that fuckin Anthropic paper

lispi314@udongein.xyz

@jenniferplusplus@hachyderm.io > Is it normal to begin these things by confidently asserting your priors as fact, unsupported by anything in the study?

Not to my knowledge, no.

Summary of the document and hypothesis goes there.

Confident assertion is a maybe in the conclusion (some fields do lend themselves to unambiguous provable assertions) and generally it’s more of a recap of prior analysis.

lispi314@udongein.xyz

@dalias @jenniferplusplus Only if it's a bad paper.

Especially if it then goes on to debunk those very same assumptions while refusing to remark on it.

This is distinct from presenting a premise as a hypothetical to verify.

jenniferplusplus@hachyderm.io

So. The test conditions were weirdly high stress, for no particular reason the study makes clear. Or even acknowledges. The stress was *higher* on the control group. And the control group had to use inferior tooling.

I don't see how this data can be used to support any quantitative conclusion at all.

Qualitatively, I suspect there is some value in the clusters of AI usage patterns they observed. But that's not what anyone is talking about when they talk about this study.

sci_photos@troet.cafe

Yes, that's one important aspect during teaching/learning. @inthehands @jenniferplusplus

jenniferplusplus@hachyderm.io

And then there's one more detail. I'm not sure how I should be thinking about this, but it feels very relevant. All of the study subjects were recruited through a crowd working platform. That adds a whole extra concern about the subjects' standing on the platform. It means that in some sense undertaking this study was their job, and the instruction given in the project brief was not just instruction to a participant in a study, but requirements given to a worker.

I know this kind of thing is not unusual in studies like this. But it feels like a complicating factor that I can't see the edges of.

jenniferplusplus@hachyderm.io

@realn2s Lower grades are, indeed, worse.

The AI did seem to speed things up, but not enough to achieve statistical significance. And as I describe further down the thread (just now, not suggesting you didn't read far enough), the AI chatbot seems to have been the only supportive tooling that was available. So it's not so much the difference between AI or not, as the difference between support tools or not.

jenniferplusplus@hachyderm.io

@jsbarretto That's not what people mean when they say system design.

They mean which way do dependencies flow. What is the scope of responsibility for this thing. How will it communicate with other things. How does the collection of things remain in a consistent state.

For example.

tartley@fosstodon.org

@jenniferplusplus Holy carp this is a fabulous (slash shocking) thread. Thanks for taking the time.

jenniferplusplus@hachyderm.io

@hrefna I'm finding it frustrating, mainly

jenniferplusplus@hachyderm.io

But now it's 1am. I may pick this up tomorrow, I'm not sure. If I do, the next chapter is their analysis. Seems like there would be things in there that merit comment.

jenniferplusplus@hachyderm.io

Actually, hang on. One more thing occurred to me. Does this exacerbate the difficulty of replication, given that the simple passage of time will render this library no longer new?

And now I'm done for the night, for real

https://hachyderm.io/@jenniferplusplus/115991499531084541

realn2s@infosec.exchange

@jenniferplusplus

I indeed asked the question before I had finished the thread.
I was very confused and in some ways still am.
How can the authors of the paper think all this is an argument for AI (which I believe they do)?