So there's this guy who made a tool where someone punches in their bluesky or mastodon credentials to his website, and it auto-crawls their feeds and produces an LLM summary of everyone it finds posting there.
-
@cthos I mean, I don't actually mind being indexed and I'm not necessarily opposed to my posts being processed via bot, I just object to this one tool as designed and want to opt out. I guess at this point a "bot" is likely to be an LLM?

@mcc yeah, odds are very good a scraper project is some sort of LLM trap these days.
-
@theorangetheme @mcc the suggestion was made to him around 6 hours ago so he likely hasn't taken any action on it yet.
-
Gross. You should tell @404mediaco, they might want to cover this. They posted an article about a company non-consensually scraping Zoom meetings for AI bullshit today:
https://www.404media.co/this-company-is-secretly-turning-your-zoom-calls-into-ai-podcasts/
@funcrunch @404mediaco I don't think that's a good idea at all. I'd be strongly against that. A news article about this subject would make people aware of the project, which would cause people to use the project, which is the opposite of what I want.
-
So there's this guy who made a tool where someone punches in their bluesky or mastodon credentials to his website, and it auto-crawls their feeds and produces an LLM summary of everyone it finds posting there. He was asked what people should do if we don't want to be mulched as content for his summary feeds. He said we should block him. I replied, I can do that, but that only stops *you* from running the tool on me, how do I prevent *your other users* from running your tool on me? He blocked me.
@mcc I have 96k toots. I wonder how much compute it would cost him to process it all. Can you share the username of this idiot so we too can block him?
-
Yeah, this prompted me to add #nobot to my profile, which is something I wasn't aware of. It will be interesting to see if he honors that. I'm sure we will hear more about this.
I think I may start setting most of my posts to followers only and have them auto destruct because of garbage like this.
-
@mcc I got suckered.
-
@quixoticgeek @mcc Large documents get fed easily into an LLM, thus 96k is a gravy shoemaker.
Rather, the quantity serves as additional chances to poison. Only need 250 samples that misuse a word to make an LLM erratic.
-
@mcc well, it was a matter of time, wasn't it?
I just found an example of something I wrote flattened by Gemini, which oddly enough proves the point of the original post.
It's not just invasive. It misrepresents as well.
They won't understand until it happens to them.
-
@mcc This sounds familiar. The last guy who tried something similar, his name starts with M but I forget his complete name lol
-
Anyway, the fact he's blocked me *partially* solves my problem, in that now he cannot LLM summarize me anymore, but the problem that possibly eventually a *second* person would use his tool remains unresolved.
Honestly, it's baffling that he added Mastodon support at all given that he's been here for years and thus saw some of the MANY YEARS of conflict and debate about the idea of people merely *archiving* or *indexing* Mastodon posts. And then he goes and uploads an auto-LLM-mulcher tool. IDK.
@mcc I think one of the big stories of the decade is the slow realisation that when folks say they are releasing things for everyone to do anything with, and used licences that encode that utterance, we don't in fact mean anyone, and we don't in fact mean anything
a lot of people are also realising, myself included, that the parties who can exploit and profit disproportionately from free stuff are the parties who are highly experienced at exploitation and profiteering. and the parties most immune from the social checks we have on harmful behaviour are sociopaths who can do the most harm.
it's a bummer
-
@mcc ...okay, THIS finally convinced me to set my toots to auto-expire.
What a nightmare.
-
@mcc What a shitty little creep. Thanks for the heads up. Not sure what I can do besides auto deleting posts, blocking said creep and his server but at least glad I've done that
-
@mcc he said you should block the people using the tool too, but you might have been blocked already for that
-
@mcc it was bound to happen
though I believe servers could mitigate this by placing limits on the number of queries a given user/IP/subnet/user agent could perform. Though that would impact people behind NAT and those who want to scroll and read a bit longer, and bots would probably just get around it by adding longer sleeps between queries. Still, in the end, if your client can display toots or other activities, so can bots.
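A rough sketch of that kind of per-client limit, in Python. Everything here (the class name, the key fields, the numbers) is made up for illustration and is not how any Mastodon server actually does it:

```python
import time
from collections import defaultdict, deque


class SlidingWindowLimiter:
    """Allow at most `limit` requests per `window` seconds for each key."""

    def __init__(self, limit, window, clock=time.monotonic):
        self.limit = limit
        self.window = window
        self.clock = clock  # injectable so the behaviour can be checked without sleeping
        self.hits = defaultdict(deque)  # key -> timestamps of recent allowed requests

    def allow(self, key):
        now = self.clock()
        recent = self.hits[key]
        # Forget requests that have aged out of the window.
        while recent and now - recent[0] >= self.window:
            recent.popleft()
        if len(recent) >= self.limit:
            return False  # over the limit: reject this request
        recent.append(now)
        return True


def client_key(user, ip, user_agent):
    # Hypothetical key. Which fields to combine is a policy choice:
    # keying on IP alone would lump together everyone behind one NAT.
    return (user, ip, user_agent)
```

Note that a client which waits out the window between bursts is allowed again, so a patient bot stays under the limit exactly as described above.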
-
@leberschnitzel A proposal I made before the block was that he could add everyone who uses the tool (as the tool does involve ingesting your login info) to a Bluesky list, so we could subscribe to that as a blocklist. His response to that seemed to be that the thing I wanted was inherently unreasonable. So should I block these folks or not block them? Strange.
-
@jablkoziemne The fact that anyone could have done this does not make doing it okay
-
@mcc it's very clear that he doesn't care about consent
-
@mcc it would appear to do no filtering of DMs https://github.com/seldo/zeitgeist/blob/main/app/api/mastodon/timeline/route.ts#L49
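For reference, Mastodon statuses carry a `visibility` field (`public`, `unlisted`, `private`, `direct`), so a filter could look roughly like this. It's a hypothetical Python sketch, not code from that repo (which is TypeScript), and the function name and the choice of what counts as shareable are my assumptions:

```python
# Assumption: only public and unlisted posts are fair game for any further processing.
SHAREABLE = frozenset({"public", "unlisted"})


def shareable_statuses(statuses, allowed=SHAREABLE):
    """Keep only statuses whose visibility is explicitly allowed.

    Statuses with a missing or unknown visibility are dropped (fail closed),
    so DMs ("direct") and followers-only posts ("private") never pass through.
    """
    return [s for s in statuses if s.get("visibility") in allowed]
```

Using an allowlist instead of just excluding `direct` means followers-only posts are left out too.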
-
@mcc Yeah. There's a thing going on here where that hits people in a sore spot (LLMs) that is in many ways out of bounds (you can't actually control other people's tools); the place it gets dicey is when you're running a service so you're promoting the use of the tools.
But in general, I'd be real mad at anyone who tried to control what I used to read with.
@aredridel @mcc I had a similar response to quote permissions: what good is turning off quoting for a public post when others can still use their "tools" to link to it? Someone explained to me that it's about making it easy to respect other people's wishes, for those who are inclined to do so.
Maybe it would be nice if this person added a more effective opt out mechanism? Or made their bot opt in? You'd still be free to implement your own LLM reading tool if you really want to.
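For what it's worth, an opt out check could be quite small. This is a Python sketch, assuming the tool sees each author's profile bio as the HTML `note` field that Mastodon returns on accounts, and treating `#nobot` in the bio as the opt out signal (a community convention, nothing enforces it). The function names are made up:

```python
import html
import re

TAG_RE = re.compile(r"<[^>]+>")  # crude HTML tag stripper, good enough for a bio
NOBOT_RE = re.compile(r"#nobot\b", re.IGNORECASE)


def has_opted_out(account):
    """True if the account's bio contains the #nobot hashtag."""
    note = account.get("note") or ""
    # Bios arrive as HTML and hashtags may be wrapped in <a>/<span> tags,
    # so strip tags and unescape entities before searching.
    text = html.unescape(TAG_RE.sub("", note))
    return bool(NOBOT_RE.search(text))


def drop_opted_out(statuses):
    """Remove statuses whose author has opted out."""
    return [s for s in statuses if not has_opted_out(s.get("account") or {})]
```

An opt in version would flip the default: skip everyone unless their bio carries some agreed-upon tag.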