Farvel Big Tech | Indlæg oprettet af derderwish@social.chaotikum.org

derderwish@social.chaotikum.org

@KennedyRichard
Web crawler should have been a solved problem. But anyhow they decided to hack it themselves, in shitty…

derderwish@social.chaotikum.org

@KennedyRichard
Like this:
https://www.jwz.org/blog/2025/10/exterminate-all-rational-ai-scrapers-redux-redux/

derderwish@social.chaotikum.org

@KennedyRichard @sebastian @julesbl @PaulaToThePeople @MikeElgan
Friends with statically generated websites are unnecessary often crawled by these systems.
They ignore established standards like robots.txt and try to avoid any kind of regulation.
They try to force their way to get these information with the same shitty behavior they try to force the LLM shit into each piece of software.

derderwish@social.chaotikum.org

@KennedyRichard @sebastian @julesbl @PaulaToThePeople @MikeElgan
One problem with that idea is that the crawler from the LLMs are enormously crappy designed and will lead to noticeable traffic.

https://mastodon.social/@jwz/116608267656820166