@KennedyRichard
Web crawler should have been a solved problem. But anyhow they decided to hack it themselves, in shitty…
derderwish@social.chaotikum.org
@derderwish@social.chaotikum.org
Indlæg
-
Now you can keep track of how many billions the AI companies are losing on AI. -
Now you can keep track of how many billions the AI companies are losing on AI. -
Now you can keep track of how many billions the AI companies are losing on AI.@KennedyRichard @sebastian @julesbl @PaulaToThePeople @MikeElgan
Friends with statically generated websites are unnecessary often crawled by these systems.
They ignore established standards like robots.txt and try to avoid any kind of regulation.
They try to force their way to get these information with the same shitty behavior they try to force the LLM shit into each piece of software. -
Now you can keep track of how many billions the AI companies are losing on AI.@KennedyRichard @sebastian @julesbl @PaulaToThePeople @MikeElgan
One problem with that idea is that the crawler from the LLMs are enormously crappy designed and will lead to noticeable traffic.