Skip to content
  • Hjem
  • Seneste
  • Etiketter
  • Populære
  • Verden
  • Bruger
  • Grupper
Temaer
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Kollaps
FARVEL BIG TECH
derderwish@social.chaotikum.orgD

derderwish@social.chaotikum.org

@derderwish@social.chaotikum.org
About
Indlæg
4
Emner
0
Fremhævelser
0
Grupper
0
Følgere
0
Følger
0

Vis Original

Indlæg

Seneste Bedste Controversial

  • Now you can keep track of how many billions the AI companies are losing on AI.
    derderwish@social.chaotikum.orgD derderwish@social.chaotikum.org

    @KennedyRichard
    Web crawler should have been a solved problem. But anyhow they decided to hack it themselves, in shitty…
    🙄

    Ikke-kategoriseret machinesociety

  • Now you can keep track of how many billions the AI companies are losing on AI.
    derderwish@social.chaotikum.orgD derderwish@social.chaotikum.org

    @KennedyRichard
    Like this:
    https://www.jwz.org/blog/2025/10/exterminate-all-rational-ai-scrapers-redux-redux/

    Ikke-kategoriseret machinesociety

  • Now you can keep track of how many billions the AI companies are losing on AI.
    derderwish@social.chaotikum.orgD derderwish@social.chaotikum.org

    @KennedyRichard @sebastian @julesbl @PaulaToThePeople @MikeElgan
    Friends with statically generated websites are unnecessary often crawled by these systems.
    They ignore established standards like robots.txt and try to avoid any kind of regulation.
    They try to force their way to get these information with the same shitty behavior they try to force the LLM shit into each piece of software.

    Ikke-kategoriseret machinesociety

  • Now you can keep track of how many billions the AI companies are losing on AI.
    derderwish@social.chaotikum.orgD derderwish@social.chaotikum.org

    @KennedyRichard @sebastian @julesbl @PaulaToThePeople @MikeElgan
    One problem with that idea is that the crawler from the LLMs are enormously crappy designed and will lead to noticeable traffic.

    https://mastodon.social/@jwz/116608267656820166

    Ikke-kategoriseret machinesociety
  • Log ind

  • Har du ikke en konto? Tilmeld

  • Login or register to search.
Powered by NodeBB Contributors
Graciously hosted by data.coop
  • First post
    Last post
0
  • Hjem
  • Seneste
  • Etiketter
  • Populære
  • Verden
  • Bruger
  • Grupper