Skip to content
  • Hjem
  • Seneste
  • Etiketter
  • Populære
  • Verden
  • Bruger
  • Grupper
Temaer
  • Light
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Kollaps
FARVEL BIG TECH
  1. Forside
  2. Ikke-kategoriseret
  3. as some of you may know i made sth called [bombai](https://git.tudbut.de/tudbut/bombai) (name comes from bomba and ai) thats kinda like anubis or iocaine in purpose.

as some of you may know i made sth called [bombai](https://git.tudbut.de/tudbut/bombai) (name comes from bomba and ai) thats kinda like anubis or iocaine in purpose.

Planlagt Fastgjort Låst Flyttet Ikke-kategoriseret
forgejoopensource
1 Indlæg 1 Posters 0 Visninger
  • Ældste til nyeste
  • Nyeste til ældste
  • Most Votes
Svar
  • Svar som emne
Login for at svare
Denne tråd er blevet slettet. Kun brugere med emne behandlings privilegier kan se den.
  • tudbut@social.tudbut.deT This user is from outside of this forum
    tudbut@social.tudbut.deT This user is from outside of this forum
    tudbut@social.tudbut.de
    wrote sidst redigeret af
    #1

    as some of you may know i made sth called bombai (name comes from bomba and ai) thats kinda like anubis or iocaine in purpose. those solutions are kind of “fine” already, but the thing about them is that anubis sucks for regular users too, and isnt always effective, and that iocaine usually relies on lists or similar things.

    scrapers unfortunately come in all shapes and sizes and with new user agents or hidden ones all the time. after using anubis for a while, i got my forgejo downed again and so i went looking. iocaine seems like a good idea, but i want something that is sure to stop my git from going down even if i dont maintain it or the lists are incomplete.


    what i made now does the following:

    • very configurable detection entirely based on behaviour, without modifying site content
      • request counting
      • one fail = timeout, continuously resetting if attempts continue (this can be excellently combined with trap paths)
      • weighted by path and such
      • blobbing entire subnets together if desired (needed for alibaba’s bot for example)
      • allows setting up “trap paths” that instantly flag someone for timeout upon visit
    • customizable response
      • redirection to iocaine or other trap
      • zip bombs (small ones usually since most scrapers are smart enough to not decompress them fully otherwise - but it makes it cheaper on bandwidth either way)
      • maze similar to but less sophisticated than iocaine
      • plain http or html response from file

    it is worth mentioning especially explicitly that paths that are expensive for the server to provide (in storage or otherwise), can be thus limited extremely well, and of course most scrapers are blocked even far before the first time they request such a path.

    if interest exists and help is needed setting it up or you just want to chat about it, DO IT I WOULD LOVE THAT.

    if a bot makes it through that is capable of doing any harm, i consider that a bug immediately. make an issue and ill debug either your config or the program itself.

    i use this myself for a bit now, and it works excellently for me with my forgejo.

    #opensource #forgejo

    1 Reply Last reply
    1
    0
    • jwcph@helvede.netJ jwcph@helvede.net shared this topic
    Svar
    • Svar som emne
    Login for at svare
    • Ældste til nyeste
    • Nyeste til ældste
    • Most Votes


    • Log ind

    • Har du ikke en konto? Tilmeld

    • Login or register to search.
    Powered by NodeBB Contributors
    Graciously hosted by data.coop
    • First post
      Last post
    0
    • Hjem
    • Seneste
    • Etiketter
    • Populære
    • Verden
    • Bruger
    • Grupper