Skip to content
  • Hjem
  • Seneste
  • Etiketter
  • Populære
  • Verden
  • Bruger
  • Grupper
Temaer
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Kollaps
FARVEL BIG TECH
  1. Forside
  2. Ikke-kategoriseret
  3. My timeline (which contains a lot of project leaders/sysadmins from big projects) is filling with posts about a new, ongoing wave of what most likely are scrapers collecting training data for „AI“ companies.

My timeline (which contains a lot of project leaders/sysadmins from big projects) is filling with posts about a new, ongoing wave of what most likely are scrapers collecting training data for „AI“ companies.

Planlagt Fastgjort Låst Flyttet Ikke-kategoriseret
45 Indlæg 31 Posters 0 Visninger
  • Ældste til nyeste
  • Nyeste til ældste
  • Most Votes
Svar
  • Svar som emne
Login for at svare
Denne tråd er blevet slettet. Kun brugere med emne behandlings privilegier kan se den.
  • b_b@mastodon.roflcopter.frB b_b@mastodon.roflcopter.fr

    @algernon @grumpybozo @alan @jwildeboer Did you try to use that trick ? What tool did you used and did it works well ?

    algernon@come-from.mad-scientist.clubA This user is from outside of this forum
    algernon@come-from.mad-scientist.clubA This user is from outside of this forum
    algernon@come-from.mad-scientist.club
    wrote sidst redigeret af
    #41

    @b_b @grumpybozo @alan @jwildeboer I've been using this trick (+ a few tweaks) for about a year now, with iocaine, with great success.

    jwildeboer@social.wildeboer.netJ 1 Reply Last reply
    0
    • algernon@come-from.mad-scientist.clubA algernon@come-from.mad-scientist.club

      @b_b @grumpybozo @alan @jwildeboer I've been using this trick (+ a few tweaks) for about a year now, with iocaine, with great success.

      jwildeboer@social.wildeboer.netJ This user is from outside of this forum
      jwildeboer@social.wildeboer.netJ This user is from outside of this forum
      jwildeboer@social.wildeboer.net
      wrote sidst redigeret af
      #42

      @algernon A wonderful understatement. Perfect answer 🙂 @b_b @grumpybozo @alan

      1 Reply Last reply
      0
      • rytmis@hachyderm.ioR rytmis@hachyderm.io

        @agturcz @alan @jwildeboer

        Apparently many ”smart” TV manufacturers ship proxy SDKs from companies like Bright, and they turn the TVs into nodes in a botnet that is used for ”AI” data scraping, so the traffic comes from all over the place.

        I’d guess not many consumers know about it, let alone have the technical know-how to prevent it.

        profpatsch@mastodon.xyzP This user is from outside of this forum
        profpatsch@mastodon.xyzP This user is from outside of this forum
        profpatsch@mastodon.xyz
        wrote sidst redigeret af
        #43

        @rytmis @agturcz @alan @jwildeboer omg this is a monetization angle for TVs that is just so obvious when you consider the race to the bottom in that industry

        rytmis@hachyderm.ioR 1 Reply Last reply
        0
        • profpatsch@mastodon.xyzP profpatsch@mastodon.xyz

          @rytmis @agturcz @alan @jwildeboer omg this is a monetization angle for TVs that is just so obvious when you consider the race to the bottom in that industry

          rytmis@hachyderm.ioR This user is from outside of this forum
          rytmis@hachyderm.ioR This user is from outside of this forum
          rytmis@hachyderm.io
          wrote sidst redigeret af
          #44

          @Profpatsch @agturcz @alan @jwildeboer

          Yep. I just read about it some weeks back and immediately tried to look for dumb TVs as an alternative. Of course, they don’t really exist as a product category any more, so the next best thing was to block those things at the router. ☹️

          1 Reply Last reply
          0
          • jwildeboer@social.wildeboer.netJ jwildeboer@social.wildeboer.net

            My timeline (which contains a lot of project leaders/sysadmins from big projects) is filling with posts about a new, ongoing wave of what most likely are scrapers collecting training data for „AI“ companies. They seem to be using botnets (or what some call „residential IP proxies“ to make it sound a bit more legitimate) with millions of IP addresses, making it really hard to defend against. Some have decided to take their sites down until this is over. This is now the world we live in 😞

            nicd@masto.ahlcode.fiN This user is from outside of this forum
            nicd@masto.ahlcode.fiN This user is from outside of this forum
            nicd@masto.ahlcode.fi
            wrote sidst redigeret af
            #45

            @jwildeboer Just this week a repository in my Forgejo instance was under attack. In a day, I racked up over 130k distinct IPs with fail2ban and had to abandon that approach.

            I now have a simple trick that cut out practically all of the traffic, but I hesitate to share it as it's not difficult to work around… I wish we didn't have to resort to such things.

            1 Reply Last reply
            0
            • jwcph@helvede.netJ jwcph@helvede.net shared this topic
            Svar
            • Svar som emne
            Login for at svare
            • Ældste til nyeste
            • Nyeste til ældste
            • Most Votes


            • Log ind

            • Har du ikke en konto? Tilmeld

            • Login or register to search.
            Powered by NodeBB Contributors
            Graciously hosted by data.coop
            • First post
              Last post
            0
            • Hjem
            • Seneste
            • Etiketter
            • Populære
            • Verden
            • Bruger
            • Grupper