Skip to content
  • Hjem
  • Seneste
  • Etiketter
  • Populære
  • Verden
  • Bruger
  • Grupper
Temaer
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Kollaps
FARVEL BIG TECH
  1. Forside
  2. Ikke-kategoriseret
  3. To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

Planlagt Fastgjort Låst Flyttet Ikke-kategoriseret
openstreetmapbotsabuse
22 Indlæg 11 Posters 0 Visninger
  • Ældste til nyeste
  • Nyeste til ældste
  • Most Votes
Svar
  • Svar som emne
Login for at svare
Denne tråd er blevet slettet. Kun brugere med emne behandlings privilegier kan se den.
  • osm_tech@en.osm.townO This user is from outside of this forum
    osm_tech@en.osm.townO This user is from outside of this forum
    osm_tech@en.osm.town
    wrote sidst redigeret af
    #1

    To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

    If you need OSM data, please don't scrape the website - use the official downloads at https://planet.openstreetmap.org
    🙏🌍 #AI #Bots #Abuse

    darkoneko@shelter.moeD michel42@norden.socialM boz@mastodon.unoB hunterz@mastodon.sdf.orgH utf_7@mastodon.socialU 7 Replies Last reply
    1
    0
    • osm_tech@en.osm.townO osm_tech@en.osm.town

      To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

      If you need OSM data, please don't scrape the website - use the official downloads at https://planet.openstreetmap.org
      🙏🌍 #AI #Bots #Abuse

      darkoneko@shelter.moeD This user is from outside of this forum
      darkoneko@shelter.moeD This user is from outside of this forum
      darkoneko@shelter.moe
      wrote sidst redigeret af
      #2

      @osm_tech oooooooof

      1 Reply Last reply
      0
      • osm_tech@en.osm.townO osm_tech@en.osm.town

        To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

        If you need OSM data, please don't scrape the website - use the official downloads at https://planet.openstreetmap.org
        🙏🌍 #AI #Bots #Abuse

        michel42@norden.socialM This user is from outside of this forum
        michel42@norden.socialM This user is from outside of this forum
        michel42@norden.social
        wrote sidst redigeret af
        #3

        @osm_tech do you want to share this IP-List?

        osm_tech@en.osm.townO 1 Reply Last reply
        0
        • osm_tech@en.osm.townO osm_tech@en.osm.town

          To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

          If you need OSM data, please don't scrape the website - use the official downloads at https://planet.openstreetmap.org
          🙏🌍 #AI #Bots #Abuse

          boz@mastodon.unoB This user is from outside of this forum
          boz@mastodon.unoB This user is from outside of this forum
          boz@mastodon.uno
          wrote sidst redigeret af
          #4

          @osm_tech Please survive! I hope many people in the world will sue these aggressive companies for being the clear cause of worldwide service interruptions and quality of service disruptions and increment in electric bills, server bills bandwidth bills, RAM increase needs, etc. - they are literally killing the web.

          osm_tech@en.osm.townO 1 Reply Last reply
          0
          • boz@mastodon.unoB boz@mastodon.uno

            @osm_tech Please survive! I hope many people in the world will sue these aggressive companies for being the clear cause of worldwide service interruptions and quality of service disruptions and increment in electric bills, server bills bandwidth bills, RAM increase needs, etc. - they are literally killing the web.

            osm_tech@en.osm.townO This user is from outside of this forum
            osm_tech@en.osm.townO This user is from outside of this forum
            osm_tech@en.osm.town
            wrote sidst redigeret af
            #5

            @boz We will survive and grow stronger thanks to our awesome mapping community 🙂 We continue to squeeze more capacity out of our servers, but eventually we'll need to upgrade despite the $$$ RAM prices.

            1 Reply Last reply
            0
            • michel42@norden.socialM michel42@norden.social

              @osm_tech do you want to share this IP-List?

              osm_tech@en.osm.townO This user is from outside of this forum
              osm_tech@en.osm.townO This user is from outside of this forum
              osm_tech@en.osm.town
              wrote sidst redigeret af
              #6

              @michel42 We'd like to share the IP address list, but unfortunately don't think we can due to legal concerns.

              1 Reply Last reply
              0
              • osm_tech@en.osm.townO osm_tech@en.osm.town

                To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

                If you need OSM data, please don't scrape the website - use the official downloads at https://planet.openstreetmap.org
                🙏🌍 #AI #Bots #Abuse

                hunterz@mastodon.sdf.orgH This user is from outside of this forum
                hunterz@mastodon.sdf.orgH This user is from outside of this forum
                hunterz@mastodon.sdf.org
                wrote sidst redigeret af
                #7

                @osm_tech does coming from residential IPs mean that someone has baked a scraper into some popular tool that people don't realize is doing that?

                ryanprior@mastodon.socialR artandtechnic@digipres.clubA 2 Replies Last reply
                0
                • hunterz@mastodon.sdf.orgH hunterz@mastodon.sdf.org

                  @osm_tech does coming from residential IPs mean that someone has baked a scraper into some popular tool that people don't realize is doing that?

                  ryanprior@mastodon.socialR This user is from outside of this forum
                  ryanprior@mastodon.socialR This user is from outside of this forum
                  ryanprior@mastodon.social
                  wrote sidst redigeret af
                  #8

                  @HunterZ @osm_tech this is actually quite common. Mobile advertising SDKs for games, background apps, etc include residential scraping proxy functionality that they can sell to the highest bidder, and then when scrapers want to avoid restrictions they can pay a fraction of a penny to send their requests via your phone. Millions of people use apps with this built in and have no idea. Most websites don't want to ban the residential scrapers because it can hurt growth.

                  tehstu@hachyderm.ioT 1 Reply Last reply
                  0
                  • osm_tech@en.osm.townO osm_tech@en.osm.town

                    To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

                    If you need OSM data, please don't scrape the website - use the official downloads at https://planet.openstreetmap.org
                    🙏🌍 #AI #Bots #Abuse

                    utf_7@mastodon.socialU This user is from outside of this forum
                    utf_7@mastodon.socialU This user is from outside of this forum
                    utf_7@mastodon.social
                    wrote sidst redigeret af
                    #9

                    @osm_tech how can you even scrape a mapsite? isnt osm just a big canvas like google maps that is also nearly impossible to automate?

                    osm_tech@en.osm.townO 1 Reply Last reply
                    0
                    • hunterz@mastodon.sdf.orgH hunterz@mastodon.sdf.org

                      @osm_tech does coming from residential IPs mean that someone has baked a scraper into some popular tool that people don't realize is doing that?

                      artandtechnic@digipres.clubA This user is from outside of this forum
                      artandtechnic@digipres.clubA This user is from outside of this forum
                      artandtechnic@digipres.club
                      wrote sidst redigeret af
                      #10

                      @HunterZ @osm_tech Yes. There’s also (at least) one BitTorrent client that has one built in as well.

                      1 Reply Last reply
                      0
                      • utf_7@mastodon.socialU utf_7@mastodon.social

                        @osm_tech how can you even scrape a mapsite? isnt osm just a big canvas like google maps that is also nearly impossible to automate?

                        osm_tech@en.osm.townO This user is from outside of this forum
                        osm_tech@en.osm.townO This user is from outside of this forum
                        osm_tech@en.osm.town
                        wrote sidst redigeret af
                        #11

                        @utf_7 It is madness, start here: https://www.openstreetmap.org/node/1 and keep going once you reach https://www.openstreetmap.org/node/10000000000, then start on ways, and relations 😛 or just download the latest weekly export from planet.openstreetmap.org 😏

                        utf_7@mastodon.socialU 1 Reply Last reply
                        0
                        • osm_tech@en.osm.townO osm_tech@en.osm.town

                          @utf_7 It is madness, start here: https://www.openstreetmap.org/node/1 and keep going once you reach https://www.openstreetmap.org/node/10000000000, then start on ways, and relations 😛 or just download the latest weekly export from planet.openstreetmap.org 😏

                          utf_7@mastodon.socialU This user is from outside of this forum
                          utf_7@mastodon.socialU This user is from outside of this forum
                          utf_7@mastodon.social
                          wrote sidst redigeret af
                          #12

                          @osm_tech uff, i am a noob so forgive my stupid question: cant you somehow limit the requests. like 10 requests per minute or so. normal users will not be affected and scrapers will take forever?

                          osm_tech@en.osm.townO 1 Reply Last reply
                          0
                          • ryanprior@mastodon.socialR ryanprior@mastodon.social

                            @HunterZ @osm_tech this is actually quite common. Mobile advertising SDKs for games, background apps, etc include residential scraping proxy functionality that they can sell to the highest bidder, and then when scrapers want to avoid restrictions they can pay a fraction of a penny to send their requests via your phone. Millions of people use apps with this built in and have no idea. Most websites don't want to ban the residential scrapers because it can hurt growth.

                            tehstu@hachyderm.ioT This user is from outside of this forum
                            tehstu@hachyderm.ioT This user is from outside of this forum
                            tehstu@hachyderm.io
                            wrote sidst redigeret af
                            #13

                            @ryanprior @HunterZ @osm_tech I had no idea this was a thing. And presumably, as requests come from you, not the advertiser, Pihole (and other network blockers) treat it as legitimate traffic?

                            ryanprior@mastodon.socialR hunterz@mastodon.sdf.orgH 2 Replies Last reply
                            0
                            • utf_7@mastodon.socialU utf_7@mastodon.social

                              @osm_tech uff, i am a noob so forgive my stupid question: cant you somehow limit the requests. like 10 requests per minute or so. normal users will not be affected and scrapers will take forever?

                              osm_tech@en.osm.townO This user is from outside of this forum
                              osm_tech@en.osm.townO This user is from outside of this forum
                              osm_tech@en.osm.town
                              wrote sidst redigeret af
                              #14

                              @utf_7 We've had 400,000 IPs in the last 24 hours. Each IP only does a few requests. Technically we're managing, but no fun fighting this daily rather than building new things.

                              utf_7@mastodon.socialU 1 Reply Last reply
                              0
                              • tehstu@hachyderm.ioT tehstu@hachyderm.io

                                @ryanprior @HunterZ @osm_tech I had no idea this was a thing. And presumably, as requests come from you, not the advertiser, Pihole (and other network blockers) treat it as legitimate traffic?

                                ryanprior@mastodon.socialR This user is from outside of this forum
                                ryanprior@mastodon.socialR This user is from outside of this forum
                                ryanprior@mastodon.social
                                wrote sidst redigeret af
                                #15

                                @tehstu @HunterZ @osm_tech anything your pihole would let you request, it'd let the scraper request. If the scraper wanted to scrape some ads from another network it might get blocked, I guess.

                                1 Reply Last reply
                                0
                                • tehstu@hachyderm.ioT tehstu@hachyderm.io

                                  @ryanprior @HunterZ @osm_tech I had no idea this was a thing. And presumably, as requests come from you, not the advertiser, Pihole (and other network blockers) treat it as legitimate traffic?

                                  hunterz@mastodon.sdf.orgH This user is from outside of this forum
                                  hunterz@mastodon.sdf.orgH This user is from outside of this forum
                                  hunterz@mastodon.sdf.org
                                  wrote sidst redigeret af
                                  #16

                                  @tehstu @ryanprior @osm_tech pihole works by refusing to provide DNS resolution for domains on its blocklists, so it could block a scraper *if* its functionality depends on resolving a domain name that is blocked by pihole.

                                  hunterz@mastodon.sdf.orgH 1 Reply Last reply
                                  0
                                  • hunterz@mastodon.sdf.orgH hunterz@mastodon.sdf.org

                                    @tehstu @ryanprior @osm_tech pihole works by refusing to provide DNS resolution for domains on its blocklists, so it could block a scraper *if* its functionality depends on resolving a domain name that is blocked by pihole.

                                    hunterz@mastodon.sdf.orgH This user is from outside of this forum
                                    hunterz@mastodon.sdf.orgH This user is from outside of this forum
                                    hunterz@mastodon.sdf.org
                                    wrote sidst redigeret af
                                    #17

                                    @tehstu @ryanprior @osm_tech oh and of course the scraper would have to respect pihole versus using its own hard coded DNS IP to resolve things.

                                    1 Reply Last reply
                                    0
                                    • osm_tech@en.osm.townO osm_tech@en.osm.town

                                      @utf_7 We've had 400,000 IPs in the last 24 hours. Each IP only does a few requests. Technically we're managing, but no fun fighting this daily rather than building new things.

                                      utf_7@mastodon.socialU This user is from outside of this forum
                                      utf_7@mastodon.socialU This user is from outside of this forum
                                      utf_7@mastodon.social
                                      wrote sidst redigeret af
                                      #18

                                      @osm_tech tHeN yOu jUsT neEd tO sCaLe

                                      osm_tech@en.osm.townO 1 Reply Last reply
                                      0
                                      • osm_tech@en.osm.townO osm_tech@en.osm.town

                                        To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

                                        If you need OSM data, please don't scrape the website - use the official downloads at https://planet.openstreetmap.org
                                        🙏🌍 #AI #Bots #Abuse

                                        jonsaenzagirre@mastodon.eusJ This user is from outside of this forum
                                        jonsaenzagirre@mastodon.eusJ This user is from outside of this forum
                                        jonsaenzagirre@mastodon.eus
                                        wrote sidst redigeret af
                                        #19

                                        @osm_tech question. Why do people scrape server which make the data freely available? And, probably, better structured in the final product. I don't see the point.

                                        osm_tech@en.osm.townO 1 Reply Last reply
                                        0
                                        • jonsaenzagirre@mastodon.eusJ jonsaenzagirre@mastodon.eus

                                          @osm_tech question. Why do people scrape server which make the data freely available? And, probably, better structured in the final product. I don't see the point.

                                          osm_tech@en.osm.townO This user is from outside of this forum
                                          osm_tech@en.osm.townO This user is from outside of this forum
                                          osm_tech@en.osm.town
                                          wrote sidst redigeret af
                                          #20

                                          @JonSaenzAgirre It is a good questions, and we don't know the answer either. Our planet data is so much easier to process and use.

                                          1 Reply Last reply
                                          0
                                          Svar
                                          • Svar som emne
                                          Login for at svare
                                          • Ældste til nyeste
                                          • Nyeste til ældste
                                          • Most Votes


                                          • Log ind

                                          • Har du ikke en konto? Tilmeld

                                          • Login or register to search.
                                          Powered by NodeBB Contributors
                                          Graciously hosted by data.coop
                                          • First post
                                            Last post
                                          0
                                          • Hjem
                                          • Seneste
                                          • Etiketter
                                          • Populære
                                          • Verden
                                          • Bruger
                                          • Grupper