Skip to content
  • Hjem
  • Seneste
  • Etiketter
  • Populære
  • Verden
  • Bruger
  • Grupper
Temaer
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Kollaps
FARVEL BIG TECH
  1. Forside
  2. Ikke-kategoriseret
  3. The best time for a European organization to start building an independent European mirror of Wikipedia, not reliant on the US or WMF, was in December 2016.

The best time for a European organization to start building an independent European mirror of Wikipedia, not reliant on the US or WMF, was in December 2016.

Planlagt Fastgjort Låst Flyttet Ikke-kategoriseret
33 Indlæg 14 Posters 0 Visninger
  • Ældste til nyeste
  • Nyeste til ældste
  • Most Votes
Svar
  • Svar som emne
Login for at svare
Denne tråd er blevet slettet. Kun brugere med emne behandlings privilegier kan se den.
  • thedj@mastodon.socialT thedj@mastodon.social

    @bvibber @davidgerard @luis_in_brief my back of the envelope guess would be about 3 million a year for an org to run a reliable dark archive and 8 million a year to actually start running something that you could visit (not edit). and those numbers assume the foundation is still doing its job next to what the eu side would be doing and assumes a redundant, non-US, sovereign cloud or self hosted solution.
    Only gets more expensive after that.

    thedj@mastodon.socialT This user is from outside of this forum
    thedj@mastodon.socialT This user is from outside of this forum
    thedj@mastodon.social
    wrote sidst redigeret af
    #11

    Cost goes into starting an org and hiring. Custom mirror engineering, immediate legal costs over content disputes and ddos defense.

    1 Reply Last reply
    0
    • profpatsch@mastodon.xyzP profpatsch@mastodon.xyz

      @luis_in_brief why? In the worst case, there's probably more than one mirror to start one when the need arises

      thedj@mastodon.socialT This user is from outside of this forum
      thedj@mastodon.socialT This user is from outside of this forum
      thedj@mastodon.social
      wrote sidst redigeret af
      #12

      @Profpatsch @luis_in_brief there isnt even any mirror at all of the media files. No one wants to bear that cost.

      thedj@mastodon.socialT R 2 Replies Last reply
      0
      • thedj@mastodon.socialT thedj@mastodon.social

        @Profpatsch @luis_in_brief there isnt even any mirror at all of the media files. No one wants to bear that cost.

        thedj@mastodon.socialT This user is from outside of this forum
        thedj@mastodon.socialT This user is from outside of this forum
        thedj@mastodon.social
        wrote sidst redigeret af
        #13

        @Profpatsch @luis_in_brief the mirrors also largely exclude all the process behind what the audience sees. And that process is probably more important to mirror than the raw ‘ready’ latest version of ‘some’ of the content.

        johanempa@mastodon.greenJ 1 Reply Last reply
        0
        • davidgerard@circumstances.runD davidgerard@circumstances.run

          @luis_in_brief god, the cost of WMF-level infra. has anyone got an estimate of what it would cost to stand up and run? @bvibber do you have numbers you are able to say?

          iain@kolektiva.socialI This user is from outside of this forum
          iain@kolektiva.socialI This user is from outside of this forum
          iain@kolektiva.social
          wrote sidst redigeret af
          #14

          @davidgerard @luis_in_brief @bvibber Wikipedia, but decentralised!

          luis_in_brief@social.coopL 1 Reply Last reply
          0
          • thedj@mastodon.socialT thedj@mastodon.social

            @Profpatsch @luis_in_brief the mirrors also largely exclude all the process behind what the audience sees. And that process is probably more important to mirror than the raw ‘ready’ latest version of ‘some’ of the content.

            johanempa@mastodon.greenJ This user is from outside of this forum
            johanempa@mastodon.greenJ This user is from outside of this forum
            johanempa@mastodon.green
            wrote sidst redigeret af
            #15

            @TheDJ
            Do you mind if I ask what you mean with 'all the process'?

            @Profpatsch @luis_in_brief

            1 Reply Last reply
            0
            • gedankenstuecke@scholar.socialG gedankenstuecke@scholar.social

              @davidgerard @luis_in_brief @bvibber for a start one could do a dark archive ready as a fail-over, compare with arxiv https://blog.tib.eu/2025/05/14/protecting-science-tib-builds-dark-archive-for-arxiv/

              luis_in_brief@social.coopL This user is from outside of this forum
              luis_in_brief@social.coopL This user is from outside of this forum
              luis_in_brief@social.coop
              wrote sidst redigeret af
              #16

              @davidgerard @bvibber @gedankenstuecke yeah, there’s a lot of things that could be done very differently (and very incrementally) if you’re trying to make Wikipedia independently available rather than cloning WMF.

              1 Reply Last reply
              0
              • profpatsch@mastodon.xyzP profpatsch@mastodon.xyz

                @luis_in_brief why? In the worst case, there's probably more than one mirror to start one when the need arises

                luis_in_brief@social.coopL This user is from outside of this forum
                luis_in_brief@social.coopL This user is from outside of this forum
                luis_in_brief@social.coop
                wrote sidst redigeret af
                #17

                @Profpatsch there are no deliberate, complete, formal mirrors that I know of. Archive might have one?

                1 Reply Last reply
                0
                • thedj@mastodon.socialT thedj@mastodon.social

                  @bvibber @davidgerard @luis_in_brief my back of the envelope guess would be about 3 million a year for an org to run a reliable dark archive and 8 million a year to actually start running something that you could visit (not edit). and those numbers assume the foundation is still doing its job next to what the eu side would be doing and assumes a redundant, non-US, sovereign cloud or self hosted solution.
                  Only gets more expensive after that.

                  woozle@toot.catW This user is from outside of this forum
                  woozle@toot.catW This user is from outside of this forum
                  woozle@toot.cat
                  wrote sidst redigeret af
                  #18

                  @TheDJ @bvibber @davidgerard @luis_in_brief

                  I'd say the primary goal should be a read-only mirror. Once you've got that, you've got defense against WP going dark, becoming inaccessible, or having stuff deleted by political edict -- and can move forward with additional pieces (editability, development...) once any significant shoes drop.

                  You wouldn't need an org anything like the size of MediaWiki just for that. At a glance, I'd say it's within reach of a modestly wealthy individual or a small org.

                  • Size of Wikipedia
                  • Wikipedia Database download
                  bweller@mstdn.socialB 1 Reply Last reply
                  1
                  0
                  • iain@kolektiva.socialI iain@kolektiva.social

                    @davidgerard @luis_in_brief @bvibber Wikipedia, but decentralised!

                    luis_in_brief@social.coopL This user is from outside of this forum
                    luis_in_brief@social.coopL This user is from outside of this forum
                    luis_in_brief@social.coop
                    wrote sidst redigeret af
                    #19

                    @davidgerard @bvibber @iain I would settle for “robustly and independently mirrored” right now.

                    iain@kolektiva.socialI 1 Reply Last reply
                    0
                    • luis_in_brief@social.coopL luis_in_brief@social.coop

                      @davidgerard @bvibber @iain I would settle for “robustly and independently mirrored” right now.

                      iain@kolektiva.socialI This user is from outside of this forum
                      iain@kolektiva.socialI This user is from outside of this forum
                      iain@kolektiva.social
                      wrote sidst redigeret af
                      #20

                      @luis_in_brief @davidgerard my strategy is downloading the text only archive every so often and putting it on my NAS, but that's not exactly a scalable solution

                      1 Reply Last reply
                      0
                      • bvibber@wikis.worldB bvibber@wikis.world

                        @davidgerard @luis_in_brief i do not have numbers offhand, but wmf annual budget is a strict upper bound 😉

                        luis_in_brief@social.coopL This user is from outside of this forum
                        luis_in_brief@social.coopL This user is from outside of this forum
                        luis_in_brief@social.coop
                        wrote sidst redigeret af
                        #21

                        @bvibber @davidgerard also good news, traffic is falling off a cliff so even if your goal were to split traffic 50-50 with the official site (unlikely for a variety of reasons) that’s 20% less outbound bandwidth to pay for than this time last year 🤪

                        1 Reply Last reply
                        0
                        • thedj@mastodon.socialT thedj@mastodon.social

                          @Profpatsch @luis_in_brief there isnt even any mirror at all of the media files. No one wants to bear that cost.

                          R This user is from outside of this forum
                          R This user is from outside of this forum
                          raulmatias@mstdn.social
                          wrote sidst redigeret af
                          #22

                          @TheDJ @Profpatsch @luis_in_brief A read-only mirror with all pages and all pages' revisions (approximately 5-10 TB, including talk pages, user pages, etc) and thumbnails of all used files hosted locally would be a few hundred TB at most (the entirety of Wikimedia Commons is just ~1 PB <https://commons.wikimedia.org/wiki/Special:MediaStatistics>, and that's all files ever uploaded to Wikimedia servers with their original sizes and revision history); not very much.

                          R 1 Reply Last reply
                          0
                          • R raulmatias@mstdn.social

                            @TheDJ @Profpatsch @luis_in_brief A read-only mirror with all pages and all pages' revisions (approximately 5-10 TB, including talk pages, user pages, etc) and thumbnails of all used files hosted locally would be a few hundred TB at most (the entirety of Wikimedia Commons is just ~1 PB <https://commons.wikimedia.org/wiki/Special:MediaStatistics>, and that's all files ever uploaded to Wikimedia servers with their original sizes and revision history); not very much.

                            R This user is from outside of this forum
                            R This user is from outside of this forum
                            raulmatias@mstdn.social
                            wrote sidst redigeret af
                            #23

                            @TheDJ @Profpatsch @luis_in_brief Getting the thumbnails will be a problem, because the last dump of all media files is 13 years old <https://ftpmirror.your.org/pub/wikimedia/images/wikipedia/commons/f/>, but I guess it's possible to just rip them off a "maxi" Kiwix dump. These only have files used in articles, though.

                            thedj@mastodon.socialT 1 Reply Last reply
                            0
                            • luis_in_brief@social.coopL luis_in_brief@social.coop

                              The best time for a European organization to start building an independent European mirror of Wikipedia, not reliant on the US or WMF, was in December 2016.

                              The second-best time is today.

                              R This user is from outside of this forum
                              R This user is from outside of this forum
                              raulmatias@mstdn.social
                              wrote sidst redigeret af
                              #24

                              @luis_in_brief Honestly I wouldn't even oppose mirrors hosted by the Wikimedia Foundation themselves. wikipedia.ph, wikipedia.is, wikipedia.gl, etc.

                              luis_in_brief@social.coopL R 2 Replies Last reply
                              0
                              • R raulmatias@mstdn.social

                                @luis_in_brief Honestly I wouldn't even oppose mirrors hosted by the Wikimedia Foundation themselves. wikipedia.ph, wikipedia.is, wikipedia.gl, etc.

                                luis_in_brief@social.coopL This user is from outside of this forum
                                luis_in_brief@social.coopL This user is from outside of this forum
                                luis_in_brief@social.coop
                                wrote sidst redigeret af
                                #25

                                @raulmatias no, that’s a worst-case scenario: exposes WMF to local laws/pressure, and keeps WMF as a single point of failure.

                                R 1 Reply Last reply
                                0
                                • R raulmatias@mstdn.social

                                  @luis_in_brief Honestly I wouldn't even oppose mirrors hosted by the Wikimedia Foundation themselves. wikipedia.ph, wikipedia.is, wikipedia.gl, etc.

                                  R This user is from outside of this forum
                                  R This user is from outside of this forum
                                  raulmatias@mstdn.social
                                  wrote sidst redigeret af
                                  #26

                                  @luis_in_brief Possibly with data being served from different data centers located in different countries for each domain.

                                  1 Reply Last reply
                                  0
                                  • luis_in_brief@social.coopL luis_in_brief@social.coop

                                    @raulmatias no, that’s a worst-case scenario: exposes WMF to local laws/pressure, and keeps WMF as a single point of failure.

                                    R This user is from outside of this forum
                                    R This user is from outside of this forum
                                    raulmatias@mstdn.social
                                    wrote sidst redigeret af
                                    #27

                                    @luis_in_brief
                                    >local laws/pressure

                                    .is is pretty much bulletproof, not sure about .ph and .gl.

                                    1 Reply Last reply
                                    0
                                    • R raulmatias@mstdn.social

                                      @TheDJ @Profpatsch @luis_in_brief Getting the thumbnails will be a problem, because the last dump of all media files is 13 years old <https://ftpmirror.your.org/pub/wikimedia/images/wikipedia/commons/f/>, but I guess it's possible to just rip them off a "maxi" Kiwix dump. These only have files used in articles, though.

                                      thedj@mastodon.socialT This user is from outside of this forum
                                      thedj@mastodon.socialT This user is from outside of this forum
                                      thedj@mastodon.social
                                      wrote sidst redigeret af
                                      #28

                                      @raulmatias @Profpatsch @luis_in_brief yes, I know all this. 20+ year mediawiki dev here 😉

                                      1 Reply Last reply
                                      0
                                      • luis_in_brief@social.coopL luis_in_brief@social.coop

                                        The best time for a European organization to start building an independent European mirror of Wikipedia, not reliant on the US or WMF, was in December 2016.

                                        The second-best time is today.

                                        mementomori85@mastodon.socialM This user is from outside of this forum
                                        mementomori85@mastodon.socialM This user is from outside of this forum
                                        mementomori85@mastodon.social
                                        wrote sidst redigeret af
                                        #29

                                        @luis_in_brief true

                                        1 Reply Last reply
                                        0
                                        • luis_in_brief@social.coopL luis_in_brief@social.coop

                                          The best time for a European organization to start building an independent European mirror of Wikipedia, not reliant on the US or WMF, was in December 2016.

                                          The second-best time is today.

                                          simon@en.osm.townS This user is from outside of this forum
                                          simon@en.osm.townS This user is from outside of this forum
                                          simon@en.osm.town
                                          wrote sidst redigeret af
                                          #30

                                          @luis_in_brief Just as #OpenStreetMap Wikipedia is legally forkable but not in practical terms because you can't clone the contributor community. Without that you just end up with a stale copy.

                                          luis_in_brief@social.coopL 1 Reply Last reply
                                          0
                                          Svar
                                          • Svar som emne
                                          Login for at svare
                                          • Ældste til nyeste
                                          • Nyeste til ældste
                                          • Most Votes


                                          • Log ind

                                          • Har du ikke en konto? Tilmeld

                                          • Login or register to search.
                                          Powered by NodeBB Contributors
                                          Graciously hosted by data.coop
                                          • First post
                                            Last post
                                          0
                                          • Hjem
                                          • Seneste
                                          • Etiketter
                                          • Populære
                                          • Verden
                                          • Bruger
                                          • Grupper