FARVEL BIG TECH
The best time for a European organization to start building an independent European mirror of Wikipedia, not reliant on the US or WMF, was in December 2016.

  • davidgerard@circumstances.run wrote:

    @luis_in_brief god, the cost of WMF-level infra. has anyone got an estimate of what it would cost to stand up and run? @bvibber do you have numbers you are able to say?

    bvibber@wikis.world wrote (#8), replying to davidgerard@circumstances.run:

    @davidgerard @luis_in_brief i do not have numbers offhand, but wmf annual budget is a strict upper bound 😉

    • luis_in_brief@social.coop wrote (original post):

      The best time for a European organization to start building an independent European mirror of Wikipedia, not reliant on the US or WMF, was in December 2016.

      The second-best time is today.

      profpatsch@mastodon.xyz wrote (#9), replying to luis_in_brief@social.coop:

      @luis_in_brief why? In the worst case, there's probably more than one mirror to start from when the need arises

        thedj@mastodon.social wrote (#10), replying to bvibber@wikis.world (#8):

        @bvibber @davidgerard @luis_in_brief my back-of-the-envelope guess would be about 3 million a year for an org to run a reliable dark archive, and 8 million a year to actually start running something that you could visit (not edit). Those numbers assume the foundation is still doing its job alongside what the EU side would be doing, and assume a redundant, non-US, sovereign-cloud or self-hosted solution.
        It only gets more expensive after that.

          thedj@mastodon.social wrote (#11), continuing from #10:

          The cost goes into starting an org and hiring, custom mirror engineering, immediate legal costs over content disputes, and DDoS defense.

            thedj@mastodon.social wrote (#12), replying to profpatsch@mastodon.xyz (#9):

            @Profpatsch @luis_in_brief there isn't even any mirror at all of the media files. No one wants to bear that cost.

              thedj@mastodon.social wrote (#13), continuing from #12:

              @Profpatsch @luis_in_brief the mirrors also largely exclude all the process behind what the audience sees. And that process is probably more important to mirror than the raw ‘ready’ latest version of ‘some’ of the content.

                iain@kolektiva.social wrote (#14), replying to davidgerard@circumstances.run:

                @davidgerard @luis_in_brief @bvibber Wikipedia, but decentralised!

                  johanempa@mastodon.green wrote (#15), replying to thedj@mastodon.social (#13):

                  @TheDJ
                  Do you mind if I ask what you mean by 'all the process'?

                  @Profpatsch @luis_in_brief

                  • gedankenstuecke@scholar.social wrote:

                    @davidgerard @luis_in_brief @bvibber for a start one could do a dark archive ready as a fail-over, compare with arxiv https://blog.tib.eu/2025/05/14/protecting-science-tib-builds-dark-archive-for-arxiv/

                    luis_in_brief@social.coop wrote (#16), replying to gedankenstuecke@scholar.social:

                    @davidgerard @bvibber @gedankenstuecke yeah, there’s a lot of things that could be done very differently (and very incrementally) if you’re trying to make Wikipedia independently available rather than cloning WMF.

                      luis_in_brief@social.coop wrote (#17), replying to profpatsch@mastodon.xyz (#9):

                      @Profpatsch there are no deliberate, complete, formal mirrors that I know of. Archive might have one?

                        woozle@toot.cat wrote (#18), replying to thedj@mastodon.social (#10):

                        @TheDJ @bvibber @davidgerard @luis_in_brief

                        I'd say the primary goal should be a read-only mirror. Once you've got that, you've got defense against WP going dark, becoming inaccessible, or having stuff deleted by political edict -- and can move forward with additional pieces (editability, development...) once any significant shoes drop.

                        You wouldn't need an org anything like the size of the Wikimedia Foundation just for that. At a glance, I'd say it's within reach of a modestly wealthy individual or a small org.

                        • Size of Wikipedia
                        • Wikipedia Database download
                          luis_in_brief@social.coop wrote (#19), replying to iain@kolektiva.social (#14):

                          @davidgerard @bvibber @iain I would settle for “robustly and independently mirrored” right now.

                            iain@kolektiva.social wrote (#20), replying to luis_in_brief@social.coop (#19):

                            @luis_in_brief @davidgerard my strategy is downloading the text only archive every so often and putting it on my NAS, but that's not exactly a scalable solution
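
Editor's sketch of the "download the text-only archive every so often" approach described in #20. The post does not say which file or tool is used, so this is only an illustration under assumptions: it assumes the `pages-articles` dump published under `dumps.wikimedia.org/<wiki>/latest/`, and the names `dump_url`, `fetch_dump`, and the `opener` parameter are hypothetical helpers, not anything from the thread. Nothing runs on import; scheduling (cron, a NAS task) is left out.

```python
import datetime
import pathlib
import shutil
import urllib.request

DUMP_HOST = "https://dumps.wikimedia.org"


def dump_url(wiki: str = "enwiki") -> str:
    """URL of the latest text-only (pages-articles) dump for one wiki."""
    return f"{DUMP_HOST}/{wiki}/latest/{wiki}-latest-pages-articles.xml.bz2"


def fetch_dump(wiki, dest_dir, today=None, opener=urllib.request.urlopen):
    """Download the dump into dest_dir under a dated name.

    Returns the path of the stored file. If a copy with today's date is
    already present, nothing is downloaded. `opener` is an assumption added
    so the network call can be swapped out (for example in a dry run).
    """
    today = today or datetime.date.today()
    dest_dir = pathlib.Path(dest_dir)
    dest_dir.mkdir(parents=True, exist_ok=True)

    target = dest_dir / f"{wiki}-{today.isoformat()}-pages-articles.xml.bz2"
    if target.exists():
        return target

    # Stream to a temporary ".part" file so an interrupted transfer never
    # leaves a truncated file under the final name.
    partial = target.with_name(target.name + ".part")
    with opener(dump_url(wiki)) as response, open(partial, "wb") as out:
        shutil.copyfileobj(response, out)
    partial.replace(target)
    return target
```

Calling `fetch_dump("enwiki", "/path/to/nas/wikipedia")` from a scheduled job would keep one dated copy per run. As the post says, this covers article text only and does not scale to media files or to a publicly served mirror.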

                              luis_in_brief@social.coop wrote (#21), replying to bvibber@wikis.world (#8):

                              @bvibber @davidgerard also good news, traffic is falling off a cliff so even if your goal were to split traffic 50-50 with the official site (unlikely for a variety of reasons) that’s 20% less outbound bandwidth to pay for than this time last year 🤪

                                raulmatias@mstdn.social wrote (#22), replying to thedj@mastodon.social (#12):

                                @TheDJ @Profpatsch @luis_in_brief A read-only mirror with all pages and all pages' revisions (approximately 5-10 TB, including talk pages, user pages, etc) and thumbnails of all used files hosted locally would be a few hundred TB at most (the entirety of Wikimedia Commons is just ~1 PB <https://commons.wikimedia.org/wiki/Special:MediaStatistics>, and that's all files ever uploaded to Wikimedia servers with their original sizes and revision history); not very much.
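
Editor's sketch of the sizing arithmetic in #22, using only the rough figures quoted there (5-10 TB of page text and revisions, roughly 1 PB for all of Wikimedia Commons). The post says "a few hundred TB at most" without a number, so the 300 TB ceiling below is a hypothetical stand-in, and the class and method names are illustrative only. These are not measurements.

```python
from dataclasses import dataclass

TB_PER_PB = 1000  # decimal units, as storage sizes are usually quoted


@dataclass(frozen=True)
class MirrorEstimate:
    text_tb_low: float = 5.0      # all pages and revisions, low end (from the post)
    text_tb_high: float = 10.0    # all pages and revisions, high end (from the post)
    mirror_tb_max: float = 300.0  # ASSUMPTION: stand-in for "a few hundred TB at most"
    commons_pb: float = 1.0       # approximate size of Wikimedia Commons (from the post)

    def commons_tb(self) -> float:
        return self.commons_pb * TB_PER_PB

    def mirror_fraction_of_commons(self) -> float:
        """Upper-bound mirror size as a fraction of the full Commons archive."""
        return self.mirror_tb_max / self.commons_tb()

    def text_share_of_mirror(self) -> tuple[float, float]:
        """Share of the mirror taken by text, at the low and high text estimates."""
        return (
            self.text_tb_low / self.mirror_tb_max,
            self.text_tb_high / self.mirror_tb_max,
        )


est = MirrorEstimate()
low, high = est.text_share_of_mirror()
print(f"mirror vs. Commons: {est.mirror_fraction_of_commons():.0%}")
print(f"text share of mirror: {low:.1%} to {high:.1%}")
```

Under these assumptions the text is a small slice of the total and thumbnails dominate, which is consistent with the point made in #12 that media files are the part nobody mirrors.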

                                  raulmatias@mstdn.social wrote (#23), continuing from #22:

                                  @TheDJ @Profpatsch @luis_in_brief Getting the thumbnails will be a problem, because the last dump of all media files is 13 years old <https://ftpmirror.your.org/pub/wikimedia/images/wikipedia/commons/f/>, but I guess it's possible to just rip them off a "maxi" Kiwix dump. These only have files used in articles, though.

                                    raulmatias@mstdn.social wrote (#24), replying to luis_in_brief@social.coop (original post):

                                    @luis_in_brief Honestly I wouldn't even oppose mirrors hosted by the Wikimedia Foundation themselves. wikipedia.ph, wikipedia.is, wikipedia.gl, etc.

                                      luis_in_brief@social.coop wrote (#25), replying to raulmatias@mstdn.social (#24):

                                      @raulmatias no, that’s a worst-case scenario: exposes WMF to local laws/pressure, and keeps WMF as a single point of failure.

                                        raulmatias@mstdn.social wrote (#26), continuing from #24:

                                        @luis_in_brief Possibly with data being served from different data centers located in different countries for each domain.

                                          raulmatias@mstdn.social wrote (#27), replying to luis_in_brief@social.coop (#25):

                                          @luis_in_brief
                                          >local laws/pressure

                                          .is is pretty much bulletproof, not sure about .ph and .gl.
