Skip to content
  • Hjem
  • Seneste
  • Etiketter
  • Populære
  • Verden
  • Bruger
  • Grupper
Temaer
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Kollaps
FARVEL BIG TECH
  1. Forside
  2. Ikke-kategoriseret
  3. #WritersCoffeeClub Apr 24 Share a silly mistake you've made while writing.

#WritersCoffeeClub Apr 24 Share a silly mistake you've made while writing.

Planlagt Fastgjort Låst Flyttet Ikke-kategoriseret
writerscoffeecl
93 Indlæg 63 Posters 0 Visninger
  • Ældste til nyeste
  • Nyeste til ældste
  • Most Votes
Svar
  • Svar som emne
Login for at svare
Denne tråd er blevet slettet. Kun brugere med emne behandlings privilegier kan se den.
  • djrndm@chaos.socialD djrndm@chaos.social

    @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @cstross Clear case of idiot editor. Because one obviously can be space sensitive and only replace " pants " with " trousers " and th[e]n this should be no problem.

    cstross@wandering.shopC This user is from outside of this forum
    cstross@wandering.shopC This user is from outside of this forum
    cstross@wandering.shop
    wrote sidst redigeret af
    #70

    @DJRNDM @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord

    Groan.

    s/(\W+?)(pants)(\W+?)/\1trousers\3/ig

    You could use \b — match a word boundary — instead of \W+? (smallest count of non-word characters preceding the next regexp group) but that'd miss run-on strings ending in pants (eg. InterCappedpants).

    The pcre search modifiers s///ig are for case-insensitive and global.

    headword@lingo.lolH 1 Reply Last reply
    0
    • richcarl@mastodon.nuR richcarl@mastodon.nu

      @cstross Protip: always do big renamings via an intermediate nonsense string.
      1) Globally rename the original string 'pants' to something that doesn't occur anywhere else, like 'xyzyx'.
      2) Search for the new string and step through all occurrences to check for mistakes like 'particixyzyx' and fix them. This is now an easy task.
      3) Rename all placeholders to the final string.

      cstross@wandering.shopC This user is from outside of this forum
      cstross@wandering.shopC This user is from outside of this forum
      cstross@wandering.shop
      wrote sidst redigeret af
      #71

      @richcarl Or you could use a regular expression. Hint: I once rewrote a UNIX man page for regular expressions as part of my day job back in the early 1990s. None of your search/replace tips are news to me.

      richcarl@mastodon.nuR 1 Reply Last reply
      0
      • owent@mastodon.socialO owent@mastodon.social

        @alicemcalicepants @nullcolaship @davidtheeviloverlord @cstross

        pineywoozle@masto.aiP This user is from outside of this forum
        pineywoozle@masto.aiP This user is from outside of this forum
        pineywoozle@masto.ai
        wrote sidst redigeret af
        #72

        @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @cstross 🤣 🤣 🤣

        1 Reply Last reply
        0
        • cstross@wandering.shopC cstross@wandering.shop

          @richcarl Or you could use a regular expression. Hint: I once rewrote a UNIX man page for regular expressions as part of my day job back in the early 1990s. None of your search/replace tips are news to me.

          richcarl@mastodon.nuR This user is from outside of this forum
          richcarl@mastodon.nuR This user is from outside of this forum
          richcarl@mastodon.nu
          wrote sidst redigeret af
          #73

          @cstross Sure, regexps are great. If your editor supports them, and you know how to write them correctly, and the implementation doesn't have word boundary issues with utf-8. For any average writer stuck on an average text editor, I suggest the 3-step method.

          cstross@wandering.shopC 1 Reply Last reply
          0
          • richcarl@mastodon.nuR richcarl@mastodon.nu

            @cstross Sure, regexps are great. If your editor supports them, and you know how to write them correctly, and the implementation doesn't have word boundary issues with utf-8. For any average writer stuck on an average text editor, I suggest the 3-step method.

            cstross@wandering.shopC This user is from outside of this forum
            cstross@wandering.shopC This user is from outside of this forum
            cstross@wandering.shop
            wrote sidst redigeret af
            #74

            @richcarl I work in Scrivener, which includes pcre regexps. But you know even Microsoft Word has regexps these days? They're well-hidden and their implementation is typically Microsoftish (i.e. non-standard and missing a few features) but it's there in the search/replace dialog box. And the publishing industry runs on Word files—so much so that if you go the trad route you *have to* submit your manuscripts in docx format.

            So every non-amateur author uses Word or LibreOffice at some stage.

            1 Reply Last reply
            0
            • cstross@wandering.shopC cstross@wandering.shop

              @WellsiteGeo @quixoticgeek @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @edwinb No, they need to pad their search terms with non-word atoms (regular expressions are your friend!), i.e. \W+(search_word)\W+ (in perl-compatible regexp syntax).

              gsuberland@chaos.socialG This user is from outside of this forum
              gsuberland@chaos.socialG This user is from outside of this forum
              gsuberland@chaos.social
              wrote sidst redigeret af
              #75

              @cstross @WellsiteGeo @quixoticgeek @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @edwinb or [^\w-] instead of \W for a more careful approach, since the \W class will replace smarty-pants to smarty-trousers. hyphens are not included in \w, so the inverted class \W matches on them, which is unlikely to be what you want. [^\w-] works the same but doesn't treat hyphens as word boundaries to avoid the issue.

              gsuberland@chaos.socialG towo@chaos.socialT adamrice@c.imA flippac@types.plF 4 Replies Last reply
              0
              • smartmanapps@dotnet.socialS smartmanapps@dotnet.social

                @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @cstross

                alexanderdyas@mindly.socialA This user is from outside of this forum
                alexanderdyas@mindly.socialA This user is from outside of this forum
                alexanderdyas@mindly.social
                wrote sidst redigeret af
                #76

                @SmartmanApps @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @cstross To be fair, the one at the top is a plantain

                1 Reply Last reply
                0
                • gsuberland@chaos.socialG gsuberland@chaos.social

                  @cstross @WellsiteGeo @quixoticgeek @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @edwinb or [^\w-] instead of \W for a more careful approach, since the \W class will replace smarty-pants to smarty-trousers. hyphens are not included in \w, so the inverted class \W matches on them, which is unlikely to be what you want. [^\w-] works the same but doesn't treat hyphens as word boundaries to avoid the issue.

                  gsuberland@chaos.socialG This user is from outside of this forum
                  gsuberland@chaos.socialG This user is from outside of this forum
                  gsuberland@chaos.social
                  wrote sidst redigeret af
                  #77

                  @cstross @WellsiteGeo @quixoticgeek @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @edwinb annoyingly there's no standard character class that matches word boundaries in Latin script prose with high confidence, e.g. something along the lines of [\s"“”„;:!?¡¿‽.,()\[\]…]

                  ilmari@social.treehouse.systemsI 1 Reply Last reply
                  0
                  • gsuberland@chaos.socialG gsuberland@chaos.social

                    @cstross @WellsiteGeo @quixoticgeek @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @edwinb or [^\w-] instead of \W for a more careful approach, since the \W class will replace smarty-pants to smarty-trousers. hyphens are not included in \w, so the inverted class \W matches on them, which is unlikely to be what you want. [^\w-] works the same but doesn't treat hyphens as word boundaries to avoid the issue.

                    towo@chaos.socialT This user is from outside of this forum
                    towo@chaos.socialT This user is from outside of this forum
                    towo@chaos.social
                    wrote sidst redigeret af
                    #78

                    @gsuberland
                    If you don't care about hyphens, `\bword\b` might be the better choice as a zero-width assertion (i.e. no need for capture groups to retain other characters).

                    If you do.. `(?<!-)\bword\b(?!-)` with some perl magic included will do the look backs/lookaheads.

                    @cstross @WellsiteGeo @quixoticgeek @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @edwinb

                    1 Reply Last reply
                    0
                    • gsuberland@chaos.socialG gsuberland@chaos.social

                      @cstross @WellsiteGeo @quixoticgeek @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @edwinb annoyingly there's no standard character class that matches word boundaries in Latin script prose with high confidence, e.g. something along the lines of [\s"“”„;:!?¡¿‽.,()\[\]…]

                      ilmari@social.treehouse.systemsI This user is from outside of this forum
                      ilmari@social.treehouse.systemsI This user is from outside of this forum
                      ilmari@social.treehouse.systems
                      wrote sidst redigeret af
                      #79

                      @gsuberland @cstross @WellsiteGeo @quixoticgeek @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @edwinb Unicode defines word boundaries, and Perl has \b{wb}, which matches them.

                      cstross@wandering.shopC gsuberland@chaos.socialG oblomov@sociale.networkO 3 Replies Last reply
                      0
                      • ilmari@social.treehouse.systemsI ilmari@social.treehouse.systems

                        @gsuberland @cstross @WellsiteGeo @quixoticgeek @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @edwinb Unicode defines word boundaries, and Perl has \b{wb}, which matches them.

                        cstross@wandering.shopC This user is from outside of this forum
                        cstross@wandering.shopC This user is from outside of this forum
                        cstross@wandering.shop
                        wrote sidst redigeret af
                        #80

                        @ilmari @gsuberland @WellsiteGeo @quixoticgeek @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @edwinb My perl experience mostly predates unicode 😉

                        ilmari@social.treehouse.systemsI 1 Reply Last reply
                        0
                        • ilmari@social.treehouse.systemsI ilmari@social.treehouse.systems

                          @gsuberland @cstross @WellsiteGeo @quixoticgeek @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @edwinb Unicode defines word boundaries, and Perl has \b{wb}, which matches them.

                          gsuberland@chaos.socialG This user is from outside of this forum
                          gsuberland@chaos.socialG This user is from outside of this forum
                          gsuberland@chaos.social
                          wrote sidst redigeret af
                          #81

                          @ilmari @cstross @WellsiteGeo @quixoticgeek @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @edwinb ooh good to know, thanks

                          1 Reply Last reply
                          0
                          • cstross@wandering.shopC cstross@wandering.shop

                            @ilmari @gsuberland @WellsiteGeo @quixoticgeek @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @edwinb My perl experience mostly predates unicode 😉

                            ilmari@social.treehouse.systemsI This user is from outside of this forum
                            ilmari@social.treehouse.systemsI This user is from outside of this forum
                            ilmari@social.treehouse.systems
                            wrote sidst redigeret af
                            #82

                            @cstross @gsuberland @WellsiteGeo @quixoticgeek @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @edwinb To be fair, \b{…} was only added to Perl ten years ago 😉

                            cstross@wandering.shopC jernej__s@infosec.exchangeJ 2 Replies Last reply
                            0
                            • ilmari@social.treehouse.systemsI ilmari@social.treehouse.systems

                              @cstross @gsuberland @WellsiteGeo @quixoticgeek @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @edwinb To be fair, \b{…} was only added to Perl ten years ago 😉

                              cstross@wandering.shopC This user is from outside of this forum
                              cstross@wandering.shopC This user is from outside of this forum
                              cstross@wandering.shop
                              wrote sidst redigeret af
                              #83

                              @ilmari @gsuberland @WellsiteGeo @quixoticgeek @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @edwinb Yeah, it's been most of 25 years for me ...

                              1 Reply Last reply
                              0
                              • ilmari@social.treehouse.systemsI ilmari@social.treehouse.systems

                                @cstross @gsuberland @WellsiteGeo @quixoticgeek @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @edwinb To be fair, \b{…} was only added to Perl ten years ago 😉

                                jernej__s@infosec.exchangeJ This user is from outside of this forum
                                jernej__s@infosec.exchangeJ This user is from outside of this forum
                                jernej__s@infosec.exchange
                                wrote sidst redigeret af
                                #84

                                @ilmari @cstross @gsuberland @WellsiteGeo @quixoticgeek @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @edwinb \b has been in regexp far longer, only the Unicode additions are new.

                                ilmari@social.treehouse.systemsI 1 Reply Last reply
                                0
                                • jernej__s@infosec.exchangeJ jernej__s@infosec.exchange

                                  @ilmari @cstross @gsuberland @WellsiteGeo @quixoticgeek @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @edwinb \b has been in regexp far longer, only the Unicode additions are new.

                                  ilmari@social.treehouse.systemsI This user is from outside of this forum
                                  ilmari@social.treehouse.systemsI This user is from outside of this forum
                                  ilmari@social.treehouse.systems
                                  wrote sidst redigeret af
                                  #85

                                  @jernej__s @cstross @gsuberland @WellsiteGeo @quixoticgeek @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @edwinb yes, that's why I wrote \b{…}, not \b.

                                  1 Reply Last reply
                                  0
                                  • ilmari@social.treehouse.systemsI ilmari@social.treehouse.systems

                                    @gsuberland @cstross @WellsiteGeo @quixoticgeek @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @edwinb Unicode defines word boundaries, and Perl has \b{wb}, which matches them.

                                    oblomov@sociale.networkO This user is from outside of this forum
                                    oblomov@sociale.networkO This user is from outside of this forum
                                    oblomov@sociale.network
                                    wrote sidst redigeret af
                                    #86

                                    @ilmari @gsuberland @cstross @WellsiteGeo @quixoticgeek @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @edwinb
                                    and vim has \< and \> for “directed” word boundary zero-width expression

                                    1 Reply Last reply
                                    0
                                    • gsuberland@chaos.socialG gsuberland@chaos.social

                                      @cstross @WellsiteGeo @quixoticgeek @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @edwinb or [^\w-] instead of \W for a more careful approach, since the \W class will replace smarty-pants to smarty-trousers. hyphens are not included in \w, so the inverted class \W matches on them, which is unlikely to be what you want. [^\w-] works the same but doesn't treat hyphens as word boundaries to avoid the issue.

                                      adamrice@c.imA This user is from outside of this forum
                                      adamrice@c.imA This user is from outside of this forum
                                      adamrice@c.im
                                      wrote sidst redigeret af
                                      #87

                                      @gsuberland @cstross @WellsiteGeo @quixoticgeek @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @edwinb Wait, you’re telling me a word character is not the same as a not-not word character?

                                      cstross@wandering.shopC 1 Reply Last reply
                                      0
                                      • adamrice@c.imA adamrice@c.im

                                        @gsuberland @cstross @WellsiteGeo @quixoticgeek @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @edwinb Wait, you’re telling me a word character is not the same as a not-not word character?

                                        cstross@wandering.shopC This user is from outside of this forum
                                        cstross@wandering.shopC This user is from outside of this forum
                                        cstross@wandering.shop
                                        wrote sidst redigeret af
                                        #88

                                        @adamrice @gsuberland @WellsiteGeo @quixoticgeek @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @edwinb (Obligatory Bill Clinton joke): It depends what you mean by "word".

                                        Less flippantly: is 467130356 a word? Is 17/4/2012 a word? Is !true a word?

                                        1 Reply Last reply
                                        0
                                        • gsuberland@chaos.socialG gsuberland@chaos.social

                                          @cstross @WellsiteGeo @quixoticgeek @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @edwinb or [^\w-] instead of \W for a more careful approach, since the \W class will replace smarty-pants to smarty-trousers. hyphens are not included in \w, so the inverted class \W matches on them, which is unlikely to be what you want. [^\w-] works the same but doesn't treat hyphens as word boundaries to avoid the issue.

                                          flippac@types.plF This user is from outside of this forum
                                          flippac@types.plF This user is from outside of this forum
                                          flippac@types.pl
                                          wrote sidst redigeret af
                                          #89

                                          @gsuberland @cstross @WellsiteGeo @quixoticgeek @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @edwinb gonna be blunt: you want to eyeball and confirm every substitution if possible

                                          these days you can be told how many potential ones up front for a lot of text pretty fast

                                          1 Reply Last reply
                                          0
                                          Svar
                                          • Svar som emne
                                          Login for at svare
                                          • Ældste til nyeste
                                          • Nyeste til ældste
                                          • Most Votes


                                          • Log ind

                                          • Har du ikke en konto? Tilmeld

                                          • Login or register to search.
                                          Powered by NodeBB Contributors
                                          Graciously hosted by data.coop
                                          • First post
                                            Last post
                                          0
                                          • Hjem
                                          • Seneste
                                          • Etiketter
                                          • Populære
                                          • Verden
                                          • Bruger
                                          • Grupper