#WritersCoffeeClub Apr 24 Share a silly mistake you've made while writing.

cstross@wandering.shop

@WellsiteGeo @quixoticgeek @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @edwinb No, they need to pad their search terms with non-word atoms (regular expressions are your friend!), i.e. \W+(search_word)\W+ (in Perl-compatible regexp syntax).

cstross@wandering.shop

@editer @towo In the 2000s, Macmillan's corporate IT department installed a bad word filter *on their incoming email*. It finally got nuked after Tom Doherty (CEO of Tor) stormed their boardroom ranting furiously because the incoming email filter had repeatedly eaten the manuscript of a scheduled bestseller that Production were waiting on. (Turns out publishers get novels via email and novels frequently contain rude words: who could possibly have imagined *that* in a publisher's IT department?)

andreasdavour@dice.camp

@SmartmanApps @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @cstross "one of these is *not* a banana. Can you find out which one?"

fishidwardrobe@social.tchncs.de

@DJRNDM @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @cstross well, that doesn't quite work, because "pants." — but you're not wrong.

richcarl@mastodon.nu

@cstross Protip: always do big renamings via an intermediate nonsense string.
1) Globally rename the original string 'pants' to something that doesn't occur anywhere else, like 'xyzyx'.
2) Search for the new string and step through all occurrences to check for mistakes like 'particixyzyx' and fix them. This is now an easy task.
3) Rename all placeholders to the final string.
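
The three steps above can be sketched in Python. This is only an illustration: the function names (`to_placeholder`, `embedded_hits`, `finish`) are hypothetical, and the placeholder 'xyzyx' and the sample sentence are assumptions built from the post. Step 2 still needs a human to decide how each flagged word should be fixed.

```python
import re

def to_placeholder(text, old, placeholder):
    """Step 1: swap every occurrence of `old` for a string found nowhere else."""
    if placeholder in text:
        raise ValueError(f"{placeholder!r} already occurs in the text; pick another")
    return text.replace(old, placeholder)

def embedded_hits(text, placeholder):
    """Step 2 helper: list words where the placeholder is glued to other letters."""
    pattern = re.compile(r"\w*" + re.escape(placeholder) + r"\w*")
    return [m.group(0) for m in pattern.finditer(text) if m.group(0) != placeholder]

def finish(text, placeholder, new):
    """Step 3: turn the remaining placeholders into the final string."""
    return text.replace(placeholder, new)

draft = "My pants. Participants wore pants."
step1 = to_placeholder(draft, "pants", "xyzyx")
for word in embedded_hits(step1, "xyzyx"):
    print("check by hand:", word)
```

Once the flagged words have been restored by hand, `finish(text, "xyzyx", "trousers")` completes the rename.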

cstross@wandering.shop

@DJRNDM @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord

Groan.

s/(\W+?)(pants)(\W+?)/\1trousers\3/ig

You could use \b, which matches a word boundary, instead of \W+? (smallest count of non-word characters preceding the next regexp group) but that'd miss run-on strings ending in pants (e.g. InterCappedpants).

The i and g modifiers on s///ig make the match case-insensitive and global.
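
For anyone who wants to try the substitution outside Perl, here is a rough Python equivalent. It is a sketch, assuming Python's `re` module behaves like PCRE for these constructs; the function names and sample strings are made up for illustration. `re.IGNORECASE` plays the role of the `i` modifier, and `sub` replaces every match by default, like `g`.

```python
import re

# Padded form from the post: one or more non-word characters on each side.
PADDED = re.compile(r"(\W+?)(pants)(\W+?)", re.IGNORECASE)
# Word-boundary form: \b is zero-width, so nothing needs to be put back.
BOUNDARY = re.compile(r"\bpants\b", re.IGNORECASE)

def swap_padded(text):
    # \1 and \3 restore the padding that the match consumed.
    return PADDED.sub(r"\1trousers\3", text)

def swap_boundary(text):
    return BOUNDARY.sub("trousers", text)

for sample in ["my pants are", "participants arrived", "pants on fire"]:
    print(repr(sample), "->", repr(swap_padded(sample)), "|", repr(swap_boundary(sample)))
```

One difference worth knowing when choosing between the two: the padded form needs an actual character on each side, so in this sketch it leaves a "pants" at the very start or end of the text alone, while the \b form replaces it. Neither form touches "participants" or "InterCappedpants".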

cstross@wandering.shop

@richcarl Or you could use a regular expression. Hint: I once rewrote a UNIX man page for regular expressions as part of my day job back in the early 1990s. None of your search/replace tips are news to me.

pineywoozle@masto.ai

@owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @cstross

richcarl@mastodon.nu

@cstross Sure, regexps are great. If your editor supports them, and you know how to write them correctly, and the implementation doesn't have word boundary issues with utf-8. For any average writer stuck on an average text editor, I suggest the 3-step method.

cstross@wandering.shop

@richcarl I work in Scrivener, which includes pcre regexps. But you know even Microsoft Word has regexps these days? They're well-hidden and their implementation is typically Microsoftish (i.e. non-standard and missing a few features) but it's there in the search/replace dialog box. And the publishing industry runs on Word files—so much so that if you go the trad route you *have to* submit your manuscripts in docx format.

So every non-amateur author uses Word or LibreOffice at some stage.

gsuberland@chaos.social

@cstross @WellsiteGeo @quixoticgeek @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @edwinb or [^\w-] instead of \W for a more careful approach, since \W will turn smarty-pants into smarty-trousers. hyphens are not included in \w, so the inverted class \W matches them, which is unlikely to be what you want. [^\w-] works the same way but doesn't treat hyphens as word boundaries, which avoids the issue.
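
A small Python comparison of the two classes, as a sketch only. The pattern names `NAIVE` and `CAREFUL` and the test phrases are assumptions for illustration; the classes themselves are the ones from the post.

```python
import re

# \W treats the hyphen as padding, so hyphenated compounds get rewritten.
NAIVE = re.compile(r"(\W+?)(pants)(\W+?)", re.IGNORECASE)
# [^\w-] is "anything that is neither a word character nor a hyphen".
CAREFUL = re.compile(r"([^\w-]+?)(pants)([^\w-]+?)", re.IGNORECASE)

def swap(pattern, text):
    # Put the captured padding back around the replacement word.
    return pattern.sub(r"\1trousers\3", text)

for sample in ["a smarty-pants here", "my pants-free day", "my pants are"]:
    print(repr(sample), "->", repr(swap(NAIVE, sample)), "|", repr(swap(CAREFUL, sample)))
```

With the careful class, only the plain "my pants are" is changed; the hyphenated forms are left for a human to decide.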

alexanderdyas@mindly.social

@SmartmanApps @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @cstross To be fair, the one at the top is a plantain

gsuberland@chaos.social

@cstross @WellsiteGeo @quixoticgeek @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @edwinb annoyingly there's no standard character class that matches word boundaries in Latin script prose with high confidence, e.g. something along the lines of [\s"“”„;:!?¡¿‽.,()\[\]…]
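
One way such a hand-rolled delimiter class might be used, sketched in Python. The character class is copied from the post; wrapping it in a lookbehind and lookahead, the `^`/`$` alternatives for the start and end of the text, and the names `DELIMS`, `PROSE_WORD` and `swap_prose` are all assumptions added for illustration.

```python
import re

# Delimiter class from the post: whitespace plus common Latin-script punctuation.
DELIMS = r'[\s"“”„;:!?¡¿‽.,()\[\]…]'

# Require a delimiter (or the start/end of the text) on each side, without
# consuming it, so the surrounding punctuation never has to be restored.
PROSE_WORD = re.compile(
    "(?:^|(?<=" + DELIMS + "))pants(?=" + DELIMS + "|$)",
    re.IGNORECASE,
)

def swap_prose(text):
    return PROSE_WORD.sub("trousers", text)

for sample in ['"pants," she said', "smarty-pants", "participants"]:
    print(repr(sample), "->", repr(swap_prose(sample)))
```

Because the hyphen and the apostrophe are not in the class, forms like "smarty-pants" are skipped; whether that is desirable depends on the manuscript.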

towo@chaos.social

@gsuberland
If you don't care about hyphens, `\bword\b` might be the better choice as a zero-width assertion (i.e. no need for capture groups to retain other characters).

If you do: `(?<!-)\bword\b(?!-)`, with some Perl magic included, does the lookbehinds/lookaheads.

@cstross @WellsiteGeo @quixoticgeek @owent @alicemcalicepants @nullcolaship @davidtheeviloverlord @edwinb
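
The lookaround version carries over to Python's `re` module unchanged. A minimal sketch, with "pants" substituted for `word` and the names `GUARDED` and `swap_guarded` invented for the example:

```python
import re

# (?<!-) and (?!-) are zero-width: they reject a hyphen on either side
# without consuming any character, so no capture groups are needed.
GUARDED = re.compile(r"(?<!-)\bpants\b(?!-)", re.IGNORECASE)

def swap_guarded(text):
    return GUARDED.sub("trousers", text)

for sample in ["pants on fire", "smarty-pants", "pants-free", "my pants pants."]:
    print(repr(sample), "->", repr(swap_guarded(sample)))
```

Since nothing around the word is consumed, back-to-back occurrences separated by a single space are both replaced, and a match at the very start or end of the text works too.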