Knowledge

User:Art LaPella/AWB explanation

This page explains my AWB editing. It does not specifically explain the edit whose edit summary you probably just clicked, nor does it explain AWB itself. It explains all of my similar edits in a general way.

Why don't I let AWB list all of my changes in its edit summary, as most AWB users do? Because most of them usually wouldn't fit in the summary. And also because most of my changes would be meaningless without understanding how I programmed my AWB Find and Replaces. AWB doesn't provide an easy way to program exceptions to a Find and Replace. As a workaround, I change the exceptions by adding YwBsF and other meaningless code words, so the main Find and Replace won't Find them. After that main Find and Replace, I change YwBsF back again. Then the process repeats for another Find and Replace and its exceptions. It works for me, but it doesn't work in an edit summary to see all the text changed to YwBsF and back again.

Most of the changes in my edits are based on Knowledge:Manual of Style and its subpages. I don't try to enforce all of the Manual of Style, because most of the manual is too subjective to be automated with AWB. Even the parts I do automate have frequent exceptions, so this project could not be accomplished with a bot. I also do AWB's "general fixes" and "RegexTypoFixes", correct User:Art LaPella/Citation template double period bug problems, and do other proofreading such as changing "pages 14" to "page 14". My AWB "Find and Replace" selections are listed at User:Art LaPella/AWB list, but they aren't as easy to read as they are within AWB.

Not counting the general fixes and RegexTypoFixes linked in the previous paragraph, below I have listed my most frequent (as of July 2010) changes, along with a link to the Manual of Style guideline or other rule that I am trying to enforce. If I have too automatically applied the Manual of Style to a special situation where the guideline in the Manual wasn't intended to apply, you could be right. But if you believe the Manual of Style guideline I'm using is always wrong, or if you wonder why I care at all about things like the difference between hyphens and dashes, then please see User:Art LaPella/Because the guideline says so.

My most frequent changes

Nowrap and nbsp

The {{Nowrap}} template and &nbsp; are used when you don't want a line to end in the middle of an expression like 17 kg. Most of my Nowraps and nbsps are specified by the WP:NBSP guideline, including this elaboration at MOS:NUM. The guideline also repeatedly calls for nbsps or Nowraps in any "places where breaking across lines might be disruptive to the reader". But since I learned to read from one line to the next at a very young age, I have trouble imagining any such places where the reader would have trouble following from one line to the next. So rather than guess what other places might be intended, I use nbsps or Nowraps only in places that closely resemble the examples given in the MOS:NUM elaboration of the WP:NBSP guideline, with two exceptions covered by other guidelines: £11 billion is listed as an example, but not 11 billion without any currency symbol; however that situation is covered near the end of MOS:NUM#Numbers as figures or words, where what is now the third bullet point to the end begins, "When both a figure ...". The other exception is the nbsp before three dots; see WP:ELLIPSIS where it says, "To keep the ellipsis from wrapping to the next line ...".

Most of the Nowraps I add are to dates in citations, such as the date= and accessdate= parameters. The guideline specifies "12 November" as an example of a good place for an nbsp or Nowrap, but they probably had normal text in mind when they said "disruptive to the reader", not citations. However, 1) a Nowrap doesn't hurt a citation 2) unlike human editors, AWB software doesn't get tired of adding Nowraps throughout an article 3) it would actually take more programming to tell AWB to only change dates that aren't in a citation, and 4) the drawback of Nowrap or nbsp, which is that it interferes with editing by making the edit page less readable, is also less important in a citation because the editor is likely to skip over the entire citation when trying to comprehend the edit text. Removed after objections.

Why do I sometimes use Nowrap instead of nbsp? It has been argued that Nowrap is easier for newbies to understand (not everyone knows or remembers what nbsp stands for). Nowrap is recommended at the elaborated version of WP:NBSP. So if you don't like Nowrap, please discuss or change MOS:NUM so the rest of us can share in your wisdom. Once again, see User:Art LaPella/Because the guideline says so.

You might think that adding &nbsp; into a wikilink like World War II would make it a redlink. But that link is blue for me, despite the nbsp you can see in edit mode. Nobody using different browsers or operating systems has ever complained on this issue. Previous discussion here.

I don't use nbsp or Nowrap near the beginning of a paragraph. Such an nbsp would not matter unless the reader were using a very narrow window or a very large font size. A couple editors have opposed an nbsp in places where it would normally have no effect.

Hyphens, dashes, and minus signs

What's the difference? Look closely: - – — − They each look different. The first is a hyphen, the second is an en dash, the third is an em dash, and the fourth is a minus sign. If I changed one of those four lines to another, or if I added or removed spaces before and after, it's because of WP:DASH. The end of WP:HYPHEN also helps explain it. Who cares about obscure guidelines? Again, see User:Art LaPella/Because the guideline says so.

Here are the most common situations in this category. A hyphen with a space before and after is often used to mean something like a comma. According to the guideline, any such hyphen should be changed to an en dash (the guideline also allows em dashes if the spaces are removed). A hyphen without spaces often occurs in phrases like "The Civil War, 1861-1865" or "pages 14-16". Those hyphens should also be en dashes. More often than not, these hyphens occur in the titles of references.

So if a cited book has a title like The Civil War, 1861-1865, do we change the hyphen even though that may change the title? This was discussed here, and someone reinstated my dashes again afterwards.

Removed after an objection. I still change hyphens and similar punctuation in web page titles, but not books. If you're looking for a web page, you'll probably use the URL, or you'll resort to the Wayback Machine which asks for the URL, or you might even look up the web page's title using a search engine like Google – but any search engine we've checked ignores punctuation. But if you're looking for a book (that is, anything on paper rather than the Internet), you're likely to use WorldCat, which doesn't work if the punctuation doesn't match, as of August 2010.

For better or worse, I don't substitute hyphens with dashes in dates of the form mm-dd-yyyy, which are usually the date that a reference is retrieved. My thought was that such dates are more likely to be updated, and an editor updating that date is unlikely to take the trouble to use dashes instead of hyphens. That would result in a mixture of dashes and hyphens in the retrieved dates.

My not so frequent changes

Removing wikilinks

Wikilinks like United States should usually be unlinked according to this guideline. The exact list of links that my software removes can be found by searching User:Art LaPella/AWB list for a word like "Asia". The list was chosen conservatively; that is, more links should probably be unlinked, but hopefully we can agree that at least this list of links is covered by WP:OVERLINK. A phrase like "India and Pakistan fought a war" might be considered unfair to either India or Pakistan, because I linked one but not the other, so in such a case I would probably unlink both.

Commas

See WP:COPYEDIT#Parenthetical comma. Sometimes I rearrange a sentence, rather than insert another comma into a place such as almost at the end of a sentence. I also make sure each comma has a space afterwards, and no space before.

Curly quotes

See the Quotation characters paragraph of WP:MOS#Quotation marks.

My less frequent changes

Removing periods from captions

See WP:MOS#Formatting of captions.

Contractions

See WP:CONTRACTION.

USA

With frequent exceptions, it should usually be US or U.S. not USA, according to WP:MOS#Acronyms and abbreviations: "Do not use U.S.A. or USA, except ..."

Uncapitalization in headers

Only the first word and proper nouns should be capitalized, whether the header is an article header (see WP:Manual of Style (capital letters)#Section headings) or a table header (see WP:Manual of Style (tables)#Captions and headings).

24th not 24

See MOS:NUM#Typography.

7 kg with a (non-breaking) space, not 7kg

See the Unit symbols paragraph in MOS:NUM#Conventions: "use 10 m or 29 kg, not 10m or 29kg."

ALL CAPITAL LETTERS

See WP:ALLCAPS.

Remove phrases like "Note that ..." or "Clearly ..."

See MOS:NOTED.

&

If I changed "&" to "and", see WP:&. This happens most often in a citation in a list of authors. That is a debatable interpretation of WP:&, because citations aren't exactly "running prose", but neither are they a place where "space is limited" because the reader has no reason to look down at the references unless he clicks one of them.

If I changed "and" to "&", it is because my program looks for a list of companies like Barnes & Noble. That company's article, website and logo consistently say "Barnes & Noble", not "Barnes and Noble".

Typos I noticed

AWB allows manual editing at the same time, if I see something obviously wrong.

Page, pages, pp., p., etc.

No guideline requires multiple pages to be "pages 14–17" not "page 14–17", but that is common sense if English is your native language. Similarly, I didn't find a command to use "p." only for single pages and "pp." only for multiple pages, but dictionaries agree that "p." means "page" and "pp." means "pages", and I didn't find other publications confusing the two (with or without the periods or spaces afterwards).

From x to y

"From x–y" should be "from x to y", and "between x–y" should be "between x and y", according to WP:ENDASH.

Even less frequent changes

My software looks for a list of problems much longer than this explanation page. If a change isn't listed above, and it isn't listed with AWB's general fixes and RegexTypoFixes, it's probably based on some obscure Manual of Style guideline, similar to the sections above. Or if you can figure it out, you might look for it in my AWB Find and Replaces listed at User:Art LaPella/AWB list.

Changes I don't make

Other AWB users make AWB edits that I could easily add to my software if there were a consensus. But where the Manual of Style is ambivalent on an issue, I don't see how you can argue that the consensus really exists. If it does, why don't you change the guideline? Specifically:

Removing double spaces between words, or double spaces after a period. MOS:FULLSTOP
A space after the equal signs in a heading. WP:HEAD
Changing &ndash; and &mdash; to – or — despite no mention in the Manual of Style. Some want to reverse that process.
"Fixing" redirects that aren't broken. Several automated processes promote this despite WP:R2D, and there is frustratingly little interest in harmonizing the guideline with actual practice.
WP:UNLINKDATES. That is in the Manual of Style, and I might be persuaded to include it in my editing. But in the past, that issue has generated more wikidrama than it is worth, so I leave that to the wikiwarriors who enjoy that sort of thing.

More automation

So why don't I fix some of this stuff by changing templates or using a bot? Probably for the same reasons you aren't volunteering.

If you mean do it myself, the expected level of vanity wildly underestimates the difficulty of learning new software, and I'd rather clean up Knowledge than study software. Could anybody even direct me to a manual for programming a bot, and examples of how to interface it with Knowledge?

If you mean organize others to do it, I've had no success herding cats on Knowledge. Just getting agreement on specifications is usually impossible. Getting anyone to fix User:Art LaPella/Citation template double period bug, among the most widespread typos on Knowledge, has also been impossible. I meet plenty of people who make the perfect the enemy of the good, but they don't want to solve the problem. If I want something done, I do it myself, and occasionally my example inspires some imitation.
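
Illustration only: the exception workaround this page describes (hide the exceptions behind a meaningless code word such as YwBsF, run the main Find and Replace, then change the code word back) can be sketched outside AWB in a few lines of Python. This is not the actual AWB configuration. The function names, the two regular expressions, and the numbered form of the placeholder are assumptions made for the example. It applies the hyphen-to-en-dash rule for numeric ranges while leaving mm-dd-yyyy dates alone, as the page says it does.

```python
import re

PLACEHOLDER = "YwBsF"  # meaningless code word, borrowed from the explanation above


def replace_with_exceptions(text, main_pattern, main_repl, exception_pattern):
    """Apply main_pattern -> main_repl, but leave matches of exception_pattern untouched."""
    protected = []

    def hide(match):
        # Remember the original text and leave a numbered placeholder behind.
        protected.append(match.group(0))
        return f"{PLACEHOLDER}{len(protected) - 1}{PLACEHOLDER}"

    def restore(match):
        return protected[int(match.group(1))]

    # Step 1: change the exceptions so the main rule cannot find them.
    text = re.sub(exception_pattern, hide, text)
    # Step 2: the main Find and Replace.
    text = re.sub(main_pattern, main_repl, text)
    # Step 3: change the placeholders back again.
    return re.sub(rf"{PLACEHOLDER}(\d+){PLACEHOLDER}", restore, text)


def dash_ranges(text):
    """Hypothetical rule: hyphen between digits becomes an en dash, except in mm-dd-yyyy dates."""
    return replace_with_exceptions(
        text,
        main_pattern=r"(?<=\d)-(?=\d)",               # hyphen between two digits
        main_repl="\u2013",                           # en dash
        exception_pattern=r"\b\d{1,2}-\d{1,2}-\d{4}\b",  # mm-dd-yyyy retrieval dates
    )


if __name__ == "__main__":
    print(dash_ranges("pages 14-16, retrieved 08-15-2010"))
    # prints: pages 14–16, retrieved 08-15-2010
```

Numbering the placeholders is a design choice for this sketch, so that several different exceptions can be restored correctly. The page itself only says that code words are added to the exceptions and removed afterwards.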


Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.