Knowledge

Spell checker

Source 📝

461: 238:, for faster action; he made the first spelling corrector by searching the word list for plausible correct spellings that differ by a single letter or adjacent letter transpositions and presenting them to the user. Gorin made SPELL publicly accessible, as was done with most SAIL (Stanford Artificial Intelligence Laboratory) programs, and it soon spread around the world via the new ARPAnet, about ten years before personal computers came into general use. SPELL, its algorithms and data structures inspired the Unix 234:, who headed the research on this budding technology, saw it necessary to include the first spell checker that accessed a list of 10,000 acceptable words. Ralph Gorin, a graduate student under Earnest at the time, created the first true spelling checker program written as an applications program (rather than research) for general English text: SPELL for the DEC PDP-10 at Stanford University's Artificial Intelligence Laboratory, in February 1971. Gorin wrote SPELL in 128: 481:'s short-lived CoAuthor and allowed a user to view the results after a document was processed and correct only the words that were known to be wrong. When memory and processing power became abundant, spell checking was performed in the background in an interactive way, such as has been the case with the Sector Software produced Spellbound program released in 1987 and 437:
It might seem logical that where spell-checking dictionaries are concerned, "the bigger, the better," so that correct words are not marked as incorrect. In practice, however, an optimal size for English appears to be around 90,000 entries. If there are more than this, incorrectly spelled words may be
200:, to recognize errors instead of correctly-spelled words. This approach usually requires a lot of effort to obtain sufficient statistical information. Key advantages include needing less runtime storage and the ability to correct errors in words that are not included in a dictionary. 517:
into new combinations of words. In German, compound nouns are frequently coined from other existing nouns. Some scripts do not clearly separate one word from another, requiring word-splitting algorithms. Each of these presents unique challenges to non-English language spell checkers.
582:-based spelling correction algorithm", published in 1999, which is able to recognize about 96% of context-sensitive spelling errors, in addition to ordinary non-word spelling errors. Context-sensitive spell checkers appeared in the now-defunct applications 433:
but it was not so helpful for logical or phonetic errors. The challenge the developers faced was the difficulty in offering useful suggestions for misspelled words. This requires reducing words to a skeletal form and applying pattern-matching algorithms.
714: 476:
The first MS-DOS spell checkers were mostly used in proofing mode from within word processing packages. After preparing a document, a user scanned the text looking for misspellings. Later, however, batch processing was offered in such packages as
117:
The original version of this poem was written by Jerrold H. Zar in 1992. An unsophisticated spell checker will find little or no fault with this poem because it checks words in isolation. A more sophisticated spell checker will make use of a
456:
than a reference to the Thai currency. Hence, it would typically be more useful if a few people who write about Thai currency were slightly inconvenienced than if the spelling errors of the many more people who discuss baths were overlooked.
150:
It then compares each word with a known list of correctly spelled words (i.e. a dictionary). This might contain just a list of words, or it might also contain additional information, such as hyphenation points or lexical and grammatical
311:
The first spell checkers for personal computers appeared in 1980, such as "WordCheck" for Commodore systems which was released in late 1980 in time for advertisements to go to print in January 1981. Developers such as Maria Mariani and
530:
of the surrounding words. Not only does this allow words such as those in the poem above to be caught, but it mitigates the detrimental effect of enlarging dictionaries, allowing more words to be recognized. For example,
512:
English is unusual in that most words used in formal writing have a single spelling that can be found in a typical dictionary, with the exception of some jargon and modified words. In many languages, words are often
371:
When Apple developed "a system-wide spelling checker" for Mac OS X so that "the operating system took over spelling fixes," it was a first: one "didn't have to maintain a separate spelling checker for each" program.
363:
might not have justified the investment of implementing a spell checker, companies like WordPerfect nonetheless strove to localize their software for as many national markets as possible as part of their global
174:
It is unclear whether morphological analysis—allowing for many forms of a word depending on its grammatical role—provides a significant benefit for English, though its benefits for highly
385:, introduced in 1994, was "designed for developers of applications that support Windows." It came with a dictionary but had the ability to build and incorporate use of secondary dictionaries. 303:. Its goal is to combine programs supporting different languages such as Aspell, Hunspell, Nuspell, Hspell (Hebrew), Voikko (Finnish), Zemberek (Turkish) and AppleSpell under one interface. 1320: 320:
packages or end-user products into the rapidly expanding software market. On the pre-Windows PCs, these spell checkers were standalone programs, many of which could be run in
460: 839:, citation: "Maria Mariani... was one of a group of six linguists from Georgetown University who developed the first spell-check system for the IBM corporation." 526:
There has been research on developing algorithms that are capable of recognizing a misspelled word, even if the word itself is in the vocabulary, based on the
1382: 170:. For many other languages, such as those featuring agglutination and more complex declension and conjugation, this part of the process is more complicated. 1542: 717:
Proceedings of the 9th International Conference on Natural Language Processing (PolTAL). Lecture Notes in Computer Science (LNCS). Springer. p. 438-449.
658:
Advances in Data Mining: Applications and Theoretical Aspects: 10th Industrial Conference, ICDM 2010, Berlin, Germany, July 12-14, 2010. Proceedings
429:
The first spell checkers were "verifiers" instead of "correctors." They offered no suggestions for incorrectly spelled words. This was helpful for
1520: 1349: 1172: 1931: 1375: 327:
However, the market for standalone packages was short-lived, as by the mid-1980s developers of popular word-processing packages like
335:
had incorporated spell checkers in their packages, mostly licensed from the above companies, who quickly expanded support from just
277:
Due to the inability of traditional spell checkers to check words in complex inflected languages, Hungarian László Németh developed
347:. However, this required increasing sophistication in the morphology routines of the software, particularly with regard to heavily- 697: 2100: 196:
to find correct spellings of misspelled words. An alternative type of spell checker uses solely statistical information, such as
666: 728: 2146: 1841: 1532: 1368: 1325: 1283: 2095: 1269: 787: 245:
The first spell checkers were widely available on mainframe computers in the late 1970s. A group of six linguists from
1702: 850: 274:. Aspell's main improvement is that it can more accurately suggest correct alternatives for misspelled English words. 1856: 1687: 1124: 1039: 682: 1627: 1106: 2044: 1697: 317: 207:
for those misspellings; this less flexible approach is often used in paper-based correction methods, such as the
823: 1692: 1437: 504:
that non-native language learners can rely on to detect and correct their misspellings in the target language.
267:
program commonly used in Unix is based on R. E. Gorin's SPELL. It was converted to C by Pace Willisson at MIT.
2136: 1961: 1682: 656: 1654: 501: 285:
and complex compound words. Hunspell also uses Unicode in its dictionaries. Hunspell replaced the previous
162:, the spell checker will need to consider different forms of the same word, such as plurals, verbal forms, 715:
Between Sound and Spelling: Combining Phonetics and Clustering Algorithms to Improve Target Word Recovery.
1999: 1984: 1956: 1821: 1816: 1391: 321: 189: 704:
Proceedings of Recent Advances in Natural Language Processing (RANLP2013). Hissar, Bulgaria. p. 172-178.
1736: 1707: 1485: 761: 1355: 438:
skipped because they are mistaken for others. For example, a linguist might determine on the basis of
1579: 1432: 963: 2105: 2029: 1761: 1717: 1602: 1500: 421:
Some spell checkers have separate support for medical dictionaries to help prevent medical errors.
167: 155: 596:
attempt to fix problems with grammar beyond spelling errors, including incorrect choice of words.
2131: 2009: 1979: 1646: 1072: 1480: 1866: 1559: 1537: 1527: 1495: 1470: 615: 527: 282: 985: 500:
and foreign words as misspellings. Nonetheless, spell checkers can be considered as a type of
1726: 583: 246: 163: 2079: 1755: 1731: 1584: 214: 193: 134:
spell checker in action for the above poem, the word "chequer" marked as unrecognized word
8: 2059: 1989: 1946: 1902: 1674: 1664: 1659: 1547: 1201:. (pp. 29). Master's Thesis, Dominican University of California. Retrieved 19 March 2012. 620: 492:
errors. However, even at their best, they rarely catch all the errors in a text (such as
340: 204: 824:"Georgetown U Faculty & Staff: The Center for Language, Education & Development" 2069: 1941: 1806: 1569: 1552: 1410: 1356:
History and text of "Candidate for a Pullet Surprise" by Mark Eckman and Jerrold H. Zar
1228: 694: 579: 478: 465: 439: 352: 296: 175: 2141: 2074: 1786: 1594: 1505: 1343: 1035: 662: 235: 1247: 1232: 376:'s spellcheck coverage includes virtually all bundled and third party applications. 1951: 1836: 1811: 1612: 1515: 1220: 605: 356: 336: 185:
allows users to approve or reject replacements and modify the program's operation.
159: 36: 2063: 2024: 2019: 1887: 1617: 1490: 1465: 1447: 1315: 1029: 735: 701: 610: 593: 252: 1771: 1751: 1475: 630: 482: 344: 290: 182: 119: 48: 1360: 1224: 796: 2125: 1846: 1826: 1607: 1310: 1057: 514: 488:
Spell checkers became increasingly sophisticated; now capable of recognizing
398: 348: 131: 60: 868: 2014: 1632: 1338: 635: 625: 313: 217:
have also been used for spell checking combined with phonetic information.
52: 1150: 1128: 1971: 1851: 1564: 1457: 1405: 1091: 990: 587: 489: 332: 324:
mode from within word-processing packages on PCs with sufficient memory.
231: 578:
The most successful algorithm to date is Andrew Golding and Dan Roth's "
1574: 1334: 1196: 410: 271: 56: 1442: 940: 827: 550: 497: 493: 365: 255:
invented one for the VAX machines of Digital Equipment Corp in 1981.
40: 1089:
Peter G. Aitken (November 8, 1994). "Spell-Checking for your Apps".
359:. Although the size of the word-processing market in a country like 1917: 1897: 1882: 1861: 1831: 1776: 1741: 1622: 402: 373: 328: 278: 203:
In some cases, spell checkers use a fixed list of misspellings and
44: 20: 507: 154:
An additional step is a language-dependent algorithm for handling
2054: 1912: 1892: 1766: 1510: 1425: 469: 406: 394: 360: 300: 286: 549:. The most common example of errors caught by such a system are 249:
developed the first spell-check system for the IBM corporation.
127: 1420: 1415: 1329: 1173:"CASES; Do Spelling and Penmanship Count? In Medicine, You Bet" 683:
Method for rule-based correction of spelling and grammar errors
264: 197: 1198:
Foreign Language Learning Difficulties and Teaching Strategies
916: 763:
Computer Programs for Detecting and Correcting Spelling Errors
2110: 1746: 695:
Effective Spell Checking Methods Using Clustering Algorithms.
430: 1352:, Spell-Check Crutch Curtails Correctness, by Lloyd de Vries 533: 444: 143:
A basic spell checker carries out the following processes:
553:
errors, such as the bold words in the following sentence:
1907: 1211:
Golding, Andrew R.; Roth, Dan (1999). "Journal Article".
851:"Teaching Computers to Spell (obituary for Henry Kučera)" 147:
It scans the text and extracts the words contained in it.
892: 818: 816: 1125:"Medical Spell Checker for Firefox and Thunderbird" 1107:"Aspell and Hunspell: A Tale of Two Spell Checkers" 521: 405:. Prior to using Hunspell, Firefox and Chrome used 813: 181:As an adjunct to these components, the program's 178:such as German, Hungarian, or Turkish are clear. 2123: 1593: 43:. Spell-checking features are often embedded in 1390: 1088: 508:Spell-checking for languages other than English 299:is another general spell checker, derived from 122:to consider the context in which a word occurs. 1346:, "Spellchecking by computer", by Roger Mitton 1376: 1245: 983: 961: 158:. Even for a lightly inflected language like 545:would not be recognized as a misspelling of 1210: 1027: 1012: 1383: 1369: 1337:, "How to Write a Spelling Corrector", by 1284:"Google's Context-Sensitive Spell Checker" 1170: 1148: 1015:Mac OS X Snow Leopard: The Missing Manual 971:Compute! Magazine, Issue 8, Vol. 3, No. 1 849:Harvey, Charlotte Bruce (May–June 2010). 661:. Springer Science & Business Media. 101:It helps me right all stiles of righting, 1319:) is being considered for deletion. See 1272:. googlesystem.blogspot.com. 29 May 2009 1031:Switching to the Mac: The Missing Manual 977: 759: 459: 126: 79:Eye strike the quays and type a whirred 16:Software to help correct spelling errors 2124: 848: 654: 270:The GNU project has its spell checker 1364: 734:. Stanford University. Archived from 713:Zampieri, M.; de Amorim, R.C. (2014) 693:de Amorim, R.C.; Zampieri, M. (2013) 106:Each frays come posed up on my screen 1842:Simple Knowledge Organization System 448:is more frequently a misspelling of 401:offer spell checking support, using 1171:Friedman, Richard A.; D, M (2003). 785: 729:"The First Three Spelling Checkers" 726: 13: 1127:. e-MedTools. 2017. Archived from 39:that checks for misspellings in a 14: 2158: 1857:Thesaurus (information retrieval) 1323:to help reach a consensus. › 1302: 1151:"German medical dictionary words" 760:Peterson, James (December 1980). 110:The chequer pours o'er every word 1246:Walt Mossberg (4 January 2007). 1061:. February 21, 1994. p. 68. 964:"Micro Computer Industries, Ltd" 522:Context-sensitive spell checkers 424: 281:, a spell checker that supports 74:It plane lee marks four my revue 1262: 1239: 1204: 1189: 1164: 1142: 1117: 1099: 1082: 1065: 1048: 1021: 1006: 984:Advertisement (November 1982). 955: 933: 909: 885: 861: 92:Its vary polished in its weigh. 83:Weather eye am write oar wrong 1438:Natural language understanding 1149:Quathamer, Dr. Tobias (2016). 962:Advertisement (January 1981). 842: 779: 753: 720: 707: 687: 675: 648: 416: 112:Two cheque sum spelling rule. 1: 1962:Optical character recognition 1308: 641: 99:It freeze yew lodes of thyme. 85:It tells me straight a weigh. 1655:Multi-document summarization 655:Perner, Petra (2010-07-05). 502:foreign language writing aid 108:Eye trussed too bee a joule. 90:Your shore real glad two no. 70:Eye have a spelling chequer, 7: 2147:Natural language processing 1985:Latent Dirichlet allocation 1957:Natural language generation 1822:Machine-readable dictionary 1817:Linguistic Linked Open Data 1392:Natural language processing 1073:"Browse September 27, 1993" 599: 388: 322:terminate-and-stay-resident 190:approximate string matching 103:And aides me when eye rime. 97:A chequer is a bless thing, 88:Eye ran this poem threw it, 81:And weight four it two say 76:Miss Steaks I can knot sea. 10: 2163: 1737:Explicit semantic analysis 1486:Deep linguistic processing 1055:"VisualTools VT-Speller". 1034:. "O'Reilly Media, Inc.". 986:"The Spelling Bee Is Over" 220: 211:entries of encyclopedias. 2088: 2043: 1998: 1970: 1930: 1875: 1797: 1785: 1716: 1673: 1645: 1580:Word-sense disambiguation 1456: 1433:Computational linguistics 1398: 1270:"Google Operating System" 1219:. SpringerLink: 107–130. 537:in the same paragraph as 225: 138: 94:My chequer tolled me sew. 2106:Natural Language Toolkit 2030:Pronunciation assessment 1932:Automatic identification 1762:Latent semantic analysis 1718:Distributional semantics 1603:Compound-term processing 1501:Named-entity recognition 1321:templates for discussion 789:Visible Legacies for Y3K 72:It came with my Pea Sea. 2010:Automated essay scoring 1980:Document classification 1647:Automatic summarization 1225:10.1023/A:1007545901558 283:agglutinative languages 258: 188:Spell checkers can use 47:or services, such as a 1867:Universal Dependencies 1560:Terminology extraction 1543:Semantic decomposition 1538:Semantic role labeling 1528:Part-of-speech tagging 1496:Information extraction 1481:Coreference resolution 1471:Collocation extraction 1326:List of spell checkers 1153:. Dr. Tobias Quathamer 869:"International Ispell" 616:Record linkage problem 496:errors) and will flag 473: 306: 135: 1628:Sentence segmentation 1250:. Wall Street Journal 945:, AbiWord, 2023-02-13 855:Brown Alumni Magazine 681:U.S. Patent 6618697, 584:Microsoft Office 2007 463: 393:Web browsers such as 247:Georgetown University 215:Clustering algorithms 130: 2137:Text editor features 2080:Voice user interface 1791:datasets and corpora 1732:Document-term matrix 1585:Word-sense induction 1028:David Pogue (2015). 1013:David Pogue (2009). 343:and eventually even 265:International Ispell 194:Levenshtein distance 2060:Interactive fiction 1990:Pachinko allocation 1947:Speech segmentation 1903:Google Ngram Viewer 1675:Machine translation 1665:Text simplification 1660:Sentence extraction 1548:Semantic similarity 621:Spelling suggestion 192:algorithms such as 176:synthetic languages 2070:Question answering 1942:Speech recognition 1807:Corpus linguistics 1787:Language resources 1570:Textual entailment 1553:Sentiment analysis 1195:Banks, T. (2008). 1177:The New York Times 921:hunspell.github.io 741:on 22 October 2012 700:2017-08-17 at the 474: 440:corpus linguistics 293:in version 2.0.2. 136: 2119: 2118: 2075:Virtual assistant 2000:Computer-assisted 1926: 1925: 1683:Computer-assisted 1641: 1640: 1633:Word segmentation 1595:Text segmentation 1533:Semantic analysis 1521:Syntactic parsing 1506:Ontology learning 1111:battlepenguin.com 917:"Hunspell: About" 668:978-3-642-14399-1 236:assembly language 2154: 2096:Formal semantics 2045:Natural language 1952:Speech synthesis 1934:and data capture 1837:Semantic network 1812:Lexical resource 1795: 1794: 1613:Lexical analysis 1591: 1590: 1516:Semantic parsing 1385: 1378: 1371: 1362: 1361: 1296: 1295: 1293: 1291: 1281: 1279: 1277: 1266: 1260: 1259: 1257: 1255: 1243: 1237: 1236: 1213:Machine Learning 1208: 1202: 1193: 1187: 1186: 1184: 1183: 1168: 1162: 1161: 1159: 1158: 1146: 1140: 1139: 1137: 1136: 1121: 1115: 1114: 1103: 1097: 1096: 1086: 1080: 1079: 1069: 1063: 1062: 1052: 1046: 1045: 1025: 1019: 1018: 1010: 1004: 1003: 1001: 999: 981: 975: 974: 968: 959: 953: 952: 951: 950: 937: 931: 930: 928: 927: 913: 907: 906: 904: 903: 889: 883: 882: 880: 879: 865: 859: 858: 846: 840: 838: 836: 835: 826:. Archived from 820: 811: 810: 808: 807: 801: 795:. Archived from 794: 783: 777: 776: 774: 773: 768: 757: 751: 750: 748: 746: 740: 733: 724: 718: 711: 705: 691: 685: 679: 673: 672: 652: 606:Cupertino effect 594:Grammar checkers 464:A screenshot of 413:, respectively. 37:software feature 29:spelling checker 2162: 2161: 2157: 2156: 2155: 2153: 2152: 2151: 2122: 2121: 2120: 2115: 2084: 2064:Syntax guessing 2046: 2039: 2025:Predictive text 2020:Grammar checker 2001: 1994: 1966: 1933: 1922: 1888:Bank of English 1871: 1799: 1790: 1781: 1712: 1669: 1637: 1589: 1491:Distant reading 1466:Argument mining 1452: 1448:Text processing 1394: 1389: 1324: 1305: 1300: 1299: 1289: 1287: 1282: 1275: 1273: 1268: 1267: 1263: 1253: 1251: 1244: 1240: 1209: 1205: 1194: 1190: 1181: 1179: 1169: 1165: 1156: 1154: 1147: 1143: 1134: 1132: 1123: 1122: 1118: 1105: 1104: 1100: 1087: 1083: 1071: 1070: 1066: 1054: 1053: 1049: 1042: 1026: 1022: 1011: 1007: 997: 995: 982: 978: 966: 960: 956: 948: 946: 942:AbiWord/enchant 939: 938: 934: 925: 923: 915: 914: 910: 901: 899: 891: 890: 886: 877: 875: 867: 866: 862: 847: 843: 833: 831: 822: 821: 814: 805: 803: 799: 792: 784: 780: 771: 769: 766: 758: 754: 744: 742: 738: 731: 725: 721: 712: 708: 702:Wayback Machine 692: 688: 680: 676: 669: 653: 649: 644: 611:Grammar checker 602: 524: 510: 485:since Word 95. 427: 419: 391: 351:languages like 345:Asian languages 309: 261: 228: 223: 141: 125: 124: 123: 115: 114: 111: 109: 107: 105: 104: 102: 100: 98: 96: 95: 93: 91: 89: 87: 86: 84: 82: 80: 78: 77: 75: 73: 71: 17: 12: 11: 5: 2160: 2150: 2149: 2144: 2139: 2134: 2132:Spell checkers 2117: 2116: 2114: 2113: 2108: 2103: 2098: 2092: 2090: 2086: 2085: 2083: 2082: 2077: 2072: 2067: 2057: 2051: 2049: 2047:user interface 2041: 2040: 2038: 2037: 2032: 2027: 2022: 2017: 2012: 2006: 2004: 1996: 1995: 1993: 1992: 1987: 1982: 1976: 1974: 1968: 1967: 1965: 1964: 1959: 1954: 1949: 1944: 1938: 1936: 1928: 1927: 1924: 1923: 1921: 1920: 1915: 1910: 1905: 1900: 1895: 1890: 1885: 1879: 1877: 1873: 1872: 1870: 1869: 1864: 1859: 1854: 1849: 1844: 1839: 1834: 1829: 1824: 1819: 1814: 1809: 1803: 1801: 1792: 1783: 1782: 1780: 1779: 1774: 1772:Word embedding 1769: 1764: 1759: 1752:Language model 1749: 1744: 1739: 1734: 1729: 1723: 1721: 1714: 1713: 1711: 1710: 1705: 1703:Transfer-based 1700: 1695: 1690: 1685: 1679: 1677: 1671: 1670: 1668: 1667: 1662: 1657: 1651: 1649: 1643: 1642: 1639: 1638: 1636: 1635: 1630: 1625: 1620: 1615: 1610: 1605: 1599: 1597: 1588: 1587: 1582: 1577: 1572: 1567: 1562: 1556: 1555: 1550: 1545: 1540: 1535: 1530: 1525: 1524: 1523: 1518: 1508: 1503: 1498: 1493: 1488: 1483: 1478: 1476:Concept mining 1473: 1468: 1462: 1460: 1454: 1453: 1451: 1450: 1445: 1440: 1435: 1430: 1429: 1428: 1423: 1413: 1408: 1402: 1400: 1396: 1395: 1388: 1387: 1380: 1373: 1365: 1359: 1358: 1353: 1347: 1341: 1332: 1304: 1303:External links 1301: 1298: 1297: 1286:. May 29, 2009 1261: 1238: 1203: 1188: 1163: 1141: 1116: 1098: 1095:. p. 299. 1081: 1064: 1047: 1040: 1020: 1005: 976: 973:. p. 119. 954: 932: 908: 884: 873:www.cs.hmc.edu 860: 841: 812: 786:Earnest, Les. 778: 752: 727:Earnest, Les. 719: 706: 686: 674: 667: 646: 645: 643: 640: 639: 638: 633: 631:Autocorrection 628: 623: 618: 613: 608: 601: 598: 576: 575: 523: 520: 509: 506: 483:Microsoft Word 442:that the word 426: 423: 418: 415: 390: 387: 308: 305: 291:OpenOffice.org 260: 257: 227: 224: 222: 219: 183:user interface 172: 171: 152: 148: 140: 137: 120:language model 116: 68: 67: 66: 65: 49:word processor 15: 9: 6: 4: 3: 2: 2159: 2148: 2145: 2143: 2140: 2138: 2135: 2133: 2130: 2129: 2127: 2112: 2109: 2107: 2104: 2102: 2101:Hallucination 2099: 2097: 2094: 2093: 2091: 2087: 2081: 2078: 2076: 2073: 2071: 2068: 2065: 2061: 2058: 2056: 2053: 2052: 2050: 2048: 2042: 2036: 2035:Spell checker 2033: 2031: 2028: 2026: 2023: 2021: 2018: 2016: 2013: 2011: 2008: 2007: 2005: 2003: 1997: 1991: 1988: 1986: 1983: 1981: 1978: 1977: 1975: 1973: 1969: 1963: 1960: 1958: 1955: 1953: 1950: 1948: 1945: 1943: 1940: 1939: 1937: 1935: 1929: 1919: 1916: 1914: 1911: 1909: 1906: 1904: 1901: 1899: 1896: 1894: 1891: 1889: 1886: 1884: 1881: 1880: 1878: 1874: 1868: 1865: 1863: 1860: 1858: 1855: 1853: 1850: 1848: 1847:Speech corpus 1845: 1843: 1840: 1838: 1835: 1833: 1830: 1828: 1827:Parallel text 1825: 1823: 1820: 1818: 1815: 1813: 1810: 1808: 1805: 1804: 1802: 1796: 1793: 1788: 1784: 1778: 1775: 1773: 1770: 1768: 1765: 1763: 1760: 1757: 1753: 1750: 1748: 1745: 1743: 1740: 1738: 1735: 1733: 1730: 1728: 1725: 1724: 1722: 1719: 1715: 1709: 1706: 1704: 1701: 1699: 1696: 1694: 1691: 1689: 1688:Example-based 1686: 1684: 1681: 1680: 1678: 1676: 1672: 1666: 1663: 1661: 1658: 1656: 1653: 1652: 1650: 1648: 1644: 1634: 1631: 1629: 1626: 1624: 1621: 1619: 1618:Text chunking 1616: 1614: 1611: 1609: 1608:Lemmatisation 1606: 1604: 1601: 1600: 1598: 1596: 1592: 1586: 1583: 1581: 1578: 1576: 1573: 1571: 1568: 1566: 1563: 1561: 1558: 1557: 1554: 1551: 1549: 1546: 1544: 1541: 1539: 1536: 1534: 1531: 1529: 1526: 1522: 1519: 1517: 1514: 1513: 1512: 1509: 1507: 1504: 1502: 1499: 1497: 1494: 1492: 1489: 1487: 1484: 1482: 1479: 1477: 1474: 1472: 1469: 1467: 1464: 1463: 1461: 1459: 1458:Text analysis 1455: 1449: 1446: 1444: 1441: 1439: 1436: 1434: 1431: 1427: 1424: 1422: 1419: 1418: 1417: 1414: 1412: 1409: 1407: 1404: 1403: 1401: 1399:General terms 1397: 1393: 1386: 1381: 1379: 1374: 1372: 1367: 1366: 1363: 1357: 1354: 1351: 1348: 1345: 1342: 1340: 1336: 1333: 1331: 1327: 1322: 1318: 1317: 1312: 1307: 1306: 1285: 1271: 1265: 1249: 1242: 1234: 1230: 1226: 1222: 1218: 1214: 1207: 1200: 1199: 1192: 1178: 1174: 1167: 1152: 1145: 1131:on 2019-05-04 1130: 1126: 1120: 1112: 1108: 1102: 1094: 1093: 1085: 1078: 1074: 1068: 1060: 1059: 1058:Computerworld 1051: 1043: 1041:9781491948125 1037: 1033: 1032: 1024: 1016: 1009: 994:. p. 165 993: 992: 987: 980: 972: 965: 958: 944: 943: 936: 922: 918: 912: 898: 894: 888: 874: 870: 864: 857:. p. 79. 856: 852: 845: 830:on 2009-02-05 829: 825: 819: 817: 802:on 2011-07-20 798: 791: 790: 782: 765: 764: 756: 737: 730: 723: 716: 710: 703: 699: 696: 690: 684: 678: 670: 664: 660: 659: 651: 647: 637: 634: 632: 629: 627: 624: 622: 619: 617: 614: 612: 609: 607: 604: 603: 597: 595: 591: 589: 585: 581: 573: 570: 566: 563: 559: 556: 555: 554: 552: 548: 544: 540: 536: 535: 529: 519: 516: 505: 503: 499: 495: 491: 486: 484: 480: 472:spell checker 471: 467: 462: 458: 455: 451: 447: 446: 441: 435: 432: 425:Functionality 422: 414: 412: 408: 404: 400: 399:Google Chrome 396: 386: 384: 381: 380:Visual Tools' 377: 375: 369: 367: 362: 358: 354: 350: 349:agglutinative 346: 342: 338: 334: 330: 325: 323: 319: 315: 304: 302: 298: 294: 292: 288: 284: 280: 275: 273: 268: 266: 256: 254: 250: 248: 243: 241: 237: 233: 218: 216: 212: 210: 206: 201: 199: 195: 191: 186: 184: 179: 177: 169: 165: 161: 157: 153: 149: 146: 145: 144: 133: 132:Google Chrome 129: 121: 113: 64: 62: 61:search engine 58: 55:, electronic 54: 50: 46: 42: 38: 34: 30: 26: 25:spell checker 22: 2034: 2015:Concordancer 1411:Bag-of-words 1339:Peter Norvig 1314: 1290:25 September 1288:. Retrieved 1276:25 September 1274:. Retrieved 1264: 1254:24 September 1252:. Retrieved 1241: 1216: 1212: 1206: 1197: 1191: 1180:. Retrieved 1176: 1166: 1155:. Retrieved 1144: 1133:. Retrieved 1129:the original 1119: 1110: 1101: 1090: 1084: 1076: 1067: 1056: 1050: 1030: 1023: 1014: 1008: 996:. Retrieved 989: 979: 970: 957: 947:, retrieved 941: 935: 924:. Retrieved 920: 911: 900:. Retrieved 896: 893:"GNU Aspell" 887: 876:. Retrieved 872: 863: 854: 844: 832:. Retrieved 828:the original 804:. Retrieved 797:the original 788: 781: 770:. Retrieved 762: 755: 743:. Retrieved 736:the original 722: 709: 689: 677: 657: 650: 636:LanguageTool 626:Words (Unix) 592: 577: 571: 568: 564: 561: 557: 546: 542: 538: 532: 525: 515:concatenated 511: 487: 475: 453: 449: 443: 436: 428: 420: 392: 382: 379: 378: 370: 326: 314:Random House 310: 295: 276: 269: 262: 253:Henry Kučera 251: 244: 239: 229: 213: 208: 202: 187: 180: 173: 164:contractions 142: 69: 53:email client 32: 28: 24: 18: 1972:Topic model 1852:Text corpus 1698:Statistical 1565:Text mining 1406:AI-complete 1350:CBSNews.com 1309:‹ The 1092:PC Magazine 991:PC Magazine 588:Google Wave 490:grammatical 417:Specialties 333:WordPerfect 232:Les Earnest 205:suggestions 168:possessives 151:attributes. 33:spell check 2126:Categories 1693:Rule-based 1575:Truecasing 1443:Stop words 1335:Norvig.com 1182:2018-08-29 1157:2018-08-29 1135:2018-08-29 1077:VT-SPELLER 998:21 October 949:2023-02-19 926:2023-02-19 902:2023-02-19 897:aspell.net 878:2023-02-19 834:2008-12-18 806:2011-02-18 772:2011-02-18 745:10 October 642:References 498:neologisms 411:GNU Aspell 383:VT Speller 368:strategy. 272:GNU Aspell 156:morphology 57:dictionary 2002:reviewing 1800:standards 1798:Types and 1344:BBK.ac.uk 551:homophone 494:homophone 366:marketing 353:Hungarian 242:program. 230:In 1961, 2142:Spelling 1918:Wikidata 1898:FrameNet 1883:BabelNet 1862:Treebank 1832:PropBank 1777:Word2vec 1742:fastText 1623:Stemming 1311:template 1248:"Review" 1233:12283016 698:Archived 600:See also 543:Thailand 403:Hunspell 389:Browsers 374:Mac OS X 341:European 339:to many 329:WordStar 279:Hunspell 209:see also 45:software 21:software 2089:Related 2055:Chatbot 1913:WordNet 1893:DBpedia 1767:Seq2seq 1511:Parsing 1426:Trigram 1313:below ( 560:coming 528:context 470:AbiWord 466:Enchant 407:MySpell 395:Firefox 361:Iceland 357:Finnish 337:English 316:rushed 301:AbiWord 297:Enchant 287:MySpell 221:History 198:n-grams 160:English 35:) is a 2062:(c.f. 1720:models 1708:Neural 1421:Bigram 1416:n-gram 1330:Curlie 1316:Curlie 1231:  1038:  665:  580:Winnow 479:Oracle 468:, the 240:ispell 226:Pre-PC 166:, and 139:Design 2111:spaCy 1756:large 1747:GloVe 1229:S2CID 967:(PDF) 800:(PDF) 793:(PDF) 767:(PDF) 739:(PDF) 732:(PDF) 558:Their 431:typos 59:, or 1876:Data 1727:BERT 1292:2010 1278:2010 1256:2010 1036:ISBN 1000:2013 747:2011 663:ISBN 586:and 572:reel 547:bath 539:Thai 534:baht 450:bath 445:baht 409:and 397:and 355:and 331:and 263:The 259:Unix 41:text 27:(or 23:, a 1908:UBY 1328:at 1221:doi 569:its 567:if 565:sea 562:too 541:or 454:bat 452:or 318:OEM 307:PCs 289:in 31:or 19:In 2128:: 1227:. 1217:34 1215:. 1175:. 1109:. 1075:. 988:. 969:. 919:. 895:. 871:. 853:. 815:^ 590:. 63:. 51:, 2066:) 1789:, 1758:) 1754:( 1384:e 1377:t 1370:v 1294:. 1280:. 1258:. 1235:. 1223:: 1185:. 1160:. 1138:. 1113:. 1044:. 1017:. 1002:. 929:. 905:. 881:. 837:. 809:. 775:. 749:. 671:. 574:.

Index

software
software feature
text
software
word processor
email client
dictionary
search engine
language model

Google Chrome
morphology
English
contractions
possessives
synthetic languages
user interface
approximate string matching
Levenshtein distance
n-grams
suggestions
Clustering algorithms
Les Earnest
assembly language
Georgetown University
Henry Kučera
International Ispell
GNU Aspell
Hunspell
agglutinative languages

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.