93:
932:
In interlinear morphological glosses, various forms of punctuation separate the glosses. Typically, the words are aligned with their glosses; within words, a hyphen is used when a boundary is marked in both the text and its gloss, a period when a boundary appears in only one. That is, there should be
1292:
In the manual correction phase, the database creators manually corrected the boundaries of the interlinear gloss instances discovered by the sequence-labelling method in Step 2 of the automatic construction phase. The creators then verified the language names and language codes in a second and third
1541:
Researchers have used interlinear glosses is to obtain the morphological paradigms of the object language (i.e., the language being glossed). To automatically create morphological paradigms from interlinear glosses, researchers have created tables for every stem in the gloss and a (possibly empty)
1287:
Third, each interlinear gloss instance was assigned a language name (e.g., Tagalog) and an ISO 693-3 language ID. Language names and IDs were automatically assigned to interlinear glosses using
Coreference Resolution models from Natural Language Processing, where the interlinear gloss instance was
1275:
The Online
Database of Interlinear Text (ODIN) is a database of over 200,000 instances of interlinear glosses for more than 1,500 languages extracted from scholarly linguistic research. The database was constructed in two phases: automatic construction followed by manual correction. The automatic
286:
More modern 19th- and 20th-century approaches took to glossing vertically, aligning the same sort of word-by-word content in such a way that the metalanguage terms were placed vertically below the source language terms. In this style, the given example might be rendered thus (here
English gloss):
1280:
First, search engines (e.g., Google, Bing) were queried to retrieve scholarly documents that were likely to contain interlinear glosses. The queries comprised terms relevant to linguistic research such as grammatical morphemes (e.g., "NOM", short for nominative; "3SG", short for 3rd person
870:. if the morpheme-by-morpheme gloss (middle line) contains an element that does not correspond to an overt element in the example, a standard strategy is to include an overt "ø" in the object-language text, which is separated by a hyphen like an overt element would be:
114:
Interlinear glosses have been used for a variety of purposes over a long period of time. One common usage has been to annotate bilingual textbooks for language education. This sort of interlinearization serves to help make the meaning of a
933:
the same number of words separated with spaces in the text and its gloss, as well as the same number of hyphenated morphemes within a word and its gloss. This is the basic system, and can be applied universally. For example:
711:. At the sub-word level, segmentable morphemes are separated by hyphens, both in the example and in the gloss. There should be the same number of hyphens in the example and in the gloss, as shown in the following example:
705:. According to the Leipzig Glossing Rules, it is standard to left-align the words in the object language with the corresponding words in the metalanguage; this alignment can be seen between lines (1-3) and line (4).
275:
This "inline" style allows examples to be included within the flow of text, and for the word order of the target language to be written in an order which approximates the target language syntax. (In the gloss here,
282:
is reordered from the corresponding source order to approximate German syntax more naturally.) Even so, this approach requires the readers to "re-align" the correspondences between source and target forms.
1497:
Given the morpheme segmented line (first line above) and the free translation line (third line above), the task is to produce the middle glossed line comprising stem translations (e.g.,
456:
This approach is denser and also requires effort to read, but it is less reliant on the grammatical structure of the metalanguage for expressing the semantics of the target forms.
524:
a free translation, which may be placed in a separate paragraph or on the facing page if the structures of the languages are too different for it to follow the text line by line.
1023:, when a single word in the source language happens to correspond to a phrase in the glossing language, though a period would still be used for other situations, such as Greek
1284:
Second, each line in an extracted document was tagged for whether it was a line belonging to an interlinear gloss or not using sequence-labeling methods from
Machine Learning.
122:
Such annotations have occasionally been expressed not through interlinear layout, but rather through enumeration of words in the object and meta language. One such example is
376:
Finally, modern linguists have adopted the practice of using abbreviated grammatical category labels. A 2008 publication which repeats this example labels it as follows:
471:
Though there is no formal specification for the IGT format, the
Leipzig Glossing Rules are a set of guidelines that aim to standardize the format as much as possible.
1764:
Lehmann, Christian (2004-01-23). "Directions for interlinear morphemic translations". In Geert Booij; Christian
Lehmann; Joachim Mugdan; Stavros Skopeteas (eds.).
1513:). Sequence prediction models from Natural Language Processing have been used to perform this task. Two factors contribute to the difficulty of this task:
1159:
1102:
80:
and its translation, and the structure of the original language. In its simplest form, an interlinear gloss is a literal, word-for-word translation of the
2184:
1418:
Natural
Language Processing models leveraging interlinear gloss resources, such as the Online Database of Interlinear Text, have been developed.
2145:
Proceedings of the 2012 Conference of the North
American Chapter of the Association for Computational Linguistics: Human Language Technologies
2108:
Proceedings of the 2015 Conference of the North
American Chapter of the Association for Computational Linguistics: Human Language Technologies
834:; a list of standard abbreviations for grammatical categories that are widely used in linguistics can be found in the Leipzig Glossing Rules.
52:(series of brief explanations, such as definitions or pronunciations) placed between lines, such as between a line of original text and its
474:
An interlinear text for linguistics will commonly consist of some or all of the following, usually in this order, from top to bottom:
1729:, often displayed as interlinear glosses under the tagged words, sometimes at the same time as an interlinear word-by-word translation
1705:
grammatical category). A statistical machine learning model for morphological inflection can be used to fill in the missing entries.
900:
is treated similarly to affixation but with a tilde (instead of the standard hyphen) that connects the copied element to the stem:
2178:
2166:
2172:
1288:
tagged with the language name (and ID) that appears in the scholarly document the interlinear gloss instance was extracted from.
1796:
1297:
The language distribution of interlinear gloss instances in Online
Database of Interlinear Text after phase 1 and (phase 2)
2192:. A forum for recommendations on the Interlinar Morphemic Glossing of ancient languages as attested in ancient manuscripts.
840:. When a single object-language element corresponds to several metalanguage elements, they are separated by periods. E.g.,
2061:
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume
1038:
may be separated with a double hyphen (or, for ease of typing, an equal sign) rather than a hyphen. A French example:
1738:
1426:
Natural Language Processing systems, for example, have been developed to automatically produce interlinear glosses.:
60:. When glossed, each line of the original text acquires one or more corresponding lines of transcription known as an
1768:. Handbücher der Sprach- und Kommunikationswissenschaft. Vol. 2. Berlin: W. de Gruyter. pp. 1834–1857.
1877:
Xia, Fei; Lewis, William; Wayne, Michael; Slayden, Glenn; Georgi, Ryan; Crowgey, Joshua; Bender, Emily (2016).
31:
1542:
slot for every grammatical category (e.g., ERG) in the gloss. For instance, given the glossed sentence below:
826:) are inflectional affixes representing future tense and negation. These inflectional affixes are glossed as
1983:. Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP): 5251–5262.
2244:
2208:
Listing of older interlinear and construed texts, mostly from Latin or Ancient Greek and mostly to English
17:
2212:
49:
119:
explicit without attempting to formally model the structural characteristics of the source language.
460:
1861:
From Aari to Zulu: massively multilingual creation of language tools using interlinear glossed tex
2195:
1259:
A few other conventions which are sometimes seen are illustrated in the Leipzig Glossing Rules.
516:-by-morpheme gloss, where morphemes within a word are separated by hyphens or other punctuation,
1788:
1726:
778:
1878:
1929:
Xingyuan, Zhao; Satoru, Ozaki; Anastasopoulos, Antonios; Neubig, Graham; Levin, Lori (2020).
2161:
1937:. Proceedings of the 28th International Conference on Computational Linguistics: 5397–5408.
1822:
1780:
1524:
Some words in the morpheme segmented line have multiple correspondences in the gloss (e.g.,
2239:
2234:
2217:
123:
2205:
8:
909:
1521:
is the last word in the translation but the second word in the morpheme segmented line).
1517:
The translation is not necessarily in alignment with the morpheme segmented line (e.g.,
92:
2121:
2084:
2064:
2037:
1994:
1948:
1943:
1898:
1693:
would be filled (since it was observed in the interlinear gloss data) but the slot for
1562:
2024:
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
1989:
1931:"Automatic Interlinear Glossing for Under-Resourced Languages Leveraging Translations"
1823:"The Leipzig Glossing Rules. Conventions for Interlinear Morpheme by Morpheme Glosses"
539:
2088:
2079:
1998:
1952:
1792:
1781:
1741:, nineteenth-century composer and promoter of interlinear texts for language learning
1587:
978:
127:
2162:
The Leipzig Glossing Rules: Conventions for interlinear morpheme-by-morpheme glosses
2125:
2041:
2026:. Brussels, Belgium: Association for Computational Linguistics. pp. 2883–2889.
1267:
Efforts have been undertaken to digitize IGT for hundreds of the world's languages.
2111:
2074:
2027:
1984:
1938:
1902:
1890:
562:
529:
103:
2189:
1975:
Moeller, Sarah; Liu, Ling; Yang, Changbing; Kann, Katharina; Hulden, Mans (2020).
2110:. Denver, Colorado: Association for Computational Linguistics. pp. 922–931.
1930:
1723:– a gloss sometimes used with Chinese or Japanese to show the pronunciation
1626:
1605:
1599:
1574:
1568:
1192:
962:
880:
107:
1976:
1720:
506:
27:
Explanatory matter inserted between a line of original text and its translation
2056:
1894:
1536:
2228:
1165:
1119:
1080:
946:
735:
554:
3. a gloss showing the underlying tones in citation form (before undergoing
76:
for short. Such glosses help the reader follow the relationship between the
2032:
1697:
would be empty (assuming that no other interlinear gloss instance contains
772:
729:
2116:
2200:
2063:. Online: Association for Computational Linguistics. pp. 1901–1907.
1879:"Enriching a massively multilingual database of interlinear glossed text"
555:
548:
479:
116:
81:
77:
53:
37:
2147:. Montréal, Canada: Association for Computational Linguistics: 396–406.
2103:
2019:
1744:
1593:
850:
373:
Here word ordering is determined by the syntax of the object language.
1821:
Bickel, Balthasar; Bernard Comrie; Martin Haspelmath (February 2008).
1505:) and the grammatical category labels corresponding to affixes (e.g.,
1413:
2141:"Leveraging supplemental representations for sequential transduction"
1820:
1766:
Morphologie. Ein internationales Handbuch zur Flexion und Wortbildung
1076:
2196:
Online Interlinear of Biblical Greek Scriptures (New Testament) text
2140:
1783:
Language typology and language universals: an international handbook
2069:
1732:
513:
499:
57:
41:
2020:"An Encoder-Decoder Approach to the Paradigm Cell Filling Problem"
1928:
1087:
sulat, susulat, sumulat, sumusulat (verbal declensions) (Tagalog)
961:
1735:, often displayed as a gloss or annotation to the original text.
1034:
However, sometimes finer distinctions may be made. For example,
1714:
1035:
2104:"Inflection Generation as Discriminative String Transduction"
1185:
1072:
2102:
Nicolai, Garrett; Cherry, Colin; Kondrak, Grzegorz (2015).
1537:
Automatic discovery of morphological structure from glosses
2057:"Applying the Transformer to Character-level Transduction"
1079:, transfixes, etc.) may be set off by angle brackets, and
1717:– Japanese tradition of glossing Classical Chinese texts
1276:
construction stage itself was completed in three steps:
1191:
Morphemes which cannot be easily separated out, such as
1195:, may be marked with a backslash rather than a period:
810:) is translated into the corresponding English lexeme (
495:
a conventional transliteration into the Latin alphabet,
459:
In computing, special text markers are provided in the
1270:
532:
clause has been transcribed with five lines of text:
463:
to indicate the start and end of interlinear glosses.
1772:
1016:
An underscore may be used instead of a period, as in
692:
goa2 iau2-boe7 koat4-teng7 tang1-si5 boeh4 tng2-khi3.
686:
goa1 iau1-boe3 koat2-teng3 tang7-si5 boeh2 tng1-khi3.
537:
450:
1SG.SUBJ-3SG.OBJ-mach-APPL DET 1SG.POSS-Sohn ein Haus
2101:
1876:
1757:
1636:
evening-INS 1.SG.NOM run-PFV.PST.SG.FEM in store.ACC
1977:"IG2P: From Interlinear Glossed Texts to Paradigms"
1924:
1922:
1920:
1918:
1414:
Automatic processing of interlinear gloss instances
2055:Wu, Shijie; Cotterell, Ryan; Hulden, Mans (2021).
1974:
698:(5.) "I have not yet decided when I shall return."
2054:
2017:
1970:
1968:
1827:Dept. of Linguistics – Resources – Glossing Rules
788:now they-OBL-GEN farm forever behind stay-FUT-NEG
2226:
1915:
2138:
1872:
1870:
785:Gila abur-u-n ferma hamišaluǧ güǧüna amuqʼ-da-č
2201:ODIN - The Online Database of INterlinear text
1965:
1847:A Basic Vocabulary for a Beginner in Taiwanese
1816:
1814:
1812:
1810:
1808:
1490:you-GEN camel we.OBL-ERG.1.PL-steal-PRT be.NEG
1262:
983:
791:'Now their farm will not stay behind forever.'
420:
392:
384:
821:
815:
805:
799:
2139:Bhargava, Aditya; Kondrak, Grzegorz (2012).
1867:
277:
97:
2179:Towards a General Model of Interlinear Text
1852:
1805:
908:
2018:Silfverberg, Miikka; Hulden, Mans (2018).
1778:
680:goá iáu-boē koat-tēng tang-sî boeh tńg-khì
363:ni- c- chihui -lia in no- piltzin ce calli
263:ni- c- chihui -lia in no- piltzin ce calli
2115:
2078:
2068:
2031:
1988:
1942:
1586:
1481:
977:
777:
424:
400:
1009:room-from speed-with go_out-perfective-I
91:
2213:"The New Old Way of Learning Languages"
1763:
1643:There would be a paradigm for the stem
1625:
1604:
1598:
1573:
1567:
1083:with tildes, rather than with hyphens:
879:
695:(4.) I not-yet decide when want return.
269:ich mache es für der mein Sohn ein Haus
14:
2227:
1858:
1551:
447:ni-c-chihui-lia in no-piltzin ce calli
1466:
1458:
1435:
1421:
1227:
1219:
1208:
945:
734:
2190:Glossing Ancient Languages and Texts
1639:'In the evening I ran to the store.'
1561:
1462:
1454:
771:
728:
1487:mi-s ħumukuli elu-ab-ok'ek'-asi anu
1271:Online Database of Interlinear Text
1223:
1204:
1071:Affixes which cause discontinuity (
709:Morpheme-by-morpheme correspondence
561:4. a morpheme-by-morpheme gloss in
396:
366:I it make for to-the my son a house
24:
2167:Interlinear Glossed Text Standards
1592:
1293:pass over the data, respectively.
1247:'to our fathers' (the singular of
849:
814:) while the inflectional affixes (
410:
25:
2256:
2155:
1883:Language Resources and Evaluation
1470:
1006:room-ABL speed-COM go.out-PFV-1sg
1944:10.18653/v1/2020.coling-main.471
1863:(PhD). University of Washington.
1849:by Ko Chek Hoan and Tan Pang Tin
1633:Vecher-om ya pobeja-la v magazin
99:Toussaint–Langenscheidt Spanisch
2206:Latinum Interlinear Method page
2173:Interlinear Glossed Text Levels
2132:
2095:
2048:
1990:10.18653/v1/2020.emnlp-main.424
388:
2080:10.18653/v1/2021.eacl-main.163
2011:
1839:
1659:(Partial) paradigm for pobeja
927:
32:List of glossing abbreviations
13:
1:
2185:Interlinear Morphemic Glosses
1787:. Walter de Gruyter. p.
1750:
1493:'We didn't steal your camel.'
528:As an example, the following
1322:interlinear gloss instances
1316:interlinear gloss instances
822:
816:
806:
800:
766:
758:
750:
742:
723:
715:
669:
666:
663:
655:
652:
649:
641:
638:
635:
627:
624:
621:
613:
610:
607:
599:
596:
593:
583:
578:
573:
538:
466:
7:
1779:Haspelmath, Martin (2008).
1708:
1302:Range of interlinear gloss
1263:Interlinear gloss resources
1244:our-DAT.PL father\PL-DAT.PL
838:One-to-many correspondences
796:Grammatical category labels
10:
2261:
1012:'I left the room quickly.'
568:5. an English translation:
87:
29:
1895:10.1007/s10579-015-9325-4
369:"I made my son a house."
66:interlinear glossed text
1620:
1612:
1581:
1557:
1546:
1003:oda-dan hız-lı çık-tı-m
904:
439:
431:
416:
406:
380:
355:
347:
339:
331:
323:
315:
307:
299:
291:
249:
235:
221:
207:
193:
179:
165:
151:
137:
72:) – an
1727:Part-of-speech tagging
1476:
1449:
1441:
1430:
1128:
1109:
1098:
1090:
1048:
1045:
975:
972:
959:
956:
943:
940:
874:
844:
703:Word-by-word alignment
551:for the surface tones,
461:Specials Unicode block
442:
434:
419:
409:
383:
278:
255:
241:
227:
213:
199:
185:
171:
157:
143:
111:
98:
1859:Georgi, Ryan (2016).
1188:for other examples.)
95:
2218:The American Scholar
2033:10.18653/v1/D18-1315
937:Odadan hızlı çıktım.
124:Wilhelm von Humboldt
96:Interlinear text in
2245:Reordered languages
2117:10.3115/v1/N15-1093
1660:
1298:
1178:contemplative~write
1143:contemplative~write
988:go_out-perfective-I
1701:inflected for the
1658:
1422:Automatic glossing
1404:189,244 (157,114)
1296:
1160:contemplative mood
1153:⟨um⟩
1149:⟨um⟩
1131:⟨um⟩
1112:⟨um⟩
1103:contemplative mood
868:Non-overt elements
512:a word-by-word or
112:
1798:978-3-11-011423-2
1687:
1686:
1450:elu-ab-ok'ek'-asi
1411:
1410:
1031:'to the houses'.
547:2. a gloss using
266:1 3 2 4 5 6 7 8 9
260:1 2 3 4 5 6 7 8 9
128:Classical Nahuatl
126:'s annotation of
46:interlinear gloss
16:(Redirected from
2252:
2149:
2148:
2136:
2130:
2129:
2119:
2099:
2093:
2092:
2082:
2072:
2052:
2046:
2045:
2035:
2015:
2009:
2008:
2006:
2005:
1992:
1972:
1963:
1962:
1960:
1959:
1946:
1926:
1913:
1912:
1910:
1909:
1874:
1865:
1864:
1856:
1850:
1843:
1837:
1836:
1834:
1833:
1818:
1803:
1802:
1786:
1776:
1770:
1769:
1761:
1680:PFV.PST.SG.MASC
1661:
1657:
1629:
1608:
1602:
1596:
1590:
1577:
1571:
1565:
1553:
1483:
1472:
1468:
1464:
1460:
1456:
1437:
1376:12,822 (15,560)
1362:40,260 (46,420)
1348:97,158 (81,218)
1334:36,691 (10,814)
1299:
1295:
1229:
1225:
1221:
1210:
1206:
1177:
1175:
1169:
1154:
1150:
1147:sulat su~sulat s
1142:
1140:
1132:
1123:
1113:
1030:
1021:
985:
981:
965:
949:
912:
883:
853:
825:
819:
809:
803:
781:
775:
738:
732:
691:
685:
679:
586:
581:
576:
544:transliteration,
543:
536:1. the standard
530:Taiwanese Minnan
509:transliteration,
426:
422:
412:
402:
398:
394:
390:
386:
281:
104:Spanish-language
101:
62:interlinear text
21:
2260:
2259:
2255:
2254:
2253:
2251:
2250:
2249:
2225:
2224:
2158:
2153:
2152:
2137:
2133:
2100:
2096:
2053:
2049:
2016:
2012:
2003:
2001:
1973:
1966:
1957:
1955:
1927:
1916:
1907:
1905:
1875:
1868:
1857:
1853:
1844:
1840:
1831:
1829:
1819:
1806:
1799:
1777:
1773:
1762:
1758:
1753:
1711:
1703:PFV.PST.SG.MASC
1695:PFV.PST.SG.MASC
1672:PFV.PST.SG.FEM
1653:PFV.PST.SG.MASC
1647:with slots for
1641:
1631:
1618:
1610:
1579:
1555:
1539:
1495:
1485:
1474:
1447:
1439:
1424:
1416:
1273:
1265:
1257:
1241:unser-n Väter-n
1239:
1231:
1212:
1182:
1176:trigger⟩
1173:
1171:
1163:
1152:
1148:
1145:
1138:
1136:
1130:
1126:
1117:
1111:
1107:
1096:
1088:
1069:
1059:
1051:
1043:
1028:
1019:
1014:
1001:
990:
970:
954:
938:
930:
925:
915:
895:
885:
865:
855:
793:
783:
764:
756:
748:
740:
721:
700:
689:
683:
677:
675:
661:
647:
633:
619:
605:
591:
584:
579:
574:
469:
454:
445:
437:
429:
414:
404:
381:ni-c-chihui-lia
371:
361:
353:
345:
337:
329:
321:
313:
305:
297:
273:
258:
244:
230:
216:
202:
188:
174:
160:
146:
90:
34:
28:
23:
22:
15:
12:
11:
5:
2258:
2248:
2247:
2242:
2237:
2223:
2222:
2221:, Autumn 2008.
2209:
2203:
2198:
2193:
2187:
2182:
2176:
2170:
2164:
2157:
2156:External links
2154:
2151:
2150:
2131:
2094:
2047:
2010:
1964:
1914:
1889:(2): 321–349.
1866:
1851:
1838:
1804:
1797:
1771:
1755:
1754:
1752:
1749:
1748:
1747:
1742:
1739:James Hamilton
1736:
1730:
1724:
1718:
1710:
1707:
1691:PFV.PST.SG.FEM
1685:
1684:
1681:
1677:
1676:
1673:
1669:
1668:
1665:
1649:PFV.PST.SG.FEM
1619:
1611:
1580:
1556:
1545:
1544:
1538:
1535:
1534:
1533:
1522:
1475:
1448:
1440:
1429:
1428:
1423:
1420:
1415:
1412:
1409:
1408:
1405:
1402:
1401:1,326 (1,493)
1399:
1395:
1394:
1391:
1390:2,313 (3,012)
1388:
1385:
1381:
1380:
1377:
1374:
1371:
1367:
1366:
1365:21.27 (29.55)
1363:
1360:
1357:
1353:
1352:
1351:51.34 (51.69)
1349:
1346:
1343:
1339:
1338:
1335:
1332:
1329:
1325:
1324:
1318:
1312:
1306:
1290:
1289:
1285:
1282:
1272:
1269:
1264:
1261:
1232:
1213:
1198:
1197:
1127:
1108:
1097:
1089:
1086:
1085:
1052:
1044:
1041:
1040:
991:
971:
955:
939:
936:
935:
929:
926:
903:
902:
873:
872:
843:
842:
765:
757:
749:
741:
722:
714:
713:
662:
648:
634:
620:
606:
592:
572:
571:
570:
569:
566:
559:
552:
545:
526:
525:
518:
517:
510:
507:morphophonemic
503:
502:transcription,
496:
493:
482:(typically in
468:
465:
438:
430:
415:
405:
379:
378:
354:
346:
338:
330:
322:
314:
306:
298:
290:
289:
245:
231:
217:
203:
189:
175:
161:
147:
133:
132:
110:speakers, 1910
89:
86:
26:
9:
6:
4:
3:
2:
2257:
2246:
2243:
2241:
2238:
2236:
2233:
2232:
2230:
2220:
2219:
2214:
2211:Ernest Blum,
2210:
2207:
2204:
2202:
2199:
2197:
2194:
2191:
2188:
2186:
2183:
2180:
2177:
2174:
2171:
2168:
2165:
2163:
2160:
2159:
2146:
2142:
2135:
2127:
2123:
2118:
2113:
2109:
2105:
2098:
2090:
2086:
2081:
2076:
2071:
2066:
2062:
2058:
2051:
2043:
2039:
2034:
2029:
2025:
2021:
2014:
2000:
1996:
1991:
1986:
1982:
1978:
1971:
1969:
1954:
1950:
1945:
1940:
1936:
1932:
1925:
1923:
1921:
1919:
1904:
1900:
1896:
1892:
1888:
1884:
1880:
1873:
1871:
1862:
1855:
1848:
1845:Example from
1842:
1828:
1824:
1817:
1815:
1813:
1811:
1809:
1800:
1794:
1790:
1785:
1784:
1775:
1767:
1760:
1756:
1746:
1743:
1740:
1737:
1734:
1731:
1728:
1725:
1722:
1719:
1716:
1713:
1712:
1706:
1704:
1700:
1696:
1692:
1689:The slot for
1682:
1679:
1678:
1674:
1671:
1670:
1666:
1663:
1662:
1656:
1654:
1650:
1646:
1640:
1637:
1634:
1630:
1628:
1622:
1617:
1614:
1609:
1607:
1601:
1595:
1589:
1583:
1578:
1576:
1570:
1564:
1559:
1554:
1548:
1543:
1531:
1527:
1523:
1520:
1516:
1515:
1514:
1512:
1508:
1504:
1500:
1494:
1491:
1488:
1484:
1478:
1473:
1451:
1446:
1443:
1438:
1432:
1427:
1419:
1406:
1403:
1400:
1397:
1396:
1392:
1389:
1386:
1383:
1382:
1378:
1375:
1372:
1369:
1368:
1364:
1361:
1358:
1355:
1354:
1350:
1347:
1344:
1341:
1340:
1337:19.39 (6.88)
1336:
1333:
1330:
1327:
1326:
1323:
1319:
1317:
1313:
1311:
1307:
1305:
1301:
1300:
1294:
1286:
1283:
1279:
1278:
1277:
1268:
1260:
1256:
1254:
1251:'fathers' is
1250:
1245:
1242:
1238:
1235:
1230:
1216:
1211:
1201:
1196:
1194:
1189:
1187:
1181:
1179:
1172:⟨agent
1168:.past⟩
1167:
1166:agent trigger
1161:
1156:
1144:
1134:
1125:
1121:
1120:agent trigger
1115:
1106:
1104:
1100:
1095:
1092:
1084:
1082:
1081:reduplication
1078:
1074:
1068:
1067:'I love you.'
1065:
1062:
1058:
1055:
1050:
1047:
1039:
1037:
1032:
1026:
1022:
1013:
1010:
1007:
1004:
1000:
997:
994:
989:
986:
980:
974:
969:
966:
964:
958:
953:
950:
948:
942:
934:
924:
921:
918:
914:
911:
906:
901:
899:
898:Reduplication
894:
891:
888:
884:
882:
876:
871:
869:
864:
863:'to come out'
861:
858:
854:
852:
846:
841:
839:
835:
833:
829:
824:
818:
813:
808:
802:
797:
792:
789:
786:
782:
780:
774:
768:
763:
760:
755:
752:
747:
744:
739:
737:
731:
725:
720:
717:
712:
710:
706:
704:
699:
696:
693:
687:
681:
674:
671:
668:
665:
660:
657:
654:
651:
646:
643:
640:
637:
632:
629:
626:
623:
618:
615:
612:
609:
604:
601:
598:
595:
590:
587:
582:
577:
567:
564:
560:
557:
553:
550:
546:
542:
541:
535:
534:
533:
531:
523:
522:
521:
520:and finally
515:
511:
508:
504:
501:
497:
494:
491:
490:
485:
481:
478:The original
477:
476:
475:
472:
464:
462:
457:
453:
451:
448:
444:
441:
436:
433:
428:
418:
413:
408:
403:
382:
377:
374:
370:
367:
364:
360:
357:
352:
349:
344:
341:
336:
333:
328:
325:
320:
317:
312:
309:
304:
301:
296:
293:
288:
284:
280:
272:
270:
267:
264:
261:
257:
254:
251:
248:
243:
240:
237:
234:
229:
226:
223:
220:
215:
212:
209:
206:
201:
198:
195:
192:
187:
184:
181:
178:
173:
170:
167:
164:
159:
156:
153:
150:
145:
142:
139:
136:
131:
129:
125:
120:
118:
109:
106:textbook for
105:
100:
94:
85:
83:
79:
75:
71:
67:
63:
59:
56:into another
55:
51:
47:
43:
39:
33:
19:
2216:
2144:
2134:
2107:
2097:
2060:
2050:
2023:
2013:
2002:. Retrieved
1980:
1956:. Retrieved
1934:
1906:. Retrieved
1886:
1882:
1860:
1854:
1846:
1841:
1830:. Retrieved
1826:
1782:
1774:
1765:
1759:
1702:
1698:
1694:
1690:
1688:
1652:
1648:
1644:
1642:
1638:
1635:
1632:
1623:
1615:
1584:
1560:
1549:
1540:
1529:
1525:
1518:
1510:
1506:
1502:
1498:
1496:
1492:
1489:
1486:
1479:
1452:
1444:
1433:
1425:
1417:
1393:1.22 (1.92)
1379:6.78 (9.96)
1321:
1315:
1309:
1303:
1291:
1274:
1266:
1258:
1252:
1248:
1246:
1243:
1240:
1236:
1233:
1217:
1214:
1202:
1199:
1190:
1183:
1180:
1157:
1146:
1135:
1116:
1101:
1093:
1070:
1066:
1063:
1060:
1056:
1053:
1033:
1024:
1017:
1015:
1011:
1008:
1005:
1002:
998:
995:
992:
987:
967:
951:
931:
922:
919:
916:
907:
897:
896:
892:
889:
886:
877:
867:
866:
862:
860:come.out-INF
859:
856:
847:
837:
836:
831:
827:
811:
804:, the stem (
795:
794:
790:
787:
784:
769:
761:
753:
745:
726:
718:
708:
707:
702:
701:
697:
694:
688:
682:
676:
672:
658:
644:
630:
616:
602:
588:
549:tone numbers
527:
519:
488:
487:
483:
473:
470:
458:
455:
452:
449:
446:
375:
372:
368:
365:
362:
358:
350:
342:
334:
326:
318:
310:
302:
294:
285:
274:
271:
268:
265:
262:
259:
252:
246:
238:
232:
224:
218:
210:
204:
196:
190:
182:
176:
168:
162:
154:
148:
140:
134:
121:
113:
73:
69:
65:
61:
45:
35:
2240:Linguistics
2235:Translation
1667:inflection
1328:>10,000
1320:Percent of
1077:circumfixes
928:Punctuation
923:'is buying'
556:tone sandhi
489:bold italic
480:orthography
117:source text
82:source text
78:source text
74:interlinear
54:translation
38:linguistics
18:Interlinear
2229:Categories
2070:2005.10213
2004:2021-12-15
1958:2021-12-15
1908:2021-12-15
1832:2010-06-30
1751:References
1745:Metaphrase
1675:pobeja-la
1407:100 (100)
1387:838 (862)
1373:326 (460)
1359:122 (139)
1342:1000-9999
1314:Number of
1310:languages
1308:Number of
1304:instances
1281:singular).
1064:I⹀you⹀love
1061:je⹀te⹀aime
1049:I⹀you⹀love
1046:je⹀te⹀aime
1042:Je t'aime.
1029:FEM.PL.DAT
968:speed-with
801:amuqʼ-da-č
767:amuqʼ-da-č
540:pe̍h-ōe-jī
417:no-piltzin
30:See also:
2089:218718982
1999:226262296
1953:227231816
1733:Treebanks
1721:Ruby text
1582:pobeja-la
1547:Vecher-om
952:room-from
848:come.out-
751:hamišaluǧ
628:koat-teng
625:koat-teng
622:koat-tēng
467:Structure
2181:(E-MELD)
2175:(E-MELD)
2169:(E-MELD)
2126:14929030
2042:53082616
1709:See also
1550:evening-
1511:ERG.1.PL
1442:ħumukuli
1356:100-999
1345:37 (31)
1234:(German)
1164:⟨
1141:trigger⟩
1099:su~sulat
1054:(French)
973:çık-tı-m
920:IPFV~buy
724:abur-u-n
670:tng-khi.
667:tng-khi.
514:morpheme
500:phonetic
279:mache es
58:language
42:pedagogy
1903:2674996
1621:magazin
1469:-steal-
1218:father\
1215:Väter-n
1200:unser-n
1162:~write
1155:u~sulat
1133:u~sulat
1073:infixes
1036:clitics
1025:oikíais
1018:go_out-
993:Turkish
976:go.out-
941:oda-dan
917:bi~bili
905:bi~bili
890:boy-NOM
857:çık-mak
845:çık-mak
820:) and (
754:forever
673:return.
664:tńg-khì
642:tang-si
639:tang-si
636:tang-sî
617:not-yet
614:iau-boe
611:iau-boe
608:iáu-boē
563:English
340:piltzin
222:piltzin
88:History
2124:
2087:
2040:
1997:
1951:
1935:COLING
1901:
1795:
1715:Kanbun
1699:pobeja
1645:pobeja
1624:store.
1530:be.NEG
1398:Total
1370:10-99
1331:3 (1)
1237:
1193:umlaut
1174:
1170:write
1158:write
1151:ulat s
1139:
1137:⟨agent
1122:.past⟩
1105:~write
1057:
1027:house.
999:
996:
960:speed-
957:hız-lı
887:puer-ø
875:puer-ø
762:behind
759:güǧüna
631:decide
484:italic
399:-mach-
327:to-the
308:chihui
166:chihui
108:German
2122:S2CID
2085:S2CID
2065:arXiv
2038:S2CID
1995:S2CID
1981:EMNLP
1949:S2CID
1899:S2CID
1664:Slot
1519:camel
1445:camel
1253:Vater
1249:Väter
1186:affix
1184:(See
1124:write
1094:write
1091:sulat
944:room-
893:'boy'
798:. In
770:stay-
743:ferma
727:they-
565:, and
440:calli
427:-Sohn
359:house
356:calli
250:calli
158:mache
50:gloss
48:is a
44:, an
1793:ISBN
1651:and
1585:run-
1434:you-
1431:mi-s
1384:1-9
1203:our-
1114:ulat
913:~buy
910:IPFV
878:boy-
830:and
812:stay
807:amuq
746:farm
716:Gila
690:(3.)
684:(2.)
678:(1.)
659:want
656:boeh
653:boeh
650:boeh
645:when
589:(4.)
585:(3.)
580:(2.)
575:(1.)
443:Haus
425:POSS
401:APPL
389:SUBJ
316:-lia
311:make
256:Haus
228:Sohn
214:mein
180:-lia
102:, a
40:and
2112:doi
2075:doi
2028:doi
1985:doi
1939:doi
1891:doi
1789:715
1627:ACC
1606:FEM
1594:PST
1588:PFV
1575:NOM
1552:INS
1526:anu
1503:you
1482:NEG
1480:be.
1477:anu
1471:PRT
1459:ERG
1455:OBL
1453:we.
1436:GEN
1224:DAT
1205:DAT
1020:PFV
984:1sg
979:PFV
963:COM
947:ABL
881:NOM
851:INF
832:NEG
828:FUT
779:NEG
773:FUT
736:GEN
730:OBL
719:now
600:goa
597:goa
594:goá
486:or
435:ein
421:1SG
411:DET
397:OBJ
393:3SG
385:1SG
343:son
332:no-
319:for
292:ni-
242:ein
208:no-
200:der
186:für
144:ich
138:ni-
70:IGT
64:or
36:In
2231::
2215:,
2143:.
2120:.
2106:.
2083:.
2073:.
2059:.
2036:.
2022:.
1993:.
1979:.
1967:^
1947:.
1933:.
1917:^
1897:.
1887:50
1885:.
1881:.
1869:^
1825:.
1807:^
1791:.
1683:?
1655::
1616:in
1600:SG
1569:SG
1558:ya
1532:).
1499:mi
1467:PL
1228:PL
1220:PL
1209:PL
1075:,
817:da
558:),
505:a
498:a
492:),
432:ce
407:in
348:ce
335:my
324:in
303:it
300:c-
236:ce
194:in
172:es
152:c-
130::
84:.
2128:.
2114::
2091:.
2077::
2067::
2044:.
2030::
2007:.
1987::
1961:.
1941::
1911:.
1893::
1835:.
1801:.
1613:v
1603:.
1597:.
1591:.
1572:.
1566:.
1563:1
1528::
1509::
1507:a
1501::
1465:.
1463:1
1461:.
1457:-
1255:)
1226:.
1222:-
1207:.
1129:s
1118:⟨
1110:s
982:-
823:č
776:-
733:-
603:I
423:.
395:.
391:-
387:.
351:a
295:I
253:9
247:9
239:8
233:8
225:7
219:7
211:6
205:6
197:5
191:5
183:4
177:4
169:2
163:3
155:3
149:2
141:1
135:1
68:(
20:)
Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.