Knowledge

:Knowledge Signpost/2022-08-31/Recent research - Knowledge


"In this thesis, we have computationally analysed the language used in Knowledge in order to find similarities between the language used in different articles. To do so, we have syntactically parsed articles of Knowledge in different languages using UDPipe 2.0 and gathered the languages' recurrent syntactic patterns using Grammatical Framework's GF-UD. Then, we have compared the analyses with cosine similarity in two ways: based on dependency relations and based on linguistic patterns. We have seen that there is a basis for the Abstract Knowledge project: there are syntactic similarities not only within one language, but also within multiple languages. In addition, we have found that semantically-related topics have a higher similarity than those which are not. Finally, we have gathered syntactic patterns of every language and compared them, which can constitute the basis of the creation of the Renderers for Abstract Knowledge."
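The comparison "with cosine similarity ... based on dependency relations" mentioned in the abstract can be illustrated with a minimal sketch. This is not the thesis's code: the helper names `relation_profile` and `cosine_similarity` and the two tiny label lists are hypothetical, and it assumes each article has already been parsed into a flat list of dependency relation labels.

```python
from collections import Counter
from math import sqrt


def relation_profile(relations):
    """Relative frequency of each dependency relation label in one article."""
    counts = Counter(relations)
    total = sum(counts.values())
    return {label: n / total for label, n in counts.items()}


def cosine_similarity(a, b):
    """Cosine similarity between two sparse frequency profiles (dicts)."""
    labels = set(a) | set(b)
    dot = sum(a.get(label, 0.0) * b.get(label, 0.0) for label in labels)
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty profile is treated as dissimilar to everything
    return dot / (norm_a * norm_b)


# Hypothetical parser output for two short articles (illustrative labels only).
article_a = ["nsubj", "obj", "det", "nsubj", "amod"]
article_b = ["nsubj", "det", "det", "obl", "amod"]

similarity = cosine_similarity(relation_profile(article_a),
                               relation_profile(article_b))
print(f"cosine similarity: {similarity:.3f}")
```

Using relative frequencies rather than raw counts keeps long and short articles comparable; whether the thesis normalises in this way is an assumption of the sketch.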
platform's incentives. Then, adding the Contributors data, we discuss the three-dimensional model (more precise than the three-dimensional VAR). It provides a more accurate short-term prediction of Edits than the two-dimensional dynamic model. The global picture shows that the number of new Edits tends to decline in the future, while the number of new Contributors and Readers will grow in the long run. This can probably be explained by the fact that many of the subjects, in which Readers are interested, have already been contributed to the Knowledge platform, and there is no demand for the new Edits. However, the Contributors will continue to correct some articles, and the Readers will be visiting the platform for the references.
"... how much money external-website owners would have to pay in order to obtain an equivalent number of clicks by other means, such as paid ads. In this spirit, we applied the Google Ads API to the content of official websites linked from Knowledge in order to generate key words for sponsored search and estimated their cost per click at market price. We conclude that the owners of external websites linked from English Knowledge's infoboxes would need to collectively pay a total of around $7–13 million per month (or $84–156 million per year) for sponsored search in order to obtain the same volume of traffic that they receive from Knowledge for free."
"Knowledge frequently serves as a stepping stone between search engines and third-party websites. We captured this effect quantitatively as well as in a manual analysis, where we found that URLs that are down-ranked or censored by search engines, and thus not retrievable via search, can often be found in Knowledge infoboxes, which leads search users to take a detour via Knowledge. We conclude that Knowledge regularly and systematically meets information needs that search engines do not meet, which further confirms Knowledge's central role in the Web ecosystem."
"... that internet culture, a topic held by most articles about websites, is indeed particularly over-represented among the articles with the very highest official-link CTR. Similar effects were observed for society (a loose mix of articles), sports, software, and entertainment, among others. On the contrary, we observed that geographical, biography, and television, among others, were particularly under-represented among the highest-CTR official links."
"During the period considered, Knowledge had 5.3M articles that contained at least one of 63.1M external links (totaling 49.8M unique target URLs). In total, 35.3M (56.0%) of these links appeared in references, 24.9M (39.5%) in article bodies, and 2.8M (4.5%) in infoboxes. Around 1.3M articles in English Knowledge had an infobox with links, and the average number of links per infobox in these articles was 2.08."
... we obtain a high precision prediction. Moreover, the dynamical system approach provides the global qualitative picture of the model's phase portrait, and allows us to discuss multidimensional patterns and long-term properties of the process. The simple limiting behavior allows us to associate different trends with different process's realization scenarios that can be influenced by externalities.
... The methodology is content analysis with a directed approach: data were gathered in November–December 2020. The paper argues Knowledge can usefully be analysed as a heterotopia because it exposes the contentious conditions of knowledge production, which is not standard practice for an encyclopaedia."
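The figures quoted from the paper (link counts by placement, and the monthly versus yearly cost estimate) can be sanity-checked with a few lines of arithmetic. The sketch below is only a consistency check on the published, rounded numbers; the variable names are illustrative and nothing here reproduces the authors' actual pipeline.

```python
# Link counts in millions, as quoted from the paper (already rounded there).
TOTAL_LINKS_M = 63.1
links_by_placement_m = {
    "references": 35.3,
    "article bodies": 24.9,
    "infoboxes": 2.8,
}
quoted_share_pct = {
    "references": 56.0,
    "article bodies": 39.5,
    "infoboxes": 4.5,
}

# Recompute each share from the rounded counts; small deviations from the
# quoted percentages are expected because the inputs are themselves rounded.
recomputed_share_pct = {
    placement: 100 * count / TOTAL_LINKS_M
    for placement, count in links_by_placement_m.items()
}

# Annualise the quoted monthly cost range (millions of US dollars).
monthly_cost_range_m = (7, 13)
yearly_cost_range_m = tuple(12 * m for m in monthly_cost_range_m)

for placement, share in recomputed_share_pct.items():
    print(f"{placement}: {share:.1f}% (quoted: {quoted_share_pct[placement]}%)")
print("yearly cost range (million $):", yearly_cost_range_m)
```

The recomputed shares agree with the quoted ones to within about a tenth of a percentage point, and the yearly range is simply twelve times the monthly one.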
I wonder if it changes when an archived page from the Wayback Machine is used? I fixed a rotted URL today regarding an external link to a building preservation group. The structure is being demolished, so preservation is moot, and while the group has disbanded, their domain redirects to a porn site.
As was commented in the article, the split was "Infoboxes", "References", "Other" (also called main body). As such, two ELs, one in the lede or in a section outside of a reference and one properly in an EL section, would be viewed identically. I would *love* to see the analysis done with an EL section split from "Other/main body".
The short click time of infobox links, however, seems to be due to their prominent position within articles: when approximately controlling for position by considering only article-body links in the top 20% of the page, the median click time dropped to 22.2 seconds, only 10% longer than for infobox links.
The researchers proceeded to analyze the CTR of these official infobox links in more detail, finding that it is "correlated strongly and negatively" with an article's length and popularity (number of pageviews), "possibly because longer articles, by offering more information, reduce the user's need
We demonstrate these ideas using the examples of the Knowledge's traffic of Readers, Contributors and Edits. First, we consider the two-dimensional model, predicting the traffic of Readers and Edits. Different trends (corresponding to different fixed points) can be associated with different
Yes - such a frustrating oversight! I have to say though, I'm actually surprised that the overall dollar value came out so low (basically negligible when spread across so many potential target organisations). I suppose it's actually good news, since it discourages manipulation by companies.
"The global median click time was 32.9 seconds (31.8 seconds for desktop, 34.4 seconds for mobile), with a much lower value for infobox links (18.7 seconds; 20.1 seconds for official links), and larger values for the article-body links (35.4 seconds) and reference links (51.8 seconds)."
Again analyzing by article topic, the researchers found that "clicks on official links to entertainment-related websites occurred faster, whereas links to websites on more classic encyclopedic topics, such as biographies, geographical content, history, etc., occurred more slowly."
"This paper analyses the online encyclopaedia Knowledge using Michel Foucault's (1926–1984) concept of heterotopia. In Foucault's writings, heterotopias are both similar to and distinct from the conditions that give rise to them. The paper undertakes a case study of one entry on Knowledge (the entry for the “Episteme”) focusing primarily on the main entry and the talk page
Focusing on infobox links, the authors train a classifier to distinguish "official" links, defined as "the official website of the entity described in the respective article", which made up 0.8% of the 63.1 million links studied and had an even higher CTR (2.47%).
, where that term excludes the separate "External links" section at the end of an article: in fact, the guidelines state that external links "normally should not be placed in the body of an article". As a consequence, the paper (unfortunately or perhaps fortunately) mostly does not provide information on whether external links that are placed higher up within the article text (in violation of the guidelines or exploiting one of their rare exceptions) may generate more traffic, apart from one partial result:
During the time period studied (one month in 2019), "English Knowledge generated 43M clicks to external websites, in roughly even parts via links in infoboxes, cited references, and article bodies". This corresponds to a much higher
If the above analysis is correct then having the link in Knowledge may have increased its value for the domain brokers. Or maybe it now goes down because a click on wikipedia no longer takes one to the porn site.
last year, examines how often external links on English Knowledge are clicked, and "also sheds new light on the poorly understood role Knowledge has as a provider not only of information, but also of economic wealth."
Utopias are a departure from the present but heterotopias both engage with and question the present by enacting an alternative, destabilising established practices and understandings in the process."
to gather additional information from external links, and because more popular articles are more likely to appear in shallower information-seeking sessions" according to previous research.
Other recent publications that could not be covered in time for this issue include the items listed below. Contributions, whether reviewing or summarizing newly published research, are always welcome.
"... we consider models constructed with the help of dynamical systems that have relatively simple limiting behavior. Switching between different trajectories of the phase portrait
Ah, I did not consider that ==External Links== would not be considered under References. Yes, it would indeed be interesting to see such an analysis.
Rayskin, Victoria (2020-01-27). "Dynamical systems' models for the prediction of multi-variable time series. Knowledge's traffic example". arXiv:1912.06939.
39.5% of the clicks coming through in-body external links is a bit concerning, no? I thought such hyperlinks were generally discouraged.
The authors use the term "article body" as a catch-all for every location outside infoboxes and footnotes. This is inconsistent with Knowledge's guidelines on external links
The study was based on internal data from a client-side instrumentation (originally gathered for a previous research publication that specifically focused on interactions with citation links), which captured reader clicks on three kinds of external links:
Next they examine how the CTR varies by article topic (while controlling for an article's length and popularity), finding
Patricia Grau Francitorra (Spring 2022). The linguistic structure of Knowledge. A multilingual analysis and comparison of the language used in Knowledge articles (PDF) (Master's Thesis). University of Gothenburg.
Knowledge as an example of Michel Foucault's "heterotopia" concept ("an alternative but not an idyllic alternative")
A monthly overview of recent academic research about Knowledge and other Wikimedia projects, also published as the Wikimedia Research Newsletter.
Furthermore, the study examined the "click time" (from opening an article to clicking an external link):
Syntactic similarities between Knowledge language versions as an encouraging sign for Abstract Knowledge
Predicting Knowledge's pageview, editor and edit numbers using a three-dimensional time series model
click-through rate (CTR) for infobox links (0.9%) than for article body links (0.14%) and reference links (0.03%).
As explained in the paper, "Heterotopias are an alternative but not an idyllic alternative. They are possible rather than imaginary.
"Official" external links on Knowledge generate $7-13 million worth of monthly traffic
Lastly, regarding the economic value of Knowledge for website owners, the paper asks
Piccardi, Tiziano; Redi, Miriam; Colavizza, Giovanni; West, Robert (2021-04-19). "On the Value of Knowledge as a Gateway to the Web". Proceedings of the Web Conference 2021. New York, NY, USA: Association for Computing Machinery. pp. 249–260. arXiv:2102.07385. doi:10.1145/3442381.3450136. ISBN 9781450383127.
A paper titled "On the Value of Knowledge as a Gateway to the Web", presented at The Web Conference
The dollar value of "official" external links: And other new research
561: 452: 430: 354: 370: 349:"This paper analyses the online encyclopaedia Knowledge using 481: 415: 736:If your comment has not appeared here, you can try 418:"On the Value of Knowledge as a Gateway to the Web" 895: 161: 71:The dollar value of "official" external links 491:(Master's Thesis). University of Gothenburg. 204:The study was based on internal data from 560: 482:Patricia Grau Francitorra (Spring 2022). 429: 287: 272: 254:Knowledge's guidelines on external links 739: 554: 508:New Review of Hypermedia and Multimedia 14: 896: 501: 422:Proceedings of the Web Conference 2021 54: 29: 904:Knowledge Signpost archives 2022-08 27: 599: 56: 34: 28: 915: 721:These comments are automatically 462: 176: 148: 138: 128: 118: 108: 98: 88: 852:putting together the next issue 815:00:18, 26 September 2022 (UTC) 732:add the page to your watchlist 548: 502:Flavin, Michael (2021-10-02). 495: 475: 409: 13: 1: 838:04:25, 2 September 2022 (UTC) 801:15:02, 4 September 2022 (UTC) 782:15:54, 1 September 2022 (UTC) 772:split from "Other/main body". 
763:10:28, 1 September 2022 (UTC) 528:10.1080/13614568.2022.2047800 384:From the abstract and paper: 210:previous research publication 206:a client-side instrumentation 185:Wikimedia Research Newsletter 707: 365:As explained in the paper, " 18:Knowledge:Knowledge Signpost 7: 208:(originally gathered for a 10: 920: 504:"Knowledge = Heterotopia" 307:Other recent publications 646:News from Wiki Education 440:10.1145/3442381.3450136 729:.Ā To follow comments, 604: 398: 363: 335: 301: 292: 286: 280:They also note that 277: 267: 250: 242: 218: 39: 603: 386: 347: 327: 297: 291: 282: 276: 262: 246: 238: 214: 38: 725:from this article's 592:"Recent research" ā†’ 520:2021NRvHM..27..324F 345:From the abstract: 325:From the abstract: 716:Discuss this story 605: 331:Abstract Knowledge 314:are always welcome 293: 278: 223:click-through rate 198:The Web Conference 45:ā† Back to Contents 40: 811: 798: 760: 740:purging the cache 701:From the archives 661:Technology report 641:Discussion report 584:"Recent research" 50:View Latest Issue 911: 888: 850:needs your help 836: 809: 791: 753: 743: 741: 735: 714: 671:Featured content 623: 615: 608: 591: 583: 567: 566: 564: 552: 546: 545: 499: 493: 492: 490: 479: 473: 467: 466: 460: 433: 413: 180: 166: 152: 151: 142: 141: 132: 131: 122: 121: 112: 111: 102: 101: 92: 91: 62: 60: 58: 919: 918: 914: 913: 912: 910: 909: 908: 894: 893: 892: 891: 890: 889: 884: 882: 877: 872: 867: 862: 855: 844: 843: 829: 797: 788:W. Tell DCCXLVI 759: 750:W. 
Tell DCCXLVI 745: 737: 730: 719: 718: 712:+ Add a comment 710: 706: 705: 704: 676:Recent research 656:Tips and tricks 616: 611: 609: 606: 595: 594: 589: 586: 581: 575: 574: 570: 553: 549: 500: 496: 488: 480: 476: 461: 449: 414: 410: 406: 382: 351:Michel Foucault 343: 323: 309: 194: 189: 181: 168: 167: 160: 159: 158: 149: 139: 129: 119: 109: 99: 89: 83: 80: 69: 68:Recent research 65: 63: 53: 52: 47: 41: 31: 26: 25: 24: 12: 11: 5: 917: 907: 906: 883: 878: 873: 868: 863: 858: 857: 856: 846: 845: 842: 841: 840: 824: 823: 822: 821: 820: 819: 818: 817: 793: 766: 765: 755: 720: 717: 709: 708: 703: 698: 693: 688: 683: 681:Traffic report 678: 673: 668: 663: 658: 653: 648: 643: 638: 633: 631:Special report 628: 626:News and notes 622: 613:31 August 2022 610: 598: 597: 596: 587: 578: 577: 576: 572: 569: 568: 547: 514:(4): 324ā€“338. 494: 474: 447: 407: 405: 402: 400: 390:phase portrait 381: 378: 376: 355:the main entry 342: 339: 337: 322: 319: 308: 305: 303: 193: 190: 175: 174: 172: 170: 169: 157: 156: 146: 136: 126: 116: 106: 96: 85: 84: 81: 75: 74: 73: 72: 67: 66: 64: 61: 57:31 August 2022 48: 43: 42: 33: 32: 15: 9: 6: 4: 3: 2: 916: 905: 902: 901: 899: 887: 881: 876: 871: 866: 861: 853: 849: 839: 835: 832: 826: 825: 816: 813: 804: 803: 802: 796: 789: 785: 784: 783: 779: 775: 770: 769: 768: 767: 764: 758: 751: 747: 746: 742: 733: 728: 724: 713: 702: 699: 697: 694: 692: 689: 687: 684: 682: 679: 677: 674: 672: 669: 667: 664: 662: 659: 657: 654: 652: 649: 647: 644: 642: 639: 637: 634: 632: 629: 627: 624: 620: 614: 607:In this issue 602: 593: 585: 573: 563: 558: 551: 543: 540: 536: 533: 529: 525: 521: 517: 513: 509: 505: 498: 487: 486: 478: 471: 465: 458: 454: 450: 448:9781450383127 445: 441: 437: 432: 427: 423: 419: 412: 408: 401: 397: 393: 391: 385: 377: 374: 372: 368: 362: 360: 359:the talk page 356: 352: 346: 338: 334: 332: 326: 318: 317: 315: 304: 300: 296: 290: 285: 281: 275: 271: 266: 261: 259: 255: 249: 245: 241: 237: 234: 230: 226: 224: 217: 213: 211: 207: 202: 199: 
188: 186: 179: 173: 165: 155: 147: 145: 137: 135: 127: 125: 117: 115: 107: 105: 97: 95: 87: 86: 78: 59: 51: 46: 37: 23: 19: 848:The Signpost 847: 808:T.Shafee(Evo 675: 636:In the media 619:allĀ comments 571: 550: 511: 507: 497: 484: 477: 421: 411: 399: 394: 387: 383: 375: 367:Heterotopias 364: 348: 344: 336: 328: 324: 311: 310: 302: 298: 294: 283: 279: 268: 263: 251: 247: 243: 239: 235: 231: 227: 219: 215: 203: 195: 182: 171: 164:Tilman Bayer 94:PDF download 886:Suggestions 723:transcluded 666:Serendipity 258:fortunately 144:X (Twitter) 562:1912.06939 457:Q109589191 431:2102.07385 404:References 82:Share this 77:Contribute 22:2022-08-31 880:Subscribe 727:talk page 542:247465312 535:1361-4568 898:Category 875:Newsroom 870:Archives 651:In focus 582:Previous 453:Wikidata 134:Facebook 124:LinkedIn 114:Mastodon 20:‎ | 834:Ribandā–ŗ 691:Gallery 516:Bibcode 371:Utopias 265:links. 774:Naraht 696:Humour 154:Reddit 104:E-mail 865:About 810:& 686:Essay 557:arXiv 539:S2CID 489:(PDF) 426:arXiv 16:< 860:Home 831:Blue 812:Evo) 778:talk 590:Next 532:ISSN 470:Code 444:ISBN 357:and 524:doi 436:doi 333:." 162:By 79:ā€” 900:: 799:) 780:) 761:) 580:ā† 537:. 530:. 522:. 512:27 510:. 506:. 468:. 451:. 442:. 434:. 420:. 854:. 795:c 792:/ 790:( 776:( 757:c 754:/ 752:( 744:. 734:. 621:) 617:( 565:. 559:: 544:. 526:: 518:: 472:. 459:. 455:: 438:: 428:: 316:. 187:.


Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.
