Wikipedia:Wikipedia Signpost/2019-03-31/Recent research

Recent research

Barnstar-like awards increase new editor retention

A monthly overview of recent academic research about Wikipedia and other Wikimedia projects, also published as the Wikimedia Research Newsletter.

"Purely symbolic" barnstar-like awards increase retention of new editors on German Wikipedia

We hereby present
[User Name]
with the award

Edelweiss with Star
of the Portal Switzerland
for contributions to the German language Wikipedia.
sgd. The Project Edelweiss-Award


Example award (author's translation, from the paper)

New editors who received the award (right) were 20% more likely to remain active during the following month, compared to the control group who didn't receive it

In a large-scale randomized experiment on the German Wikipedia,[1] new editors who were presented with a barnstar-like award on their user talk page were 20% more likely to remain active during the following month. This statistically significant increase in the number of users coming back to contribute more persisted for a full year (four quarters). The effect also appeared when only considering article (mainspace) edits.

The "Edelweiss-Auszeichnung" (Edelweiss with Star) was awarded in a monthly process. All users who had just made their first article edit and at least one other edit, with at least five days between their first and last edit, were considered initially eligible for the award. This was followed by a semi-automated screening process, "developed in consultation with experienced community members", to remove e.g. blocked users, corporate accounts and "advertisers". Apart from this, the award (in its lowest level) was not based on an assessment of the quality of the user's contributions. Its "description does not contain any explicit performance criteria for getting the award, other than that the editors have made their first contributions to the German language Wikipedia in the previous month; it is mentioned that there were more than 4,000 newcomers as potential candidates in a given month." The award was handed out by the author, using a role account, to around 150 users per month. She notes that

"... randomly bestowing awards seems to be an almost impossible endeavor, because awards are designed to be given to individuals who excel in their tasks. However, this experiment shows that it can succeed if two important conditions are fulfilled. First, a basic preselection has to exclude obviously undeserving candidates, such as vandals. Second, subjects who by chance do not receive the award should be an unidentifiable group who ideally are ignorant of the award’s existence."

The screening process seems to have been reasonably effective in weeding out bad-faith contributors, with only 2% of the awarded users and 3% of the control group having been blocked after more than two years.

The paper also emphasizes that close coordination with the editor community, and the attachment to a thematic portal (Portal Switzerland, similar to a WikiProject on the English Wikipedia) were important to the award's success:

"... practitioners' endorsement is most likely to be vital for any such endeavor. The backing and trust of several highly reputable community members were central to this experiment. These contacts were established via telephone calls, which were followed up by regular roundtable meetings with a group of editors willing to tackle the retention problem with the help of the experiment. They became official founding members of the project, which was thus institutionalized under the umbrella of the Swiss national Wikipedia portal, providing the award with considerable repute and a formal character ..."

In contrast, a team of Carnegie Mellon researchers recently withdrew a similar research project proposal on the English Wikipedia due to community opposition. See previous coverage from The Signpost.

See also our earlier coverage of related research: "A Preliminary Study on the Effects of Barnstars on Wikipedia Editing", "Recognition may sustain user participation"

Conferences and events

See the research events page on Meta-wiki for upcoming conferences and events, including submission deadlines, and the page of the monthly Wikimedia Research Showcase for videos and slides of past presentations.

Other recent publications

Other recent publications that could not be covered in time for this issue include the items listed below. Contributions, whether reviewing or summarizing newly published research, are always welcome.

"A Historical Perspective on Information Systems: A Tool and Methodology for Studying the Evolution of Social Representations on Wikipedia"

"... we draw on the theory of social representation to build an analytical tool, WikiGen ["Wikipedia Genealogy Generator", available at http://wikigen.org/ ], and develop a methodology for examining the evolution of collective knowledge on Wikipedia. We demonstrate the usefulness of the tool and methodology by applying it to an illustrative case study, the Wikipedia article on cloud computing." (from the abstract[2])

Simulation find that admins and instant reverts are the key to Wikipedia's reliability

"The surprisingly high reliability of Wikipedia has often been seen as a beneficial effect of the aggregation of diverse contributors, or as an instance of the wisdom of crowds phenomenon; additional factors such as elite contributors, Wikipedia's policy or its administration have also been mentioned. We adjudicate between such explanations by modelling and simulating the evolution of a Wikipedia entry. The main threat to Wikipedia's reliability, namely the presence of epistemically disruptive agents such as disinformers and trolls, turns out to be offset only by a combination of factors: Wikipedia's administration and the possibility to instantly revert entries, both of which are insufficient when considered in isolation." (from the abstract[3])

Editing persistently is fun, but editing a lot is not

"We combine motivational data from two surveys of Wikipedia newcomers with data of two periods of editing activity. We find that persistence in editing is related to fun, while the amount of editing is not: individuals who persist in editing are characterized by higher fun motives early on (when compared to dropouts), though their motives are not related to the number of edits made. Moreover, we found that newcomers' experience of fun was reinforced by their amount of activity over time: editors who were initially motivated by fun entered a virtuous cycle, whereas those who initially had low fun motives entered a vicious cycle." (from the abstract[4])

See also earlier coverage of a related paper by some of the same authors: "Emergent Role Behaviours in Wikipedia – The 'How' and 'Why'".

"Can deep learning techniques improve classification performance of vandalism detection in Wikipedia?"

"... we study the applicability of a leading technology as deep learning to the problem of vandalism detection. The first set is obtained by expanding a list of vandal terms taking advantage of the existing semantic-similarity relations in word embeddings and deep neural networks. Deep learning techniques are applied to the second set of features [...]. The last set uses graph-based ranking algorithms to generate a list of vandal terms from a vandalism corpus extracted from Wikipedia. These three sets of new features are evaluated separately as well as together to study their complementarity, improving the results in the state of the art." (from the abstract[5])

"Improving New Editor Retention on Wikipedia"

"...we model whether a new user will become an established member of the community based on their initial activity. ... we are primarily interested in determining positive and negative impacts to new user retention." (From the abstract[6])

"'Anonymous calling': The WikiScanner scandals and anonymity on the Japanese Wikipedia"

"The Wikiscanner tool, which traced the origin of edits on Wikipedia, stirred media scandals throughout the world. Relying on a 'trace ethnography' method, following the discussion on Wikipedia articles, this article deals with the Japanese edition reaction to the scandals. I argue that this reaction represents a unique form of online publicity that facilitates anonymous normative discussion. In addition [...], the article contends that Wikipedia enables a rare model of anonymous public debate which bridges earlier Japanese conceptions of anonymity and publicity." (from the abstract[7])

"Feature Analysis for Assessing the Quality of Wikipedia Articles through Supervised Classification"

"... the problem of automatically assessing the quality of Wikipedia articles is considered. In particular, the focus is on the analysis of hand-crafted features that can be employed by supervised machine learning techniques to perform the classification of Wikipedia articles on qualitative bases. [... This approach] produced encouraging results with respect to the considered features." (from the abstract[8])

"Towards Compiling Textbooks from Wikipedia"

"we explore challenges in compiling a pedagogic resource like a textbook on a given topic from relevant Wikipedia articles, and present an approach towards assisting humans in this task. We present an algorithm that attempts to suggest the textbook structure from Wikipedia based on a set of seed concepts (chapters) provided by the user. We also conceptualize a decision support system where users can interact with the proposed structure and the corresponding Wikipedia content to improve its pedagogic value. The proposed algorithm is implemented and evaluated against the outline of online textbooks on five different subjects. We also propose a measure to quantify the pedagogic value of the suggested textbook structure." (from the abstract[9])

"... we propose relational event models to analyze dynamic network effects explaining the allocation of contributor attention to Wikipedia articles about migration-related topics. Among others, we test for the presence of a rich-get-richer effect in which articles edited by many users are likely to receive even more contributions in the future and uncover which users start working on less popular articles. We further analyze local clustering effects in which pairs of users tend to repeatedly collaborate on the same articles ..." (from the abstract[10])

References

  1. ^ Gallus, Jana (2016-09-30). "Fostering Public Good Contributions with Symbolic Awards: A Large-Scale Natural Field Experiment at Wikipedia". Management Science. 63 (12): 3999–4015. doi:10.1287/mnsc.2016.2540.
  2. ^ Gal, Uri; Riemer, Kai; Chasin, Friedrich (2018-12-01). "A Historical Perspective on Information Systems: A Tool and Methodology for Studying the Evolution of Social Representations on Wikipedia". Communications of the Association for Information Systems. 43 (1): 711–750. doi:10.17705/1CAIS.04337.
  3. ^ Lageard, Valentin; Paternotte, Cédric (2018-11-29). "Trolls, bans and reverts: simulating Wikipedia" (PDF). Synthese. 198: 451–470. doi:10.1007/s11229-018-02029-0. Closed access icon
  4. ^ Balestra, Martina; Zalmanson, Lior; Cheshire, Coye; Arazy, Ofer; Nov, Oded (2017). "It was Fun, but Did it Last?". Proceedings of the ACM on Human-Computer Interaction. 1: 1–13. doi:10.1145/3134656. Closed access icon Author's copy
  5. ^ Martinez-Rico, Juan R.; Martinez-Romo, Juan; Araujo, Lourdes (2019-02-01). "Can deep learning techniques improve classification performance of vandalism detection in Wikipedia?". Engineering Applications of Artificial Intelligence. 78: 248–259. doi:10.1016/j.engappai.2018.11.012. Closed access icon
  6. ^ Hollenbeck, Jonathan; Miyaguchi, Anthony. "Improving New Editor Retention on Wikipedia" (PDF) (Student project report). Stanford University. (Code.)
  7. ^ Reis, Omri (2018-12-01). ""Anonymous calling": The WikiScanner scandals and anonymity on the Japanese Wikipedia". First Monday. 23 (12). doi:10.5210/fm.v23i12.9184. ISSN 1396-0466.
  8. ^ Bassani, Elias; Viviani, Marco (2018-12-06). "Feature Analysis for Assessing the Quality of Wikipedia Articles through Supervised Classification". arXiv:1812.02655 [cs.CL].
  9. ^ Mathew, Ditty; Chakraborti, Sutanu (2018). "Towards Compiling Textbooks from Wikipedia". In Tanja Mitrovic; Bing Xue; Xiaodong Li (eds.). AI 2018: Advances in Artificial Intelligence. Lecture Notes in Computer Science. Springer International Publishing. pp. 828–842. doi:10.1007/978-3-030-03991-2_75. ISBN 9783030039912. Closed access icon
  10. ^ Lerner, Jürgen; Lomi, Alessandro (2019). "Let's Talk About Refugees: Network Effects Drive Contributor Attention to Wikipedia Articles About Migration-Related Topics" (PDF). In Luca Maria Aiello; Chantal Cherifi; Hocine Cherifi; Renaud Lambiotte; Pietro Lió; Luis M. Rocha (eds.). Complex Networks and Their Applications VII. Studies in Computational Intelligence. Springer International Publishing. pp. 211–222. doi:10.1007/978-3-030-05414-4_17. ISBN 9783030054144. Closed access icon