Wikipedia:Wikipedia Signpost/2022-02-27/Recent research

Recent research

How editors and readers may be emotionally affected by disasters and terrorist attacks


A monthly overview of recent academic research about Wikipedia and other Wikimedia projects, also published as the Wikimedia Research Newsletter.


How earthquakes and terrorist attacks may elicit anger and sadness among Wikipedia editors and readers

In a study titled "Emotions in Wikipedia: the role of intended negative events in the expression of sadness and anger in online peer production", four Germany-based researchers (three among them from the field of psychology) argue that while "Wikipedia explicitly strives to provide objective and neutral information in unbiased language [, ...] Wikipedia articles might still contain subtle expressions of emotions from the experiences of their authors."

Specifically, the authors

The automated linguistic analysis method is Linguistic Inquiry and Word Count (LIWC), a tool developed in the 1990s by Pennebaker and others. LIWC has been widely used and vetted, but nowadays exists alongside more sophisticated sentiment analysis methods – a fact that the paper's limitations section coyly alludes to. The researchers used LIWC to detect the percentage of an article's words that match "three specific negative emotion categories [...]: sadness (e.g. 'loss', 'sorrow', or 'grief'), anger (e.g. 'offensive', 'brutal', or 'violent'), and anxiety (e.g. 'panic', 'afraid', or 'scared')." They illustrate them with the following examples:

  • Sadness-related words: "The Haitian art world suffered great losses" (from 2010 Haiti earthquake)
  • Anger-related words: "He had a history of violence, including an arrest in July 2009 for assaulting his girlfriend" (from Boston Marathon bombing)
  • Anxiety-related words: "... which sparked fears in the scientific community of massive numbers of fish dying" (from Hurricane Katrina)

To avoid confounds, the text of references and external links was excluded from this analysis. Also, the authors "did not include the word 'attack' within the anger category or the word 'terror' within the anxiety category in all analyses to avoid the possibility that the topic of the articles could be a confounding factor in our analyses". However, they appear to have made no attempt otherwise to distinguish emotions that are expressed directly in the text (in what editors call "Wikipedia voice") from emotions that are merely reported and attributed to others (such as the scientific community's "fears" in the Hurricane Katrina example, or quoted reactions from politicians etc.).

The authors suggest that online peer production systems such as Wikipedia

The paper extends and replicates results from a 2017 publication by the same authors (which had also examined article talk pages, finding that "Surprisingly, Wikipedia articles on those two [types of] events contained more emotional content than related Wikipedia talk pages").

Having "demonstrated that Wikipedia articles on terrorist attacks contained more anger-related content than Wikipedia articles on earthquakes", two of the authors replicate and extend this result by directly measuring the emotional reactions of Wikipedia readers in a more recent study. Specifically,

The researchers conclude


Briefly

  • See the page of the monthly Wikimedia Research Showcase for videos and slides of past presentations.
  • The deadline for paper submissions to the "Wiki-M3L" workshop has been extended to March 9. The event is intended to be "a space for the Wikipedia community and the multimodal & multilingual research community to share and support each other."


Other recent publications

Other recent publications that could not be covered in time for this issue include the items listed below. Contributions, whether reviewing or summarizing newly published research, are always welcome.

"Multilingual Entity Linking System for Wikipedia with a Machine-in-the-Loop Approach"

This paper describe the development of a machine learning model used in the "Add a link" task suggestion feature for new Wikipedia editors (deployed by the Wikimedia Foundation's "Growth" team on several language Wikipedias last year). From the abstract:

Screenshots from the user evaluation of the link recommendation system

(See also: research project page on Meta-wiki)


"Wikipedia Entities as Rendezvous across Languages: Grounding Multilingual Language Models by Predicting Wikipedia Hyperlinks"

From the paper and abstract:


From the abstract:

From the abstract and paper:

(See also: Research project page on Meta-wiki)

References

Uses material from the Wikipedia article Wikipedia:Wikipedia Signpost/2022-02-27/Recent research, released under the CC BY-SA 4.0 license.