NLP for Wikipedia (EMNLP 2024)

WikiNLP: Advancing Natural Language Processing for Wikipedia
Co-located with EMNLP 2024


This is the website for the Advancing Natural Language Processing for Wikipedia workshop, which will take place on the 16th of November 2024 as part of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP). The workshop will be held in Miami, Florida, USA, as a hybrid event in which we aim to facilitate online participation. Here you can find all important dates, the call for papers, and (eventually) the list of invited speakers and the program of the workshop.

Motivation

A space both to celebrate Wikimedia's contributions to the NLP community and highlight approaches to ensuring the sustainability of this relationship for years to come.

Wikipedia is a uniquely important resource for the NLP community: it is multilingual, can be freely reused under its open license, and is edited and maintained by a dedicated community of editors whose work has earned it its status as a very high-quality dataset for many applications. With this value come many tensions, however:

  • Despite Wikipedia's presence in over 300 language editions, much focus in language modeling remains on the high-resource languages;
  • Despite the openness of Wikipedia and its role in many advances in natural language modeling, there are concerns that some of these advances, such as generative text models, could undermine Wikipedia and threaten its sustainability as a community and ultimately as a data resource;
  • Despite the heavy usage of Wikimedia data among the NLP community, few researchers work on developing tools that can contribute back to the Wikimedia community.

We will invite researchers to contribute novel uses of Wikimedia data or studies of the impact of Wikimedia data within the NLP community. We will also discuss successful approaches to developing tooling that can assist the Wikimedia community in maintaining and improving the breadth of the Wikimedia projects.

Important Dates

Organizers

Program Chairs

Contact: nlp4wikipedia[@]googlegroups.com

Reviewers

Many thanks to the following reviewers:

  • Saied Alshahrani
  • Pablo Aragón
  • Hiba Arnaout
  • Arnav Arora
  • Bonaventure F. P. Dossou
  • Srihari Jayakumar
  • Kartik Mathur
  • Jeanna Matthews
  • Tiziano Piccardi
  • Miriam Redi
  • Marija Sakota
  • Sina Semnani
  • Indira Sen
  • Diego Sáez Trumper
  • Harold Triedman
  • Mykola Trokhymovych
  • Houcemeddine Turki
  • Thejas Venkatesh