Wikipedia Discussion Corpora

For the Simple English Wikipedia Discussion Corpus, please cite:
Behind the Article: Recognizing Dialog Acts in Wikipedia Talk Pages
Oliver Ferschke and Iryna Gurevych and Yevgen Chebotar
In:  Proceedings of the 13th Conference of the European Chapter of the ACL (EACL 2012), (to appear), April 2012. Avignon, France.

For the English Wikipedia Discussion Corpus, please cite:
The Quality of Content in Open Online Collaboration Platforms: Approaches to NLP-supported Information Quality Management in Wikipedia
Oliver Ferschke. PhD Thesis. July 2014, Technische Universität Darmstadt.

Resource Download

  • Simple English Wikipedia Discussion Corpus (MMAX)
  • Simple English Wikipedia Discussion Corpus (XMI)
  • English Wikipedia Discussion Corpus (.zip containing MMAX, XMI, and .csv)

The corpus is based on Wikipedia discussions, and therefore available under the Creative Commons Attribution/Share-Alike License (CC-BY-SA)

 In case of questions, please contact Oliver Ferschke.

A A A | Drucken Print | Impressum Impressum | Sitemap Sitemap | Suche Search | Kontakt Contact | Webseitenanalyse: Mehr Informationen
zum Seitenanfangzum Seitenanfang