Overview

Under the heading of Language Technology for Digital Humanities, UKP Lab conducts projects at the boundary between Natural Language Processing, Computer Science on the one hand, and Humanities, Social Sciences, and Educational Research on the other hand. In particular, we work on making digital analysis methods more accessible to text-based humanities, implement tools to explore and annotate text corpora, and contribute to the infrastructures supporting Digital Humanities. Our research interests in this area include:

  • Creating user-friendly tools to explore and annotate text corpora

  • Analyzing corpora at the semantic level, e.g. opinion mining or identifying metaphoric language

  • Processing and analyzing historical texts

  • Interoperability with Digital Humanities infrastructures such as DARIAH and CLARIN

Current Projects

  • DKPro: At UKP, we believe in supporting reproducible NLP research through re-usable and freely available software components. To this end, UKP created the award-winning DKPro repository of open-source software covering many aspects of NLP from pre-processing, lexical resource, machine-learning, to semantic analysis. As DKPro is growing and gaining popularity, it now starts evolving into a community project in which UKP collaborates e.g. with researchers from the University of Duisburg-Essen.

  • CLARIN F-AG7 KP 3: In association with the CLARIN project, we are building the flexible, web-based annotation tool WebAnno and apply it to the annotation of non-standard varieties of German at the semantic level. This work is done in collaboration with the Language Technology Group in Darmstadt and with researchers from the University of Heidelberg.

  • CEDIFOR: In this context , we aim to foster interdisciplinary work between Computer Science and Digital Humanities by providing know-how and research infrastructures for text analytics to humanities researchers in the Rhein-Main area, supporting them in their investigation of novel research questions. This project is conducted in collaboration with the Goethe University Frankfurt am Main and the German Institute for International Educational Research (DIPF).

  • DARIAH-DE II: The mission of the EU-ESFRI-Project DARIAH-EU is to enhance and support digitally-enabled research across the arts and humanities. In the context of the second phase of the German contribution DARIAH-DE, UKP collaborates closely with researchers from the Julius Maximilians University of Würzburg to automatically detect and analyze narrative structures in German. These techniques are applied to a corpus of around 2.000 novels, which were written over the last centuries.

  • Welt der Kinder: The digital humanities project “Welt der Kinder” is designed as a test model for future similar projects in historical sciences. By very close cooperation between historians, information scientists, and computer scientists, it aims to gain new insights about the way the world was conveyed to children in a period from 1850 until 1918 - a time in of accelerated production of knowledge that was equally dominated by globalization and nationalisation.
  • Processing of audiovisual content: The amount of audiovisual content is constantly increasing, specially in the educational domain, making tasks like transcription and visual analysis a very cumbersome activity for humanistic researchers. This project aims to create technology which facilitates the integration of manual and automatic analysis of audiovisual content.
  • OpenMinTeD: OpenMinTeD aspires to enable the creation of an infrastructure that fosters and facilitates the discovery and use of text mining technologies and interoperable services. It examines several use cases identified by experts from different scientific areas, ranging from generic scholarly communication to literature related to life sciences, food and agriculture, and social sciences and humanities.

Past Projects

  • DARIAH-DE I: The mission of the EU-ESFRI-Project DARIAH-EU is to enhance and support digitally-enabled research across the arts and humanities. In the first phase of the German contribution DARIAH-DE, UKP investigated possibilities of using the emerging DARIAH infrastructure by means of the use-case of setting up a digital archive and by means of integrating DARIAH and TextGrid services.

  • CLARIN F-AG7 KP 1: In association with the CLARIN project, we developed the flexible web-based annotation tool WebAnno. The tool supports visual annotation of multiple linguistic layers, including custom defined layers. It is interoperable with CLARIN infrastructures such as WebLicht. The tool has been developed in closed cooperation with the CLARIN F-AG7 KP 2 project, which defines “best practices” for linguistic annotation on several language layers for different annotator status groups. This work has been done in collaboration with the Language Technology Group in Darmstadt.

  • LOEWE Research Center “Digital Humanities” TP 2.2 “Text as an Instance”: In this project, UKP collaborated very closely with linguists and computational linguists on the comparative analysis of non-canonical grammatical constructions in German and English. Due to the infrequence and ambiguity of such constructions, a dedicated analysis process and supporting tools needed to be developed for annotation. The result of this is the CSniper annotation tool that combines collaborative search and annotation into a user-friendly tool. This project has been conducted with researchers from the Department of Linguistics and Literature in Darmstadt as well as from the Goethe University in Frankfurt am Main.

  • LOEWE Research Center “Digital Humanities” TP 2.3 “Text as a Process”: This project analyzed the linguistic properties of collaboratively created text in the Web 2.0. For more details, please refer to the respective section in the Text Analytics area description.

Completed PhD Theses

Publications

Additional Attributes

Type

Sense-annotating a lexical substitution data set with Ubyline

Tristan Miller, Mohamed Khemakhem, Richard Eckart de Castilho, Iryna Gurevych
In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), p. 828--835, May 2016
European Language Resources Association (ELRA)
[Online-Edition: https://www.ukp.tu-darmstadt.de/data/sense-labelling-resources/glass/]
[Inproceedings]

High Performance Word Sense Alignment by Joint Modeling of Sense Distance and Gloss Similarity

Michael Matuschek, Iryna Gurevych
In: Proceedings of the the 25th International Conference on Computational Linguistics (COLING 2014), p. 245-256, August 2014
Dublin City University and Association for Computational Linguistics
[Online-Edition: http://www.aclweb.org/anthology/C14-1025]
[Inproceedings]

A broad-coverage collection of portable NLP components for building shareable analysis pipelines

Richard Eckart de Castilho, Iryna Gurevych
In: Proceedings of the Workshop on Open Infrastructures and Analysis Frameworks for HLT (OIAF4HLT) at COLING 2014, p. 1--11, August 2014
Association for Computational Linguistics and Dublin City University
[Online-Edition: https://dkpro.github.io/dkpro-core/]
[Inproceedings]

DKPro TC: A Java-based Framework for Supervised Learning Experiments on Textual Data

Johannes Daxenberger, Oliver Ferschke, Iryna Gurevych, Torsten Zesch
In: Proceedings of 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations, p. 61-66, June 2014
Association for Computational Linguistics
[Online-Edition: https://github.com/dkpro/dkpro-tc]
[Inproceedings]

WebAnno: a flexible, web-based annotation tool for CLARIN

Richard Eckart de Castilho, Chris Biemann, Iryna Gurevych, Seid Muhie Yimam
In: Proceedings of the CLARIN Annual Conference (CAC) 2014, p. online, 2014
CLARIN ERIC
[Online-Edition: https://webanno.github.io/webanno/]
[Inproceedings]

The Impact of Topic Bias on Quality Flaw Prediction in Wikipedia

Oliver Ferschke, Iryna Gurevych, Marc Rittberger
In: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL 2013), Vol. 1, p. 721--730, August 2013
Association for Computational Linguistics
[Online-Edition: http://www.ukp.tu-darmstadt.de/data/wiki-flaws/]
[Inproceedings]

WebAnno: A Flexible,Web-based and Visually Supported System for Distributed Annotations

Seid Muhie Yimam, Iryna Gurevych, Richard Eckart de Castilho, Chris Biemann
In: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (System Demonstrations) (ACL 2013), p. 1-6, August 2013
Association for Computational Linguistics
[Online-Edition: https://webanno.github.io/webanno/]
[Inproceedings]

Behind the Article: Recognizing Dialog Acts in Wikipedia Talk Pages

Oliver Ferschke, Iryna Gurevych, Yevgen Chebotar
In: Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2012), p. 777--786, April 2012
[Online-Edition: http://aclweb.org/anthology/E/E12/E12-1079.pdf]
[Inproceedings]

CSniper - Annotation-by-query for non-canonical constructions in large corpora

Richard Eckart de Castilho, Sabine Bartsch, Iryna Gurevych
In: Proceedings of the 50th Meeting of the Association for Computational Linguistics (ACL) 2012 (Demo section), p. 85-90, 2012
Association for Computational Linguistics
[Online-Edition: https://dkpro.github.io/dkpro-csniper/]
[Inproceedings]
A A A | Drucken Print | Impressum Impressum | Sitemap Sitemap | Suche Search | Kontakt Contact | Webseitenanalyse: Mehr Informationen
zum Seitenanfangzum Seitenanfang