DKPro

DKPro is a collection of software components for natural language processing (NLP) based on the  Apache UIMA framework.

Many powerful and state-of-the-art NLP components are already freely available in the NLP research community. New and improved components are being developed and released continuously. The components cover the whole range of NLP-related processing tasks. DKPro provides wrappers for such third-party tool as well as original NLP components. DKPro builds heavily on  uimaFIT which allows for rapid and easy development of NLP processing pipelines.

As part of the DKPro effort NLP components for a number of different application domains are developed including basic pre-processing, information retrieval and semantic text processing. 

Downloads

Please download  DKPro Core ASL and  DKPro Core GPL from their Google Code project pages.

Old versions of DKPro Core can be found here.

Team

The principal investigator is Prof. Dr. Iryna Gurevych.

Richard Eckart de Castilho is currently the technical lead.

DKPro is a shared project of all UKP to which all group members contribute.

Awards

The UKP group received two IBM's 2008 Unstructured Information Analytics (UIA) Awards for their DKPro proposals! Click here for the associated press release (German only).

Project Publications

Displaying results 1 to 4 out of 4

DKPro-UGD: A Flexible Data-Cleansing Approach to Processing User-Generated Discourse
Richard Eckart de Castilho and Iryna Gurevych
In: Online-proceedings of the First French-speaking meeting around the framework Apache UIMA, July 2009.
http://e.nicolas.hernandez.free.fr/pub/rec/09/RMLL-cfp-en.html.

Information Extraction with the Darmstadt Knowledge Processing Software Repository (Extended Abstract)
Iryna Gurevych and Mark-Christoph Müller
In: Proceedings of the Workshop on Linguistic Processing Pipelines, July 2008.

Darmstadt Knowledge Processing Repository Based on UIMA
Iryna Gurevych, Max Mühlhäuser, Christof Müller, Jürgen Steimle, Markus Weimer, Torsten Zesch
In: Proceedings of the First Workshop on Unstructured Information Management Architecture at Biannual Conference of the Society for Computational Linguistics and Language Technology, April 2007.

Teaching "Unstructured Information Management: Theory and Applications" to Computational Linguistics Students
Iryna Gurevych, Christof Müller, Torsten Zesch
In: Proceedings of the First Workshop on Unstructured Information Management Architecture at Biannual Conference of the Society for Computational Linguistics and Language Technology, April 2007.

A A A | Drucken Print | Impressum Impressum | Sitemap Sitemap | Suche Search | Kontakt Contact
zum Seitenanfangzum Seitenanfang