Software

At UKP we release the software we develop as a service to the academic community. Most of the software listed below forms part of the Darmstadt Knowledge Processing Repository (DKPro), an umbrella initiative for reusable software components and language resources.

  • CSniper is a search-based annotation tool to help distributed annotation teams finding infrequent linguistic phenomena in large corpora.
  • DKPro Core provides a set of ready to use software components for natural language processing, based on the Apache UIMA framework.
  • DKPro Lab is a lightweight framework for parameter sweeping experiments. It allows you to set up experiments consisting of multiple interdependent tasks in a declarative manner with minimal overhead.
  • DKPro LSR (Lexical Semantic Resources) is a unified API for several lexical-semantic resources.
  • DKPro Similarity is an open source software package for developing text similarity algorithms.
  • DKPro Spelling includes components for real-word spelling error correction and experimental frameworks for mining such errors from the Wikipedia revision history as well as for the "Helping Our Own" shared tasks 2011 and 2012.
  • DKPro Statistics is a collection of open-licensed statistical tools, currently including correlation and inter-rater agreement methods.
  • DKPro TC (Text Classification) is a UIMA-based text classification framework built on top of DKPro Core, DKPro Lab and the Weka Machine Learning Toolkit. It is intended to alleviate supervised machine learning experiments with any kind of textual data.
  • DKPro WSD is a modular, extensible Java framework for word sense disambiguation.
  • JOWKL (Java OmegaWiki Library) is an open-source, Java-based application programming interface that allows to access all information contained in OmegaWiki, such as glosses, usage examples, translations and much more.
  • JWKTL (Java Wiktionary Library) is a free, Java-based application programming interface that allows to access the information contained in Wiktionary.
  • JWPL (Java Wikipedia Library) is a free, Java-based application programming interface that allows to access all information contained in Wikipedia.
  • UBY-API is a free, Java-based application programming interface that allows to access all information contained in UBY, a large-scale lexical-semantic resource for NLP based on ISO LMF.
  • WebAnno is a general purpose web-based annotation tool for a wide range of linguistic annotations.

In addition, current (and past) members of our group have contributed to the following free and open source community projects:

  •  Apache UIMA™ is a framework for building analysis pipelines for unstructured information, such as text, video, or audio data.
  •  Apache uimaFIT™ is a simplified API for UIMA, facilitating the programmatic assembly of processing pipelines, the building of analysis components, and maintenance of component meta data. It is a great asset for the building and running research experiments and a fundamental building block of many UIMA-based DKPro products, particularly DKPro Core. This project was formerly known simply as  uimaFIT.
  •  jWeb1T is a Java tool for efficiently searching n-gram data in the Web 1T 5-gram corpus format.
  •  TreeTagger for Java is a Java wrapper for the popular  TreeTagger package for part-of-speech tagging and chunking by Helmut Schmid.
  • JoBimText provides a software solution for automatic text expansion using contextualized distributional similarity.
  • TWSI Sense Substituter produces lexical substitutions in context for over 1000 frequent English nouns.
A A A | Drucken Print | Impressum Impressum | Sitemap Sitemap | Suche Search | Kontakt Contact
zum Seitenanfangzum Seitenanfang