Graph-based methods for NLP

The seminar takes place Thursdays, 14:15 - 15:45 in S2|02 room A126.

Description

Graphs have become more and more popular in the area of knowledge processing during the last years. In particular, if the underlying resources already contain a graph structure, such as the Wikipedia link graph, for example, that connects related concepts, it seems natural to make use of this structure by means of graph algorithms. Otherwise, graph structures have to be established first by modeling the problem in an appropriate way. Nevertheless, graphs grant some obvious benefits from an algorithmic perspective: there are efficient algorithms for a wide variety of problems, that take into account both, local as well as global relations such as neighborhoods of entities.

The seminar provides detailed coverage of current graph approaches in natural language processing and information retrieval, their strengths and limitations, and current research directions by including recent research papers. In the course of the seminar, students will acquire key skills like the fundamentals in academic research and scientific writing, and they will be encouraged to improve their presentation skills.

Applications include but are not limited to:

  • Word Sense Disambiguation
  • Text Similarity
  • Sentiment Analysis / Opinion Mining
  • Question Answering
  • Word Sense Induction
  • Named Entities
  • Wikipedia discussion / revisions
  • Text Quality Assesment
  • Cross-lingual retrieval

Methods include but are not limited to:

  • Graph Traversal Algorithms
  • Shortest Paths Algorithms
  • Graph Clustering Algorithms
  • Matching / Assignment Algorithms
  • Heuristics, Random Walks, etc.

Literature

Jurafsky, Daniel, and James H. Martin (2009). Speech and Language Processing: An Introduction to Natural Language Processing, Speech Recognition, and Computational Linguistics. 2nd edition. Prentice-Hall.

For each topic, current research papers will be discussed in class.

Prerequisites and Preparation

Every student should have the knowledge of the introductory chapters of Speech and Language Processing: An Introduction to Natural Language Processing, Speech Recognition, and Computational Linguistics or comparable. It is expected that the texts have been thoroughly worked through by the fourth session at the latest. Knowledge in graph algorithms (for example the lecture "Efficient Graph Algorithms" held in the winter term) are useful but not mandatory.

Expectations

Each student is expected to

  • give a talk in class and answer some questions afterwards
  • write a term paper
  • show active participation in class

Materials and Forum

Access to course materials will be provided in the first seminar session on 12th of April. The introductory slides can be found here:

Intro

Natrural language graphs

Graph algorithms in NLP

For general advice on presenting your topic, please have a look at these guidelines and these helpful instructions.

Lecturers

Tucan number for registration: 20-00-0596-se

Timetable

-----------------------

----------------------------

---------------------------------------------------------------------------------------------------------------------

12th April 2012

Wolfgang Stille

Introduction and topic assignment

19th April 2012

Chris Biemann

Natural Language Graphs

26th April 2012

Wolfgang Stille

Graph Algorithms

3rd May 2012

Thanh Tung Dang

Esuli, A. and Sebastiani, F. (2007). Pageranking wordnet synsets: An application to opinion mining. In Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, pages 424–431, Prague, Czech Republic. Association for Computational Linguistics.

 

Christof Gräber

Laniado, D., Tasso, R., Volkovich, Y., and Kaltenbrunner, A. (2011). When the wikipedians talk: Network and tree structure of wikipedia discussion pages. In Proceedings of the Fifth International Conference on Weblogs and Social Media, Barcelona, Catalonia, Spain, July 17-21, 2011.

10th May 2012

Alexander Gabriel

Niemann, E. and Gurevych, I. (2011). The People’s Web meets Linguistic Knowledge: Automatic Sense Alignment of Wikipedia and WordNet. In Proceedings of the 9th International Conference on Computational Semantics, p. 205--214, January 2011.

17th May 2012

Public holiday

 

24th May 2012

Christian Brückner

Hoffart, J., Yosef, M. A., Bordino, I., Fürstenau, H., Pinkal, M., Spaniol, M., Taneva, B., Thater, S., and Weikum, G. (2011). Robust disambiguation of named entities in text. In Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, United Kingdom 2011, pages 782–792.

31st May 2012

Andreas Zimpfer

Mihalcea, R. (2005). Unsupervised large-vocabulary word sense disambiguation with graph-based algorithms for sequence data labeling. In In HLT/EMNLP 2005, pages 411–418.

 

 

Sinha, R. and Mihalcea, R. (2007). Unsupervised graph-based word sen- se disambiguation using measures of word semantic similarity. In Proceedings of the International Conference on Semantic Computing, pages 363–369, Washington, DC, USA. IEEE Computer Society.

7th June 2012

Public holiday

 

14th June 2012

dropped

 

21st June 2012

Lukas Werner

Cong, G., Wang, L., Lin, C.-Y., Song, Y.-I., and Sun, Y. (2008). Finding question- answer pairs from online forums. In Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR ’08, pages 467–474, New York, NY, USA. ACM.

 

Torsten Sillus

Leskovec, J., Huttenlocher, D. P., and Kleinberg, J. M. (2010). Signed networks in social media. In Proceedings of the 28th International Conference on Human Factors in Computing Systems, CHI 2010, Atlanta, Georgia, USA, April 10-15, 2010, pages 1361–1370.

21st June 2012

Wolfgang Stille

Feedback

A A A | Drucken Print | Impressum Impressum | Sitemap Sitemap | Suche Search | Kontakt Contact
zum Seitenanfangzum Seitenanfang