Hasso-Plattner-Institut für Softwaresystemtechnik
Natural Language Processing

Prof. Dr. Felix Naumann

Hasso-Plattner-Institut
für Softwaresystemtechnik
Prof.-Dr.-Helmert-Str. 2-3
D-14482 Potsdam, Germany


Description

Machine understanding of human language is one of the important topics in computer science. A wide range of natural language processing tools and technologies is in daily use: from the simplest cases, such as spell checkers and grammar checkers, to more complicated systems, such as speech recognition, machine translation, question answering, email categorization, handwriting recognition, and search engines.

Processing natural language is divided into two main parts:

  • end-to-end applications that are frequently used by many people,
  • intermediate techniques that are the core part of different applications.

This lecture introduces the main techniques and applications of natural language processing. In addition, we briefly describe the language modeling and machine learning concepts required to work with these techniques and applications.

 

Organizational information

  • Contact: Dr. Saeedeh Momtazi
  • 3 Credit Points
  • Schedule: Thursdays, 09:15 - 10:45
  • Room: HS 3
  • Mailing List: nlp2012(at)hpi(dot)uni-potsdam(dot)de

 

 

Schedule and Materials

Date      Topic(s)                                 Slides

12.04.12  • Introduction to Language Technology    Session-1.pdf
19.04.12  • Language Modeling                      Session-2.pdf
26.04.12  • Machine Learning for NLP               Session-3.pdf
03.05.12  • Part of Speech Tagging                 Session-4.pdf
          • Named Entity Recognition
10.05.12  • Parsing                                Session-5.pdf
17.05.12  Public Holiday
24.05.12  No Lecture
31.05.12  • Lexical Semantics
          • Word Sense Disambiguation
07.06.12
14.06.12
21.06.12
28.06.12
05.07.12
12.07.12

Exercises

  • Exercise 1:
    • Task: implement the probabilistic CKY algorithm, which uses a set of rules R. The rules will be available soon. To get started, you can use the small set of rules from the slides.
    • Deadline: 30.05.2012
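
As a starting point, the general shape of probabilistic CKY can be sketched as follows. This is only a minimal sketch under stated assumptions, not a reference solution: the toy grammar, its probabilities, and the names `BINARY_RULES`, `LEXICAL_RULES`, `pcky`, and `build_tree` are all made up for illustration and are not the rule set R from the lecture. The sketch assumes the grammar is in Chomsky normal form (only rules of the form A → B C and A → word) and does not handle unary rules.

```python
from collections import defaultdict

# Hypothetical toy grammar in Chomsky normal form (NOT the course's rule set R).
# Binary rules: (parent, left child, right child) -> probability
BINARY_RULES = {
    ("S", "NP", "VP"): 1.0,
    ("VP", "V", "NP"): 1.0,
    ("NP", "Det", "N"): 0.6,
}
# Lexical rules: (parent, word) -> probability
LEXICAL_RULES = {
    ("NP", "she"): 0.4,
    ("V", "eats"): 1.0,
    ("Det", "the"): 1.0,
    ("N", "fish"): 1.0,
}


def pcky(words, binary_rules, lexical_rules):
    """Fill the CKY chart bottom-up.

    best[(i, j)][A] is the highest probability of symbol A spanning words[i:j];
    back stores how that best analysis was built, for tree recovery.
    """
    n = len(words)
    best = defaultdict(dict)
    back = {}

    # Spans of length 1: apply lexical rules A -> word.
    for i, word in enumerate(words):
        for (parent, w), p in lexical_rules.items():
            if w == word and p > best[(i, i + 1)].get(parent, 0.0):
                best[(i, i + 1)][parent] = p
                back[(i, i + 1, parent)] = word

    # Longer spans: try every split point k and every binary rule A -> B C.
    for length in range(2, n + 1):
        for i in range(0, n - length + 1):
            j = i + length
            for k in range(i + 1, j):
                for (parent, left, right), p in binary_rules.items():
                    if left in best[(i, k)] and right in best[(k, j)]:
                        score = p * best[(i, k)][left] * best[(k, j)][right]
                        if score > best[(i, j)].get(parent, 0.0):
                            best[(i, j)][parent] = score
                            back[(i, j, parent)] = (k, left, right)
    return best, back


def build_tree(back, i, j, symbol):
    """Follow the backpointers to recover the best tree as nested tuples."""
    entry = back[(i, j, symbol)]
    if isinstance(entry, str):  # lexical rule: entry is the word itself
        return (symbol, entry)
    k, left, right = entry
    return (symbol, build_tree(back, i, k, left), build_tree(back, k, j, right))


words = "she eats the fish".split()
best, back = pcky(words, BINARY_RULES, LEXICAL_RULES)
prob = best[(0, len(words))].get("S", 0.0)
tree = build_tree(back, 0, len(words), "S") if prob > 0 else None
print(prob, tree)
```

With this toy grammar, the only parse of "she eats the fish" has probability of about 0.24 (0.4 for NP → she times 0.6 for NP → Det N). For longer sentences, summing log probabilities instead of multiplying raw probabilities avoids numerical underflow.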

Course Book

SPEECH AND LANGUAGE PROCESSING

An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition

 

by Daniel Jurafsky and James H. Martin

Second Edition

 

Topics

  • Introduction {2 x 2h}
    • Introduction to Language Technology
    • Language Modeling
  • Machine Learning for NLP {2 x 2h}
    • Learning Techniques
    • Classification Algorithms
    • Clustering Algorithms
  • NLP Techniques {5 x 2h}
    • Part of Speech Tagging
    • Syntactic Analysis
    • Named Entity Recognition
    • Word Similarity
    • Word Sense Disambiguation
    • Lexical Semantics & Semantic Role Labeling
  • NLP Applications {5 x 2h}
    • Text Categorization
    • Information Extraction & Ontology Extraction
    • Question Answering
    • Sentiment Analysis
    • Summarization
    • Machine Translation
    • Information Retrieval