Logo Leibniz Universität Hannover
Logo: Institut für Verteilte Systeme - Fachgebiet Wissensbasierte Systeme (KBS)
Logo Leibniz Universität Hannover
Logo: Institut für Verteilte Systeme - Fachgebiet Wissensbasierte Systeme (KBS)
  • Zielgruppen
  • Suche
 

Foundations of Information Retrieval

Prof. Wolfgang Nejdl, Shanshan Xu

Content

The lecture gives an introduction to Web Information Retrieval with particular emphasis on the algorithms and technologies used in the modern search engines.
The module covers an introduction to the traditional text IR, including Boolean retrieval, vector space model as well as tolerant retrieval. Afterwards, the technical basics of Web IR are discussed, starting with the Web size estimation and duplicate detection followed by the link analysis and crawling. This leads on to the study of the modern search engine evaluation methods and various test collections. Finally, applications of classification and clustering in the IR domain are discussed. The theoretical basis is illustrated by the examples of the modern search systems, such as Google, Altavista, Clusty, etc.

Die Lehrveranstaltung behandelt Algorithmen, Strukturen und innovative Systeme, die im Rahmen des World Wide Web relevant sind bzw. durch das World Wide Web möglich geworden sind. Kernpunkte der Lehrveranstaltung sind Web-Suche (Web Crawling, Text Indexing, Ranking Mechanismen), Analyse und Struktur des World Wide Web, Datenmanagement (Suche, Topologien, Systeme), sowie weitere aktuelle Themen.

Recommended Literature:

Christopher D. Manning, Prabhakar Raghavan and Hinrich Schütze, Introduction to Information Retrieval, Cambridge University Press. 2008.

Participants:

Computer science students (recommended from the 3. semester) and ITIS students.

Contact:

Shanshan Xu

 Language:

English

 Lecture Dates

Lectures and tutorial session will take place online. Please refer to the StudIP course for more information.

Exam

The exam will be in English. You can answer in English or German. All topics discussed in the lectures, exercises and programming exercises are relevant.

Duration: 90 minutes.
Auxiliary material: a non-programmable calculator, dictionary.

Lecture notes:

WS 2019/20

Archive of the lectures "Foundations of Information Retrieval" and "Technologien für das Internet I":