Foundations of Information Retrieval

Prof. Wolfgang Nejdl, Markus Rokicki


The lecture gives an introduction to Web Information Retrieval, with particular emphasis on the algorithms and technologies used in modern search engines.
The module begins with traditional text IR, including Boolean retrieval, the vector space model, and tolerant retrieval. It then covers the technical basics of Web IR, starting with Web size estimation and duplicate detection, followed by link analysis and crawling. This leads on to modern search engine evaluation methods and various test collections. Finally, applications of classification and clustering in the IR domain are discussed. The theory is illustrated with examples from modern search systems such as Google, Altavista, and Clusty.

The course covers algorithms, structures, and innovative systems that are relevant in the context of the World Wide Web or that have been made possible by it. Core topics are Web search (Web crawling, text indexing, ranking mechanisms), the analysis and structure of the World Wide Web, data management (search, topologies, systems), and further current topics.

Recommended Literature:

Christopher D. Manning, Prabhakar Raghavan and Hinrich Schütze, Introduction to Information Retrieval, Cambridge University Press. 2008.


Intended for computer science students (recommended from the 3rd semester onward) and ITIS students.


Markus Rokicki, M.Sc.



Lecture Dates

The lecture and exercise sessions in WS 2016/2017 take place in the Multimedia-Room, Appelstr. 4, on Thursdays from 11:30 to approx. 14:00 (starting October 27, 2016).

The lectures will not be recorded.

Exam in WS 2016/2017

The exam will take place on 13.03.2017 at 10am in lecture rooms MZ1 and MZ2.

The exam will be in English.

Duration: 90 minutes.
Auxiliary material: a non-programmable calculator. 

Lecture notes:

Please find the lecture notes for WS 2016/2017 here.

Archive of the lectures "Foundations of Information Retrieval" and "Technologien für das Internet I": WS 2015/2016, WS 2014/2015, WS 2013/2014, WS 2012/2013, WS 2011/2012, WS 2010/11, WS 2009/2010