Literature
We use mainly the book "Introduction to Information Retrieval" of Christopher D. Manning, Prabhakar Raghavan and Hinrich Schütze, that you can find online and as PDF here.
Contact
In case you have questions, please contact Dr. Elena Demidova.
Lectures and Dates
- Boolean retrieval print version (17.10.2013)
- The term vocabulary & postings lists print version (24.10.2013)
- Scoring, term weighting, the vector space model print version (31.10.2013)
- Dictionaries & tolerant retrieval print version (07.11.2013)
- Evaluation & result summaries print version (14.11.2013)
- Relevance feedback & query expansion print version (21.11.2013)
- XML retrieval (28.11.2013)
- Text classification & Naive Bayes print version (05.12.2013)
- Flat clustering print version (12.12.2013) K-means Demo
- Web search basics print version (19.12.2013) Google Adwords Demo
- Size estimation & duplicate detection print version (09.01.2014)
- Crawling print version (16.01.2014)
- Link analysis print version (23.01.2014)
- Q&A (30.01.2014)
Exercises
- Exercise 1 (24.10.2013)
- Exercise 2 (31.10.2013)
- Exercise 3 (07.11.2013) notes to ex 3
- Exercise 4 (14.11.2013)
- Exercise 5 (21.11.2013)
- Exercise 6 (28.11.2013)
- Exercise 7 (12.12.2013)
- Exercise 8 (19.12.2013)
- Midterm test
- Exercise 9 (09.01.2014)
- Exercise 10 (16.01.2014) iCrawl announcement
- Exercise 11 (23.01.2014) notes to ex 11
- Q&A (30.01.2014)
Program code
To run the code you shoud have a running Eclipse IDE as well as the IR library Lucene 4.5.x (download).
Lucene Test (Zip archive)