Introduction to IR
Motivation, Basic Concepts, Basic structure of search engine, Past and Future, The Retrieval Process, Web search and IR, Information Retrieval vs Data Retrieval , IR Vs. IE, Concept of relevance
Indexing & Query Processing
Index Construction, Indexing techniques for textual information items, such as inverted indices, Document Preprocessing: tokenization, stemming and stop words. Pattern Matching.
Study Popular Retrieval Models
Taxonomy of Information Retrieval Models, A Formal Characterization of IR Models.
Classic Information Retrieval: Basic Concepts, Boolean Model, Vector Model, Probabilistic Model, Brief Comparison of Classic Models
Language modeling. Probability ranking principle. Other commonly-used techniques include relevance feedback, pseudo relevance feedback, and query expansion and its Techniques.
Retrieval Performance Evaluation
Measures to compute similarity (Cosine, Jacquard), Retrieval performance evaluation: Recall and Precision, NDCG.
An Introduction to Web Search Basics : Web structure & Characteristics, Web Crawling and Indexes, Link Analysis, Introduction to IR based on Semantics, Ontologies.
· C.D. Manning, P. Raghavan, H. Schütze, “Introduction to Information Retrieval”, Cambridge UP, 2008.
D.A. Grossman, O. Frieder, “Information Retrieval: Algorithms and Heuristics”, Springer, 2004.