Motivation, Basic Concepts, Basic structure of search engine, Past and Future, The Retrieval Process, Web search and IR, Information Retrieval vs Data Retrieval , IR Vs. IE, Concept of relevance
Index Construction, Indexing techniques for textual information items, such as inverted indices, Document Preprocessing: tokenization, stemming and stop words. Pattern Matching.
Taxonomy of Information Retrieval Models, A Formal Characterization of IR Models.
Classic Information Retrieval: Basic Concepts, Boolean Model, Vector Model, Probabilistic Model, Brief Comparison of Classic Models
Language modeling. Probability ranking principle. Other commonly-used techniques include relevance feedback, pseudo relevance feedback, and query expansion and its Techniques.
Measures to compute similarity (Cosine, Jacquard), Retrieval performance evaluation: Recall and Precision, NDCG.
Web structure & Characteristics, Web Crawling and Indexes, Link Analysis, Introduction to IR based on Semantics, Ontologies.
|