This course enables the students to Course Outcomes(COs): Learning Outcome (at course level) Learning and teaching strategies Assessment Strategies CO183. Familiar with classic and recent developments in Web search and web mining. CO184. Identify the different components of a web page that can be used for mining. CO185. Learn basic concepts to web content mining. CO186. Implement Page Ranking algorithm and modify the algorithm for mining information CO187. Modify an existing search engine to make it personalized using web analytics Approach in teaching: Interactive Lectures, Discussion, Demonstration, Experiment Learning activities for the students: Self-learning assignments, Quiz activity, presentation, flip classroom, · Assignments · Written test in classroom · Classroom activity · Continues Assessment · Semester End Examination
Introduction Introduction – Web Mining – Theoretical background –Algorithms and techniques – Association rule mining – Sequential Pattern Mining -Information retrieval and Web search – Information retrieval Models-Relevance Feedback- Text and Web page Pre-processing
Web Content Mining Web Content Mining – Supervised Learning – Decision tree - Naive Bayesian Text Classification -Support Vector Machines - Ensemble of Classifiers. Unsupervised Learning - K-means Clustering -Hierarchical Clustering –Partially Supervised Learning
Web Structure and Web Usage Mining Hyperlink based Ranking – Introduction -Social Networks Analysis- Co-Citation and Bibliographic Coupling - Page Rank -Authorities -Enhanced Techniques for Page Ranking - Community Discovery – Web Crawling -A Basic Crawler Algorithm- Implementation Issues Web Usage Mining – sources of data- Applications -Click stream Analysis -Web Server Log Files - Data Collection and Pre Processing- Cleaning and Filtering- Data Modeling for Web Usage Mining – Issues- Discovery and Analysis of Web Usage Patterns – Used tools in Web Usage mining.
Introduction to web analytics Motivation and historical perspective on the development of web analytics, Display and search advertising , Knowledge discovery from web data, Major computing paradigms, Typical problem formulations
Web analytics at e-Business scale Framework for mapping business needs to web analytics tasks, Data collection architecture, Introduction to OLAP, Web data exploration and reporting, Introduction to Splunk