Course Objectives
This course enables the students to
Course Outcomes(COs):
Learning outcomes (at course level) | Learning and teaching strategies | Assessment Strategies |
CO89. State the Data Warehouse fundamentals, Data Mining Principles.
CO90. Describe data warehouse with dimensional modeling and apply OLAP operations.
CO91. Apply data mining algorithms to solve real world problems.
CO92. Compare and evaluate different data mining techniques like classification, prediction, clustering and association rule mining.
CO93. Apply Data mining techniques on real world problems using tool.
CO94. Benefit user experiences towards research innovation and integration. | Approach in teaching: Interactive Lectures, Discussion, Demonstration with real world examples, Role plays, tool based experiment
Learning activities for the students: Self-learning assignments, Quiz activity, Effective questions, case study based learning approach, presentation, flip classroom
|
|
Data Warehousing: Basic Concepts, Architecture of Data Warehouse, OLAP and Data Cubes, Dimensional Data Modeling-star, snowflake schemas , Data Preprocessing – Need, Data Cleaning, Data Integration &Transformation, Data Reduction
Introduction to Data Mining: Basic Data Mining Tasks, Data Mining versus Knowledge Discovery process , Data Mining Issues, Data Mining Metrics, Social Implications of Data Mining, Overview of Applications of Data Mining
Data Mining Techniques: Frequent item-sets and Association rule mining: Apriori algorithm, Use of sampling for frequent item-set, FP tree algorithm
Mining Various Kinds of Association Rules – Association Mining to Correlation Analysis – Constraint-Based Association Mining.
Classification & Prediction: Decision tree learning: Construction, performance, attribute selection Issues: Over-fitting, tree pruning methods, missing values, continuous classes Classification and Regression Trees (CART) , Bayesian Classification: Bayes Theorem, Naïve Bayes classifier, Bayesian Networks Inference , Parameter and structure learning: Linear classifiers, Least squares, logistic, perceptron and SVM classifiers, KNN classifiers, Prediction: Linear regression, Non-linear regression
Accuracy Measures: Precision, recall, F-measure, confusion matrix, cross-validation, bootstrap, Clustering: k-means, k-medoids, Expectation Maximization (M) algorithm, Hierarchical clustering, Correlation clustering. Brief overview of advanced techniques: Active learning, Reinforcement learning, Text mining, Graphical models, Web Mining , Basics of Data Mining Tools