INTRODUCTION TO DATA SCIENCE

Paper Code: 
MCA 324A
Credits: 
4
Periods/week: 
4
Max. Marks: 
100.00
Objective: 

Course Objectives:

This course enables the students to

  1. Define the concepts of data science.
  2. Understand the concepts of big data in data science.
  3. Demonstrate the data science process.
  4. Differentiate between business intelligence and data science.
  5. Evaluate using different statistical methods.
  6. Construct cases and new ideas where the knowledge of data science can be implemented.

 

Course Outcomes(COs):

 

Learning Outcome (at course level)

 

Learning and teaching strategies

Assessment Strategies

  1. Define basic concepts of Data Science.
  2. Describe Big data and its Applications.
  3. Articulate the process of data Science and types of analytics.
  4. Compare and analyze different Statistical Methods used for data Science.
  5. Explore various Tools used for data Science.
  6. Implement Data Science Algorithms to solve various real life problems.

Approach in teaching:

Interactive Lectures, Modeling, Discussions, implementing enquiry based learning, Student centered approach, Through audio-visual aids

 

Learning activities for the students:

Experiential Learning, Presentations, Case based learning, Discussions, Quizzes and  Assignments

  • Assignments
  • Written tests in classroom
  • Classroom Activity
  • Objective Quiz
  • Semester End Exam
 

 

12.00
Unit I: 
12

Introduction

What is Data Science, Need for Data Science, Components of Data Science, Big data, Facets of data: Structured data, Unstructured data, Natural Language, Machine-generated data, Graph-based or network data, Audio, image and video, Streaming data, The need for Business Analytics, Data Science Life Cycle, Applications of data science

12.00
Unit II: 

Introduction to Big Data

Classification of Digital Data, Big Data and its importance, Four Vs, Drivers for Big data, Big data analytics, Classification of Analytics , Top Challenges Facing Big Data, Responsibilities of data scientists, Big data applications in healthcare, medicine, advertising

12.00
Unit III: 

Data Science Process

Overview of data science process, setting the research goal, Retrieving data, Cleansing, integrating and transforming data, Exploratory data analysis, Data Modeling, Presentation and automation, Types of Analytics: Descriptive analytics, Diagnostic analytics, Predictive analytics, Prescriptive analytics

12.00
Unit IV: 

Statistics

Basic terminologies, Population, Sample, Parameter, Estimate, Estimator, Sampling distribution, Standard Error, Properties of Good Estimator, Measures of Centers, Measures of Spread, Probability, Normal Distribution, Binary Distribution, Hypothesis Testing ,Chi-Square Test , ANOVA

 

12.00
Unit V: 

Data Science Tools and Algorithms

Basic Data Science languages- R, Python, Knowledge of Excel, SQL Database, Introduction to Weka, Regression Algorithms: How Regression Algorithm Work, Linear Regression, Logistic Regression, K-Nearest Neighbors Algorithm, K-means algorithm.

 

ESSENTIAL READINGS: 

 

  • Herbert Jones, Data Science: The Ultimate Guide to Data Analytics, Data Mining, Data Warehousing, Data Visualization, Regression Analysis, Database Querying, Big Data for Business and Machine Learning for Beginners, Bravex Publications,2020
  • Samuel Burns, “Fundamentals of Data Science: Take the first Step to Become a Data Scientist” , Amazon KDP Printing and Publishing, First Edition, 2019
  • Davy Cielen, Arno D.B. Meysman, Mohamed Ali, “Introducing Data Science”, Manning Publications, 2016

 

Suggested Readings:

  • Cathy O’Neil and Rachel Schutt, “Doing Data Science, Straight Talk From The Frontline”, O’Reilly. 2014.

 

Academic Year: