This Course enables the students to
Course | Learning outcome (at course level) | Learning and teaching strategies | Assessment Strategies | |
Course Code | Course Title | |||
24DBDA 511 B | Big Data and Text Mining (Theory) | CO307. Identify the applications of big data and analyze various sources of text data. CO308. Compare tools for handling big data in a real time scenario. CO309. Apply text mining techniques to analyse textual data. CO310. Evaluate methods for text representation and text mining in textual analysis. CO311. Build and evaluate machine learning models for text classification using appropriate metrics. CO312.Contribute effectively in course-specific interaction | Approach in teaching. Interactive Lectures, Demonstrations, Learning activities for the students. Self-learning assignments, Quizzes, Presentations, Discussions |
|
What is Big Data? Handling and Processing Big Data, Methodological Challenges and Problems faced in handling big data, big data applications, Text based big data. sources of text data, issues and handling of big data.
Big Data Overview, Drivers of Big Data, Big Data Attributes, Examples of Big Data Analytics, Introduction to Big Data Tools, Techniques, and Systems. The relationship between Apache Spark and Hadoop Ecosystem, Components of Spark.
Text mining concept, Data preprocessing (Tokenization, Normalization, Stemming), Data cleaning Applications of text mining.
Text clustering . Feature Selection and Transformation Methods for Text Clustering, Word and Phrase-based Clustering(K means AND K-Mediods).
Text Representation (Sequence of words, Syntactic structure, Entities and relation, Logic predicates), Word association mining and analysis. Basic word relations Paradigmatic, syntagmatic, Applications in text mining, Topic mining and analysis. Motivation, opinion mining and sentiment analysis, Text based prediction.
Classification, commonly used text classification methods. Decision Trees, SVM Classifiers, Feature Selection for Text Classification.
SUGGESTED TEXT BOOKS
SUGGESTED READINGS:
e RESOURCES
JOURNALS