This course enables the students to
Course | Learning Outcome (at course level)
| Learning and teaching strategies | Assessment Strategies | |
Course Code | Course Title | |||
24MCA 423D | Natural Language Processing (Theory) |
| Approach in teaching: Interactive Lectures, Discussion, Presentations, Video Tutorials, Demonstration.
Learning activities for the students: Self-learning Assignments, Effective questions, Simulation |
|
Introduction to Natural Language Processing:
Natural Language Processing: Need, applications, Challenges in NLP: Ambiguity in language, Contextual words and phrases and homonyms, Co-reference, Domain-specific language, Low-resource languages, Segmentation, Lemmatization, Spelling correction, Synsets, Hypernyms and Code mixed language. Computational Linguistics, Different levels of Language Analysis, Representations and Understanding, Organization of Natural language Understanding Systems, Tokenization, Stop Words.
Grammar for Natural Language:
Grammar and sentence Structure, Top-Down and Bottom-Up Parsers, Transition Network Grammars, Auxiliary Verbs and Verb Phrases, Movement Phenomenon in Language, Context-Free Grammer, Handling questions in Context-Free Grammars. Human preferences in Parsing, Encoding uncertainty, Deterministic Parser.
Language Modeling and POS Tagging
Introduction to N-grams, Chain Rule, Smoothing –Interpolation, Backoff, Web-Scale LMs, Evaluation and Perplexity.
Hidden Markov Models and POS Tagging: Part of Speech Tagging, Markov Chain, Hidden Markov Models, Forward Algorithm, Viterbi Algorithm
Introduction to Natural Language Tool Kit (NLTK) with Python:
Text normalization, Tokenizing, Filtering Stop Words, Stemming, Tagging, Word representation, Sentence representation, Vector space model, Term Frequency, TF-IDF Representation, Introduction to Word embedding, iNLTK (Natural Language Toolkit for Indic Languages).
Neural Networks for Text
Multi Layer Perceptron, Convolutional Neural Network, Recursive Neural Network, Recurrent Neural Network, Long Short-Term Memory (LSTM), Sequence to sequence modeling, Text classification using neural network. Introduction to Machine translation, Natural Language Generation, Large language models , Question Answering Bot, Generative models, Generative Pre-Trained Transformer (GPT).
E-Resources
Journals