Module 1: NLP Fundamentals
Introduction to NLP β Understanding how machines process human language.
End-to-End NLP Pipeline β From raw text to model prediction.
Text Preprocessing β Tokenization, stopwords, stemming & lemmatization.
Text Representation β Converting text into numerical vectors.
Word2Vec (CBOW & Skip-gram) β Deep understanding of word embeddings.
Text Classification β Spam detection, sentiment analysis & categorization.
POS Tagging β Identifying grammatical roles of words.
Duplicate Question Pairs β Detecting semantically similar questions.
Module 2: Neural Networks & Transformers
Artificial Neural Networks β Brain-inspired learning systems.
Activation Functions β Sigmoid, Tanh, ReLU & Leaky ReLU.
Loss Functions β MSE, MAE, Huber, Hinge, BCE & CCE.
Backpropagation β Core learning algorithm of deep networks.
Learning Rate, Epochs & Batch Size β Training fundamentals.
History of Large Language Models (LLMs).
EncoderβDecoder Architecture β Sequence-to-sequence models.
Attention Mechanism β Learning what matters most.
Bahdanau vs Luong Attention β Comparative study.
Transformers β Backbone of modern NLP & GenAI.
Self Attention & Scaled Dot Product Attention.
Multi-Head Attention β Parallel attention learning.
Positional Encoding β Injecting sequence order.
Layer Normalization β Stable training.
Masked Self Attention & Cross Attention.
Transformer Decoder & Inference Process.
Module 3: Large Language Models (LLMs)
How LLMs Work β High-level internal overview.
LLM Parameters β Temperature, Top-P, Stop Sequences & more.
LLM Hallucination β Causes & mitigation techniques.
Module 4: Prompt Engineering
Prompt Engineering β Designing effective prompts from beginner to pro.
Module 5: Word Embeddings & Similarity
Word Embeddings β Dense vector representations.
Word2Vec Intuition β Semantic relationships.
CBOW & Skip-gram β Internal working.
One-Hot Encoding & Bag of Words.
N-grams β Contextual learning.
TF-IDF β Importance-weighted embeddings.
Word vs Sentence Embeddings.
Cosine Similarity & Distance Metrics.
Module 6: Generative AI & LangChain
GenAI Roadmap β End-to-end learning path.
Introduction to LangChain.
LangChain Components β Models, prompts, chains & tools.
Structured Output & Output Parsers.
Document Loaders & Text Splitters.
Vector Stores & Retrievers.
Tool Calling & AI Agents.
Ollama β Running powerful local LLMs.
Module 7: APIs, Deployment & MLOps
API Fundamentals & Architecture.
FastAPI β High-performance Python APIs.
HTTP Methods β GET, POST, PUT, DELETE.
Pydantic β Data validation & schemas.
Serving ML Models with FastAPI.
Docker for Machine Learning.
FastAPI + Docker Deployment.
Deploying AI APIs on AWS.
Projects & Practical Implementation
Real-World AI & NLP Projects β Each student will work on industry-relevant problems using real datasets.
Minimum 5+ Projects per Student β Projects will be built after completing the modules to ensure strong practical understanding.
NLP-Based Projects β Text classification, duplicate question detection, sentiment analysis, and POS tagging based systems.
LLM & GenAI Projects β Prompt engineering use cases, LLM-based applications, and AI assistants.
LangChain Projects β Building document-based Q&A systems, AI agents, and retrieval-augmented generation (RAG) pipelines.
Deployment-Focused Projects β Serving ML models using FastAPI, Docker, and deploying APIs on cloud platforms.
EMS-Based Project Submission β All projects will be submitted, tracked, and evaluated through the EMS system.
Performance Review & Feedback β Each project will be reviewed and feedback will be provided for improvement.