CS1146 WEB MINING
Instructor
Place Email id Syllabus Study Materials |
Radha
SRM University [email protected] Download UNIT I - INTRODUCTION (9 hours) Introduction: World Wide Web, History of the Web and the Internet, What is Data Mining? What is Web Mining? Introduction to Association Rule Mining, Supervised Learning & Unsupervised Learning. Information Retrieval and Web Search: Basic Concepts of Information Retrieval, Information Retrieval Models, Relevance Feedback, Evaluation Measures, Text and Web Page Pre-Processing, Inverted Index and Its Compression, Latent Semantic Indexing, Web Search, Meta-Search: Combining Multiple Rankings, Web Spamming. UNIT II- SOCIAL NETWORK ANALYSIS (9 hours) Social Network Analysis: Introduction, Co-Citation and Bibliographic Coupling, Page Rank, HITS Algorithm, Community Discovery. Web Crawling: A Basic Crawler Algorithm, Implementation Issues, Universal Crawlers, Focused Crawlers, Topical Crawlers, Evaluation, Crawler Ethics and Conflicts. UNIT III- STRUCTURED DATA EXTRACTION (9 hours) Structured Data Extraction: Wrapper Generation, Preliminaries, Wrapper Induction, Instance-Based Wrapper Learning, Automatic Wrapper Generation: Problems, String Matching and Tree Matching, Building DOM Trees, Extraction Based on a Single List Page, Extraction Based on Multiple Pages. UNIT IV- INFORMATION INTEGRATION (9 hours) Information Integration: Introduction to Schema Matching, Pre-Processing for Schema Matching, Schema -Level Matching, Domain and Instance-Level Matching, Combining Similarities, 1: m Match, Integration of Web Query Interfaces, Constructing a Unified Global Query Interface. Opinion Mining and Sentiment Analysis: The Problem of Opinion Mining, Document Sentiment Classification, Sentence Subjectivity and Sentiment Classification, Opinion Lexicon Expansion, Aspect- Based Opinion Mining, Opinion Search and Retrieval, Opinion Spam Detection. UNIT V- WEB USAGE MINING (9 hours) Web Usage Mining: Data Collection and Pre-Processing, Data Modeling for Web Usage Mining, Discovery and Analysis of Web Usage Patterns, Recommender Systems and Collaborative Filtering, Query Log Mining, Computational Advertising. |
Text Book
|
Mining the Web, 1st Edition Discovering Knowledge from Hypertext Data
Author : S Chakrabarti Mprint: Morgan Kaufmann Download |