
UNIT Chapter in Book Topic Page Number

UNIT 1 1 INTRODUCTION What is Data Mining? 1
Motivating Challenges 2
The origins of data mining 4
Data Mining Tasks 6
2 DATA 1 Types of Data Attribute and Measurement 19
Types of Data Sets 23
Data Quality Measurement & data Collection Issues 36
Issues Related To Application 43
UNIT 2 2 DATA 2 Data Preprocessing Aggregation 45
Sampling 47
Dimensionality Reduction 50
Feature Subset Selection 52
Feature Creation 55
Discretization & Binarization 57
Variable Transformation 63
Measures of Similarity and Dissimilarity Basics 66
Similarity & Dissimilarity between Simple Attributes 67
Dissimilarities Between Data Objects 69
Similarities Between Data Objects 72
Examples of Proximity Measures 73
Issues in Proximity Calculation 80
Selecting The Right Proximity Measure 83
UNIT CHAPTER IN BOOK TOPIC CONTENT PAGE NO.

UNIT 3 4 CLASSIFICATION Preliminaries 146
General approach to solving a classification problem 148
Decision tree induction How a Decision Tree Works 150
How To Build A Decision Tree 151
Method for expressing attribute test conditions 155
Measure for selecting the best split 158
Algorithm for decision tree induction 164
An example: web robot detection 166
Characteristics Of decision tree induction 168
5 CLASSIFICATION Rule-based classifier How a rule based classifier works 207
Rule ordering schemes 211
How to build a rule based classifier 212
Direct methods for rule extraction 213
Indirect method for rule extraction 221
Characteristics of rule based classifier 223
Nearest-neighbor classifier Algorithm 223
Characteristics Of Nearest Neighbor Classifier 225
UNIT - 4 6 ASSOCIATION ANALYSIS Problem Definition 328
Frequent Itemset generation The Apriori Principle 333
Frequent Itemset Generation in the Apriori Algorithm 335
Candidate Generation and Pruning 338
Support Counting 342
Computational Complexity 345
Rule Generation Confidence Based Pruning 350
Rule Generation in Apriori Algorithm 350
An Example: Congressional Voting Records 352
Compact representation of frequent itemsets Maximal Frequent Itemsets 354
Closed Frequent Itemsets 355
Alternative methods for generating frequent itemsets 359

UNIT - 5 FP-Growth algorithm FP Tree Representation 363
Frequent Itemset Generation in FP Growth Algorithm 366
Evaluation of association patterns Objective Measures of Interestingness 371
Measure beyond pairs of binary variables 382
Simpson’s Paradox 384
Effect of skewed support distribution 386

ASSOCIATION ANALYSIS – 2: Sequential patterns. Problem Formulation 429
Sequential Pattern Discovery 431
Timing Constraints 436
Alternative Counting Schemes 439

UNIT CHAPTER IN BOOK TOPIC CONTENT PAGE NO.

UNIT - 6 CLUSTER ANALYSIS Overview What Is Cluster Analysis 490
Different Types of Clustering 491
Different Types of Clusters 493
K-means The basic K-means Algorithm 497
K-means: Additional issues 506
Bisecting K-Means 508
K-Means and Different Types of Cluster 510
Strengths and Weaknesses 510
K-means as an Optimization Problem 513
Agglomerative hierarchical clustering Basic Agglomerative Hierarchical Clustering Algorithm 516
Specific Techniques 518
The Lance-Williams Formula for Cluster 524
Key issue in Hierarchical Clustering 524
Strength & Weakness 526
DBSCAN Traditional Density: Center-Based Approach 527
The DBSCAN Algorithm 528
Strengths and Weaknesses 530
Overview of Cluster Evaluation Overview 533

Unsupervised Cluster Evaluation Using Cohesion and Separation 536
Unsupervised Cluster Evaluation Using Proximity Matrix 542
Unsupervised Evaluation of Hierarchical Clustering 544
Determining the correct Number of Clusters 546
Clustering Tendency 547
Supervised Measures of Cluster Validity 548
Assessing the Significance of Cluster Validity Measures 553

UNIT CHAPTER IN BOOK 2 TOPIC CONTENT PAGE NO.

UNIT - 7 FURTHER TOPICS IN DATA MINING Multidimensional analysis and descriptive mining of complex data objects Generalization of Structured Data 592
Aggregation and Approximation in Spatial and Multimedia Data Generalization 593
Generalization of Object Identifiers and Class/subclass Hierarchies 594
Generalization of Class Composition Hierarchies 595
Construction and Mining of Object Cubes 596
Generalization Based Mining of Plan Databases by Divide and Conquer 596
Spatial data mining Spatial data Cube Construction and Spatial OLAP 601
Mining Spatial Association and Co-location Patterns 605
Spatial Clustering Methods 606
Spatial Classification and Spatial Trend Analysis 606
Mining Raster Databases 607
Multimedia data mining Similarity Search in Multimedia Data 608
Multidimensional Analysis of Multimedia Data 609
Classification and Prediction Analysis of Multimedia Data 611
Mining Association in Multimedia Data 612
Audio & Video Data Mining 613
Text mining Text Data Analysis and Information Retrieval 615
Dimensionality Reduction for Text 621
Text Mining Approach 624
Mining the WWW Mining the Web page layout structure 628-630
Mining the Web link Structure to Identify Authoritative Web Pages 631
Mining Multimedia Data on the Web 637
Automatic Classification of Web Documents 638
Web Usage Mining 640

UNIT CHAPTER IN BOOK TOPIC CONTENT PAGE NO.

UNIT - 8 APPLICATIONS Data mining applications Data mining for Financial Data Analysis 649
Retail Industry 651
Telecommunication Industry 652
Biological Data Analysis 654
Other Scientific Application 657
Intrusion Detection 658
Data mining system products and research prototypes How to Choose a Data mining System 660
Examples of Commercial Data Mining Systems 663
Additional themes on Data mining Theoretical Foundation of Data Mining 665
Statistical Data Mining 666
Visual and Audio Data Mining 667
Data Mining Privacy and Data Security 670
Social impact of Data mining Ubiquitous and Invisible Data Mining 675
Data Mining Privacy and Data Security 678
Trends in Data mining 681

TEXT BOOKS:
1. Introduction to Data Mining - Pang-Ning Tan, Michael Steinbach, Vipin Kumar, Pearson Education, 2007.
2. Data Mining – Concepts and Techniques - Jiawei Han and Micheline Kamber, 2nd Edition, Morgan Kaufmann, 2006.
 
REFERENCE BOOKS:
1. Insight into Data Mining – Theory and Practice - K.P. Soman, Shyam Diwakar, V. Ajay, PHI, 2006.
 
