- assignment: 40 %
- midterm exam: 30 %
- final exam: 30 %
References
- Christopher Westphal and Teresa Blaxton, Data Mining Solutions, John Wiley & Sons Inc., 1998.
- Pieter Adriaans and Dolf Zantinge, Data Mining, Addison Wesley, 1996.
- Michael J.A. Berry and Gordon Linoff, Data Mining Techniques, John Wiley & Sons Inc., 1997.
- Alex Berson and Stephen J. Smith, Data Warehousing, Data Mining & OLAP, McGraw Hill, 1997.
Evolution of Database Technology
- 1960s:
- 1970s:
- 1980s:
[Figure: Business Impact. Layers of increasing business impact, with labels: Data Sources (Paper, Files, Information Providers, Database Systems, OLTP); Data Warehouses / Data Marts; OLAP; Data Exploration (Statistical Analysis, Querying and Reporting); Data Mining (Information Discovery). Other labels: Client Data, Data Warehouse, ERP, Packaged Application, Custom Application, Intelligence Enterprise.]
Potential Applications of Data Mining
- Market
  - purchasing
- Risk
  - forecasting
  - credit
  - money
- Web mining
  - Web Usage Mining
  - Web Content Mining
  - Web Structure Mining
- Text mining
[Figure: the data mining process. Labels: Business Objective, Databases, Structured Data, Unstructured Data, Selection, Target Data, Data Preparation, Preprocessed Data, Data Mining.]
- Objectives Determination: Identify
- Data Selection: Identify
- Transformation
- Preprocessing
- Data Mining: Select the data mining modeling technique
Mining Operations
- Predictive Modeling
- Database Segmentation
- Link Analysis
- Visualization
Database Segmentation (clustering)
- Partitioning a database into segments of similar records, that is, records that share a number of properties.
- Model: [Figure: plot with axes Age and Annual Income]
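The segmentation idea above can be illustrated with a small clustering sketch. The slide does not name a specific algorithm, so k-means is used here as an assumption; the records (age, annual income in thousands) and the choice of the first k records as starting centroids are hypothetical and chosen only to keep the example deterministic.

```python
from math import dist

def kmeans(points, k, iterations=20):
    """Partition points into k segments with a basic k-means loop."""
    # Assumption: the first k records serve as initial centroids (deterministic).
    centroids = [tuple(p) for p in points[:k]]
    labels = []
    for _ in range(iterations):
        # Assignment step: each record joins its nearest centroid.
        labels = [min(range(k), key=lambda c: dist(p, centroids[c])) for p in points]
        # Update step: each centroid moves to the mean of its members.
        new_centroids = []
        for c in range(k):
            members = [p for p, label in zip(points, labels) if label == c]
            if members:
                new_centroids.append(tuple(sum(col) / len(members) for col in zip(*members)))
            else:
                new_centroids.append(centroids[c])  # keep an empty segment's centroid
        if new_centroids == centroids:
            break  # converged: no centroid moved
        centroids = new_centroids
    return labels, centroids

# Hypothetical records: (age in years, annual income in thousands).
records = [(25, 30), (27, 32), (24, 28), (52, 90), (55, 95), (50, 88)]
labels, centroids = kmeans(records, k=2)
print(labels)
```

In practice the attributes would usually be rescaled first, since income and age are measured on very different scales and the larger one would otherwise dominate the distance.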
Predictive Modeling
- Finding models for future prediction
- Classification: predicts categorical class labels
- Prediction: models continuous-valued functions
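To contrast the two bullets above: a classifier outputs a category, while a prediction model outputs a number. A minimal sketch of the second case is a least-squares line fit. The slides do not prescribe this technique, and the data (years of experience, salary in thousands) and the function name `fit_line` are hypothetical.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Hypothetical data: years of experience and salary in thousands.
years = [1, 2, 3, 4, 5]
salary = [35, 40, 45, 50, 55]

slope, intercept = fit_line(years, salary)
predicted = slope * 6 + intercept  # continuous-valued output for an unseen input
print(slope, intercept, predicted)
```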
- Model: Classifier
[Figure: classifier applied to data. Labels: Training Data, Unseen Data.]

NAME     RANK            YEARS  TENURED
Tom      Assistant Prof  2      no
Merlisa  Associate Prof  7      no
George   Professor       5      yes
Joseph   Assistant Prof  7      yes

Unseen data: (Jeff, Professor, 4). Tenured?
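As a sketch of how a classifier is used on the labeled rows above and then on the unseen record, the code below applies a simple rule and measures its accuracy. The rule itself (professor rank or more than six years) is an assumption made for illustration; the slides as extracted do not state the model.

```python
from typing import NamedTuple

class Record(NamedTuple):
    name: str
    rank: str
    years: int
    tenured: str

# The four labeled rows from the table.
labeled = [
    Record("Tom", "Assistant Prof", 2, "no"),
    Record("Merlisa", "Associate Prof", 7, "no"),
    Record("George", "Professor", 5, "yes"),
    Record("Joseph", "Assistant Prof", 7, "yes"),
]

def classify(rank: str, years: int) -> str:
    """Hypothetical rule-based classifier (assumed, not taken from the slides)."""
    return "yes" if rank == "Professor" or years > 6 else "no"

# Compare predicted labels with the known labels.
correct = sum(classify(r.rank, r.years) == r.tenured for r in labeled)
accuracy = correct / len(labeled)

# Apply the same rule to the unseen record (Jeff, Professor, 4).
jeff = classify("Professor", 4)
print(accuracy, jeff)
```

Under this assumed rule, Merlisa is misclassified, which is the kind of error that accuracy on labeled data is meant to expose before the model is trusted on unseen records.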
Link Analysis
- Finding frequent patterns, associations, correlations, or causal structures among sets of items or objects in transaction databases, relational databases, and other information repositories
- Model: Apriori Algorithm
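A compact sketch of Apriori-style frequent itemset mining may help make the definition above concrete. It follows the usual level-wise scheme (count candidates, keep the frequent ones, join and prune to form the next level). The baskets and the support threshold are hypothetical, and the function name `apriori` is chosen here for illustration.

```python
from itertools import combinations

def apriori(transactions, min_support):
    """Return {itemset: support count} for every frequent itemset."""
    transactions = [frozenset(t) for t in transactions]
    # Level 1 candidates: every single item that occurs anywhere.
    candidates = {frozenset([item]) for t in transactions for item in t}
    frequent = {}
    k = 1
    while candidates:
        # Count how many transactions contain each candidate.
        counts = {c: sum(1 for t in transactions if c <= t) for c in candidates}
        level = {c: n for c, n in counts.items() if n >= min_support}
        frequent.update(level)
        k += 1
        # Join step: unions of frequent (k-1)-itemsets that have exactly k items.
        joined = {a | b for a in level for b in level if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must itself be frequent.
        candidates = {
            c for c in joined
            if all(frozenset(s) in level for s in combinations(c, k - 1))
        }
    return frequent

# Hypothetical market baskets.
baskets = [
    {"bread", "milk"},
    {"bread", "diapers", "beer", "eggs"},
    {"milk", "diapers", "beer", "cola"},
    {"bread", "milk", "diapers", "beer"},
    {"bread", "milk", "diapers", "cola"},
]
frequent = apriori(baskets, min_support=3)
for itemset, count in sorted(frequent.items(), key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print(sorted(itemset), count)
```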
Visualization
[Figure: Visualization of a decision tree in MineSet 3.0]
of results
Interpret
of knowledge
CRISP-DM
- Cross Industry Standard Process for Data Mining (CRISP-DM)
- Consortium of data miners from various industries, including manufacturing, marketing, and government
[Table: data mining tools. Recoverable entries: Enterprise Miner; Clementine (from SPSS); IBM Intelligent Miner; Server 2005; Oracle Data Miner; (DBMiner Technology Inc.); Weka (Open Source Software). Recoverable cell values: Data, Multiple, Tight.]
- exploration
  - development of application-specific data mining systems
  - Invisible data mining (mining as built-in function)
- Constraint-based mining: use of constraints to guide data mining systems in their search for interesting patterns
- Integration of data mining
- language
- Web Mining
- Multimedia Mining
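The constraint-based mining item in the list above can be sketched as a frequent itemset search in which a user-supplied constraint prunes candidates before their support is counted. Everything specific here is an assumption for illustration: the baskets, the item prices, the budget constraint (total price of an itemset must not exceed a limit), and the function name `constrained_itemsets`.

```python
def constrained_itemsets(transactions, prices, min_support, max_total_price):
    """Depth-first search for frequent itemsets that satisfy a price constraint."""
    transactions = [frozenset(t) for t in transactions]
    items = sorted(prices)
    results = {}

    def extend(itemset, start):
        for i in range(start, len(items)):
            candidate = itemset | {items[i]}
            # Constraint check first. With non-negative prices the total can only
            # grow as items are added, so supersets of a violating set are skipped.
            if sum(prices[x] for x in candidate) > max_total_price:
                continue
            support = sum(1 for t in transactions if candidate <= t)
            if support < min_support:
                continue  # supersets of an infrequent set cannot be frequent
            results[candidate] = support
            extend(candidate, i + 1)

    extend(frozenset(), 0)
    return results

# Hypothetical baskets and prices.
baskets = [
    {"bread", "milk"},
    {"bread", "diapers", "beer", "eggs"},
    {"milk", "diapers", "beer", "cola"},
    {"bread", "milk", "diapers", "beer"},
    {"bread", "milk", "diapers", "cola"},
]
prices = {"bread": 2, "milk": 1, "diapers": 8, "beer": 6, "cola": 2, "eggs": 3}

patterns = constrained_itemsets(baskets, prices, min_support=3, max_total_price=10)
for itemset, support in patterns.items():
    print(sorted(itemset), support)
```

Here the pair {diapers, beer} appears in three baskets but is never counted, because its total price already exceeds the assumed budget; the constraint narrows the search rather than filtering results afterwards.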
[Figure: Data Mining and related disciplines: Machine Learning, Information Science, Statistics, Visualization, Other Disciplines.]