May 5, 2015
Chapter 1. Introduction
Evolution of Database Technology
1970s:
1980s:
1990s-2000s:
Other Applications
Target marketing
Cross-market analysis
Customer profiling
Data mining can tell you what types of customers buy what products (clustering or classification).
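As a rough illustration of customer profiling by clustering, the sketch below groups a handful of made-up customers with a plain k-means loop and then counts which products each group buys. The customer records, the two features (age and yearly spend), the choice of two clusters, and the initial centroids are all assumptions for the example, not part of the original slides.

```python
from collections import Counter
from math import dist

# Hypothetical customers: (age, yearly_spend, favourite_product)
customers = [
    (22, 300, "games"),
    (25, 350, "games"),
    (24, 280, "games"),
    (47, 1200, "furniture"),
    (52, 1100, "furniture"),
    (50, 1300, "garden"),
]

def nearest(point, centroids):
    """Index of the centroid closest to the point (Euclidean distance)."""
    return min(range(len(centroids)), key=lambda i: dist(point, centroids[i]))

def kmeans(points, centroids, rounds=10):
    """Plain k-means: assign points to the nearest centroid, then recompute centroids."""
    for _ in range(rounds):
        clusters = [[] for _ in centroids]
        for p in points:
            clusters[nearest(p, centroids)].append(p)
        # Keep the old centroid if a cluster ends up empty.
        centroids = [
            tuple(sum(axis) / len(axis) for axis in zip(*members)) if members else centroids[i]
            for i, members in enumerate(clusters)
        ]
    return centroids

# Features are left unscaled here, so yearly spend dominates the distance.
points = [(age, spend) for age, spend, _ in customers]
centroids = kmeans(points, centroids=[points[0], points[-1]])

# Profile each cluster by the products its members buy.
profile = {}
for age, spend, product in customers:
    label = nearest((age, spend), centroids)
    profile.setdefault(label, Counter())[product] += 1

for label, products in sorted(profile.items()):
    print(label, dict(products))
```

In practice the features would normally be scaled before clustering, and a classifier could be trained instead when customer types are already labelled.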
Resource planning:
Competition:
Applications
Approach
Examples
Retail
Other Applications
Sports
Astronomy
[Figure, top to bottom: Selection; Data Cleaning; Data Integration; Databases]
[Figure: layered pyramid, top to bottom]
Making Decisions
Data Presentation: Visualization Techniques
Data Mining: Information Discovery
Data Exploration: Statistical Analysis, Querying and Reporting
Data Warehouses / Data Marts: OLAP, MDA
Data Sources: Paper, Files, Information Providers, Database Systems, OLTP
[Roles, top to bottom: End User, Business Analyst, Data Analyst, DBA]
Architecture of a Typical Data Mining System
[Figure: layered architecture, top to bottom]
Graphical user interface
Pattern evaluation
Data mining engine
Database or data warehouse server
Data cleaning & data integration; Filtering
Databases; Data Warehouse
Knowledge base
Data Mining: Concepts and
Techniques
Relational databases
Data warehouses
Transactional databases
Advanced DB and information repositories
Cluster analysis
Outlier analysis
Outlier: a data object that does not comply with the general
behavior of the data
Similarity-based analysis
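To make the outlier definition above concrete, here is a minimal sketch that flags values lying far from the rest of the data using a z-score. The z-score rule, the threshold of 2.0, and the sample purchase amounts are assumptions chosen for illustration. The slides do not prescribe a particular detection method.

```python
from statistics import mean, stdev

def find_outliers(values, threshold=2.0):
    """Return values whose distance from the mean exceeds `threshold` standard deviations."""
    mu = mean(values)
    sigma = stdev(values)
    if sigma == 0:
        # All values are identical, so nothing deviates from the general behavior.
        return []
    return [v for v in values if abs(v - mu) / sigma > threshold]

# Hypothetical purchase amounts; one value clearly departs from the rest.
amounts = [52, 48, 50, 51, 49, 47, 53, 50, 400]
outliers = find_outliers(amounts)
print(outliers)
```

The mean and standard deviation are themselves pulled toward extreme values, so on small samples a low threshold or a median-based rule may be more suitable.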
Approaches
First generate all the patterns and then filter out the uninteresting ones.
Generate only the interesting patterns: mining query optimization.
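The two approaches can be contrasted on a toy frequent-itemset task. This sketch assumes that "interesting" means "appears in at least MIN_SUPPORT transactions", and it uses Apriori-style level-wise pruning as a stand-in for generating only the interesting patterns. The transactions, the threshold, and the function names are hypothetical.

```python
from itertools import combinations

# Hypothetical market-basket transactions.
transactions = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"bread", "butter"},
    {"milk", "tea"},
]
MIN_SUPPORT = 2  # assumed interestingness threshold

def support(itemset):
    """Number of transactions that contain every item in the itemset."""
    return sum(1 for t in transactions if itemset <= t)

def generate_then_filter():
    """Approach 1: enumerate every itemset, then drop the uninteresting ones."""
    items = sorted(set().union(*transactions))
    candidates = [
        frozenset(c)
        for size in range(1, len(items) + 1)
        for c in combinations(items, size)
    ]
    return {c for c in candidates if support(c) >= MIN_SUPPORT}

def generate_only_interesting():
    """Approach 2: grow itemsets level by level, extending only frequent ones."""
    items = sorted(set().union(*transactions))
    level = {frozenset([i]) for i in items if support(frozenset([i])) >= MIN_SUPPORT}
    result = set(level)
    while level:
        # Join frequent itemsets that differ by exactly one item.
        candidates = {a | b for a in level for b in level if len(a | b) == len(a) + 1}
        level = {c for c in candidates if support(c) >= MIN_SUPPORT}
        result |= level
    return result

all_then_filter = generate_then_filter()
only_interesting = generate_only_interesting()
print(all_then_filter == only_interesting)
```

Both functions return the same itemsets. The second one never counts support for itemsets built on an infrequent item, which is the saving that pushing the constraint into generation aims for.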
General functionality
A Multi-Dimensional View of Data Mining Classification
Databases to be mined
Relational, transactional, object-oriented, object-relational,
active, spatial, time-series, text, multi-media,
heterogeneous, legacy, WWW, etc.
Knowledge to be mined
Characterization, discrimination, association, classification,
clustering, trend, deviation and outlier analysis, etc.
Multiple/integrated functions and mining at multiple levels
Techniques utilized
Database-oriented, data warehouse (OLAP), machine
learning, statistics, visualization, neural network, etc.
Applications adapted