Академический Документы
Профессиональный Документы
Культура Документы
Database Fundamentals
Chapter 11:
Data Warehousing
Leon Chen
1
Overview
Definition
Data Warehouse:
Data Mart:
Data Reconciliation
or data cleansing
Transform
Load and Index
Incremental extract =
Record-level:
Field-level:
11
12
L
One companywide warehouse
T
E
Periodic extraction data is not completely current in warehouse
13
Data marts:
Mini-warehouses, limited in scope
T
E
14
T
E
Single ETL for
enterprise data warehouse (EDW)
15
E
Near real-time ETL for
@active Data Warehouse
16
17
Data Characteristics
Status vs. Event Data
Status
Status
18
Data Characteristics
Transient vs.
Periodic Data
Changes to existing
records are written
over previous
records, thus
destroying the
previous data content
star schema
Fact tables contain
factual or quantitative
data
1:N relationship
between dimension
tables and fact
tables
Dimension tables
are denormalized to
maximize
performance
20
21
Modeling dates
23
Dimension table keys must be surrogate (nonintelligent and non-business related), because:
24
OLAP Operations
25
26
Summary report
Example:
Drill-down
Drill-down with color added
27