Agenda
Customer Environment
Real-Time Requirement
Mike Schmitz
My Background
An independent data warehousing consultant specializing in the
dimensional approach to data warehouse / data mart design and
implementation, with in-depth experience using efficient, scalable
techniques, whether dealing with large-scale data warehouses or
small-scale, platform-constrained data mart implementations. I deliver
dimensional design and implementation as well as ETL workshops in the
U.S. and Europe.
I have helped implement data warehouses using Redbrick, Oracle,
Teradata, DB2, Informix, and SQL Server on mainframe, UNIX, and NT
platforms, working with small and large businesses across a variety of
industries including such customers as Hewlett Packard, American
Express, General Mills, AT&T, Bell South, MCI, Oracle Slovakia, J.D.
Power and Associates, Mobil Oil, The Health Alliance of Greater Cincinnati,
and the French Railroad SNCF.
Business rules
Various data process flows and dependencies
Real Time
Near Real-Time
Just in Time for the business
Customer Target
Three-Step Approach:
1. Synchronous or Asynchronous
[Diagram labels: Trigger, Staging, Operations, Asynch CDC, Batch, DW]
Asynchronous CDC
[Diagram labels: OLTP DB; redo log files; logical change data (based on LogMiner); transform (SQL, PL/SQL, Java); DW tables; Oracle10g]
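The asynchronous flow named in the diagram (changes captured from the source, exposed as logical change data, then transformed into DW tables) can be illustrated in miniature. This is a toy model under stated assumptions, not Oracle's CDC API: the `ChangeRecord` shape, the `apply_changes` helper, and the sample plant rows are all hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, Optional


@dataclass(frozen=True)
class ChangeRecord:
    """One logical change as a consumer might see it (hypothetical shape)."""
    seq: int              # position in commit order
    op: str               # "I" = insert, "U" = update, "D" = delete
    key: str              # primary key of the changed source row
    value: Optional[int]  # new column value; None for deletes


def apply_changes(target: Dict[str, int], changes: Iterable[ChangeRecord]) -> int:
    """Apply changes to `target` in commit order; return the last seq applied."""
    last_seq = 0
    for change in sorted(changes, key=lambda c: c.seq):
        if change.op in ("I", "U"):
            target[change.key] = change.value
        elif change.op == "D":
            target.pop(change.key, None)
        else:
            raise ValueError(f"unknown operation: {change.op!r}")
        last_seq = change.seq
    return last_seq


# Hypothetical captured changes, deliberately delivered out of order.
dw_table: Dict[str, int] = {}
captured = [
    ChangeRecord(3, "U", "PLANT-A", 550),
    ChangeRecord(1, "I", "PLANT-A", 500),
    ChangeRecord(2, "I", "PLANT-B", 300),
    ChangeRecord(4, "D", "PLANT-B", None),
]
high_water_mark = apply_changes(dw_table, captured)
```

Returning the last applied sequence number lets a consumer record how far it has read, so a later run can pick up only newer changes.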
Continuous
Periodic Batch
AO_CDC_OP
AO_CDC
AO_CDC_DW
D_OUTPUT_MINUTE
OUTPUT_MINUTE_KEY: NUMBER
OUTPUT_DAY_KEY: NUMBER
OUTPUT_MINUTE_KEY: NUMBER
GENERATING_PLANT_KEY: NUMBER(4)
OUTPUT_ACTUAL_QTY_IN_KWH: NUMBER(15)
F_CURRENT_DAY_PLANT_OUTPUT
D_GENERATING_PLANT
GENERATING_PLANT_KEY: NUMBER(4)
PLANT_ID: VARCHAR2(24)
PLANT_NAME: VARCHAR2(32)
PLANT_STATUS: VARCHAR2(15)
PLANT_TARGET_MAX_CAPACITY_KWH: NUMBER(15)
PLANT_ABSOL_MAX_CAPACITY_KWH: NUMBER(15)
UPDATE_TS: TIMESTAMP(6)
OUTPUT_DAY_KEY: NUMBER(7)
OUTPUT_MINUTE_KEY: NUMBER(4)
GENERATING_PLANT_KEY: NUMBER(4)
OUTPUT_ACTUAL_QTY_IN_KWH: NUMBER(15)
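The column lists above can be exercised as a small star schema. The sketch below uses Python's built-in `sqlite3` module rather than Oracle, so SQLite types stand in for `NUMBER`, `VARCHAR2`, and `TIMESTAMP`. It also assumes that the four fact columns belong to `F_CURRENT_DAY_PLANT_OUTPUT`, which the flattened slide does not make explicit. All sample rows are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE D_OUTPUT_MINUTE (OUTPUT_MINUTE_KEY INTEGER PRIMARY KEY);
CREATE TABLE D_GENERATING_PLANT (
    GENERATING_PLANT_KEY          INTEGER PRIMARY KEY,
    PLANT_ID                      TEXT,
    PLANT_NAME                    TEXT,
    PLANT_STATUS                  TEXT,
    PLANT_TARGET_MAX_CAPACITY_KWH INTEGER,
    PLANT_ABSOL_MAX_CAPACITY_KWH  INTEGER,
    UPDATE_TS                     TEXT
);
-- Assumed owner of the fact columns; see the lead-in.
CREATE TABLE F_CURRENT_DAY_PLANT_OUTPUT (
    OUTPUT_DAY_KEY           INTEGER,
    OUTPUT_MINUTE_KEY        INTEGER REFERENCES D_OUTPUT_MINUTE,
    GENERATING_PLANT_KEY     INTEGER REFERENCES D_GENERATING_PLANT,
    OUTPUT_ACTUAL_QTY_IN_KWH INTEGER
);
""")

# Hypothetical sample data: two plants, two minutes of one day.
conn.executemany("INSERT INTO D_OUTPUT_MINUTE VALUES (?)", [(0,), (1,)])
conn.executemany(
    "INSERT INTO D_GENERATING_PLANT VALUES (?, ?, ?, ?, ?, ?, ?)",
    [
        (1, "P-001", "North", "ONLINE", 1000, 1200, "2003-09-08 00:00:00"),
        (2, "P-002", "South", "ONLINE", 800, 900, "2003-09-08 00:00:00"),
    ],
)
conn.executemany(
    "INSERT INTO F_CURRENT_DAY_PLANT_OUTPUT VALUES (?, ?, ?, ?)",
    [(1, 0, 1, 120), (1, 1, 1, 110), (1, 0, 2, 90)],
)

# Total output per plant: a typical dimensional query over the fact table.
rows = conn.execute("""
    SELECT p.PLANT_NAME, SUM(f.OUTPUT_ACTUAL_QTY_IN_KWH)
    FROM F_CURRENT_DAY_PLANT_OUTPUT f
    JOIN D_GENERATING_PLANT p
      ON p.GENERATING_PLANT_KEY = f.GENERATING_PLANT_KEY
    GROUP BY p.PLANT_NAME
    ORDER BY p.PLANT_NAME
""").fetchall()
```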
What do we have?
AO_CDC_DW.F_CURRENT_DAY_PLANT_OUTPUT
AO_CDC_DW.F_PLANT_OUTPUT
AO_CDC_OP.PLANT_OUTPUT
AO_CDC_DW.V_PLANT_OUTPUT
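One plausible reading of these object names is that `V_PLANT_OUTPUT` presents the historical fact table and the current-day fact table as a single result set. The slides do not show the view definition, so the `UNION ALL` below is an assumption. The sketch again uses Python's `sqlite3`, drops the `AO_CDC_DW` schema prefix, and uses hypothetical rows.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
fact_columns = (
    "OUTPUT_DAY_KEY INTEGER, OUTPUT_MINUTE_KEY INTEGER, "
    "GENERATING_PLANT_KEY INTEGER, OUTPUT_ACTUAL_QTY_IN_KWH INTEGER"
)
conn.executescript(f"""
CREATE TABLE F_PLANT_OUTPUT ({fact_columns});
CREATE TABLE F_CURRENT_DAY_PLANT_OUTPUT ({fact_columns});
-- Assumed definition: history plus the current day in one view.
CREATE VIEW V_PLANT_OUTPUT AS
    SELECT * FROM F_PLANT_OUTPUT
    UNION ALL
    SELECT * FROM F_CURRENT_DAY_PLANT_OUTPUT;
""")

# Hypothetical rows: day 1 is history, day 2 is still being loaded.
conn.executemany(
    "INSERT INTO F_PLANT_OUTPUT VALUES (?, ?, ?, ?)",
    [(1, 0, 1, 100), (1, 1, 1, 105)],
)
conn.execute("INSERT INTO F_CURRENT_DAY_PLANT_OUTPUT VALUES (2, 0, 1, 98)")

# Queries against the view see both tables without knowing about the split.
daily_totals = conn.execute("""
    SELECT OUTPUT_DAY_KEY, SUM(OUTPUT_ACTUAL_QTY_IN_KWH)
    FROM V_PLANT_OUTPUT
    GROUP BY OUTPUT_DAY_KEY
    ORDER BY OUTPUT_DAY_KEY
""").fetchall()
```

Under this assumption, frequent loads touch only the small current-day table, while reports keep querying one view.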
First
Continuous Feed
Hourly
Every five minutes
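The feed frequencies listed above trade load effort against data latency. As a rough illustration, the sketch below computes when a change would first be visible in the warehouse. It assumes loads fire exactly on interval boundaries and take no time to run; the helper `visible_at` and the sample timestamp are hypothetical.

```python
import math


def visible_at(change_time_s: float, interval_s: int) -> int:
    """Return the first batch boundary at or after the change (in seconds)."""
    return math.ceil(change_time_s / interval_s) * interval_s


FIVE_MINUTES = 5 * 60
HOURLY = 60 * 60

# A change committed 61 seconds after midnight.
change = 61
delay_five_min = visible_at(change, FIVE_MINUTES) - change
delay_hourly = visible_at(change, HOURLY) - change
```

A continuous feed corresponds to the interval shrinking toward zero, so the delay is bounded by processing time rather than by the schedule.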
Next Step
Summary
Michael Brey
Principal Member Technical Staff
ST/NEDC Oracle Engineering
Oracle Corporation
Overview
Benchmark Description
System Description
Database Parameters
Performance Data
The Benchmark
System Info
SunFire 4800
A standard shared-memory multiprocessor (SMP)
8 900-MHz CPUs
16 GB physical memory
Solaris 5.8
Database storage: striped across 8 Sun
StorEdge T3 arrays (9 x 36.4 GB each)
Database Parameters
Parallel_max_servers 20
Streams_pool_size 400M (default: 10% of shared pool)
Shared_pool_size 600M
Buffer cache 128M
Redo buffers 4M
Processes 600
Mode | Sync | Async HotLog | Async AutoLog
Available | Oracle 9i | Oracle 10g | Oracle 10g
Source system cost | System resources | System resources | Minimal
Part of txn | YES | NO | NO
Changes seen | Real time | Near real time | Variable
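The comparison above can be held as plain data and filtered when choosing a capture mode. This is only an illustrative encoding: the dictionary layout and the helper `modes_outside_transaction` are hypothetical, and the name "Sync" for the first column is an assumption, since the extracted header shows only the two asynchronous modes.

```python
from typing import Dict, List

# Values transcribed from the comparison; "Sync" is an assumed column name.
CDC_MODES: Dict[str, Dict[str, object]] = {
    "Sync": {
        "available": "Oracle 9i",
        "source_cost": "System resources",
        "part_of_txn": True,
        "changes_seen": "Real time",
    },
    "Async HotLog": {
        "available": "Oracle 10g",
        "source_cost": "System resources",
        "part_of_txn": False,
        "changes_seen": "Near real time",
    },
    "Async AutoLog": {
        "available": "Oracle 10g",
        "source_cost": "Minimal",
        "part_of_txn": False,
        "changes_seen": "Variable",
    },
}


def modes_outside_transaction(modes: Dict[str, Dict[str, object]]) -> List[str]:
    """List modes whose capture work is not part of the source transaction."""
    return [name for name, props in modes.items() if not props["part_of_txn"]]


candidates = modes_outside_transaction(CDC_MODES)
```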
Systems
Tests
Approximately 10%, 50%, and 100% of all OLTP tables with DML
operations were included in CDC
CPU Consumption
Supplemental Logging
[Chart: CPU usage (#CPUs) over time (s); series: no CDC, no CDC w/ suppl]
CPU Consumption
10% DML Change tracking
[Chart: CPU usage (#CPUs) over time (s); series: no CDC w/suppl, CDC 10%]
CPU Consumption
50% DML Change tracking
[Chart: CPU usage (#CPUs) over time (s); series: no CDC w/suppl, CDC 50%]
CPU Consumption
10%, 100% DML Change tracking
[Chart: CPU usage (#CPUs) over time (s); series: no CDC w/suppl, CDC 10%, CDC 100%]
Summary
Q&A
Next Steps.
Data Warehousing DB Sessions
Monday
Tuesday
11:00 AM: #40153, Room 304
8:30 AM: #40125, Room 130
3:30 PM: #40176, Room 303
3:30 PM: #40177, Room 303
4:00 PM: #40166, Room 130
5:00 PM: #40043, Room 104
Thursday
8:30 AM: #40179, Room 304
11:00 AM: #36782, Room 304
Oracle OLAP
1:00 PM: #40150, Room 102
Reminder: please complete the OracleWorld online session survey.
Thank you.