Академический Документы
Профессиональный Документы
Культура Документы
Corporate Performance
Management
Business Process
Management
Business Activity
Monitoring
Data Integration
Migration
Data
Warehousing
Data
Synchronization
Master Data
Management
-----
Federation
Real Time
Messaging
-----
HAVE
Data in Disparate Sources
---------------
---
Legacy
ERP
---------------
CRM
---------------
Best-of-breed Applications
3
Sync
Async
Batch
Source
Applications
Extract
Transform
Load
Target Data
Warehouse
Services that operate on data are not easily reusable in other contexts
ETL Services and Processes are insecure and hard to monitor (i.e.
no SLA)
5
Challenges
In Data Integration
CHALLENGE
1.
2.
Non-integrated integration
3.
4.
DIFFERENTIATOR
1
2
3
4
Best Performance
Heterogeneous E-LT
Productivity
Declarative Design
Real-time Integration
Declarative CDC
Hot-Pluggable
Knowledge Modules
Future Proof
CHALLENGE
EMERGING SOLUTION
1.
2.
Non-integrated integration
Convergence of integration
solutions
3.
4.
Shift to pattern-driven
development
11
E-LT Architecture
High Performance
Conventional ETL Architecture
Extract
Transform
Load
Benefits
Transform
E-LT
Transform
Extract
Load
11
Technical Detail
Traditional E-T-L
S1
Target 1
S2
ETL DB
S3
-------------
Repository
Staging tables
11
11
Technical Detail
Target 1
S1
Extract
Load
Transform
S2
Staging tables
S3
-------------
ODI Agent
ODI Designer
12
22
Active Integration
Batch, Event-based, and Service-oriented Integration
Service Conductor
Event-oriented
Integration
Service-oriented
Integration
Metadata
Declarative Design
Data-oriented Integration
Data Conductor
Benefits
Enables real-time data warehousing and operational data hubs
Services plug into Oracle SOA Suite for comprehensive integration
13
33
Declarative Design
Developer Productivity
Define
Benefits
Significantly reduce the learning curve
Shorter implementation times
Streamline access to non-IT pros
Automatically
Generate
What
Dataflow
You Want
14
44
Pluggable Architecture
Reverse
Engineer Metadata
Journalize
Read from CDC
Source
Load
From Sources to
Staging
Check
Constraints before
Load
Integrate
Transform and Move
to Targets
Service
Expose Data and
Transformation
Services
Reverse
WS
WS
WS
Staging Tables
Load
CDC
Sources
Journalize
Integrate
Check
Services
Target Tables
Benefits
44
Knowledge Modules
Hot-Pluggable: Modular, Flexible, Extensible
Journalize
Read from CDC
Source
Load
From Sources to
Staging
Check
Constraints before
Load
Integrate
Transform and Move
to Targets
Service
Expose Data and
Transformation
Services
Reverse
WS
WS
WS
Staging Tables
Load
Integrate
CDC
Sources
Target Tables
Check
Journalize
Services
Error Tables
Log Miner
SQL Server
Triggers
DB2 Journals
Oracle
DBLink
DB2 Exp/Imp
JMS Queues
Oracle
SQL*Loader
Check MS
Excel
Check
Sybase
TPump/
Multiload
Type II SCD
Oracle Merge
Siebel EIM
Schema
Oracle Web
Services
DB2 Web
Services
Benefits
44
Generic SQL DB
Oracle DB 9i
Oracle DB 10g
Oracle DB 10g XE
IBM DB2/400
IBM DB2/UDB
IBM Informix SE
IBM LDAP Server
MS SQL Server 2000
MS SQL Server 2005
MS SQL Server 2005 SE
MS Office Access 2000
MS Office Excel 2000
MS Active Directory
Sybase ASA 8.x & 9.x
Sybase IQ 12.x
Sonic MQ v7.0
Teradata V2R5.x
Teradata V2R6.x
18
Load
Transform
Capture Changes
Incremental Update
Data Integrity
Aggregate
Export
Cube
Data Warehouse
Cube
Cube
Metadata
Data Transformation
Analytics
Operational
-------------
Heterogeneous sources
and targets
Incremental load
Slowly changing
dimensions
Data integrity and
consistency
Changed data capture
Data lineage
Data Warehousing
19
Energy
Energy Banking
Banking
Retail
Retail
Customer
Customer Supplier
Supplier Employee
Employee Product
Product
Industry
.
. Solutions
Mfr
Mfr
Asset
Asset
MDM
.
. Applications
Other
Sources
SAP/R3
Golden
Master
Records
E-LT
Metadata
PeopleSoft
Oracle
EBS
Vertical Driven
Data Object Centric
Application Focus
Middleware Foundation
Process Orchestration
Business Intelligence
Registry & Policies
Data Integration & Quality
Siebel
CRM
20
Interactive
Dashboards
Publisher
Delivers
Oracle BI
Enterprise Data
Warehouse
Bulk E-LT
Oracle Data Integrator
E-LT Agent
Other
Sources
SAP/R3
E-LT
Metadata
PeopleSoft
Oracle
EBS
Siebel
CRM
21
Web Services
Manager
Declarative Rules
Engine
Enterprise Service
Bus
E-LT
Metadata
Bulk Data
Processing
22
Enterprise Service
Bus
Report Cache
Agent
CDC
Data
Warehouse
PeopleSoft
SAP/R3
Message
Queues
Metadata
Oracle BI
Invoke
Invoke
Dashboards, Reporting,
Analysis, Publishing
Invoke
BPEL Process
Manager
Invoke
Transformation
Data Services
Services
E-LT Agent
Knowledge
Modules
Oracle BAM
Invoke
Metadata
Repository
Active
Data Cache
WSDL
Generate Data
Services
Service as
Data Source
High speed
Batch ELT
High speed
JMS ELT
CDC based
ELT
Oracle JMS
XML
Oracle BI
Enterprise Data
Warehouse
CDC
24
Performance
25
Recommended
Considered
Can use
26
Performance Report
27
e
lif ios
l
a ar
Re cen
S
Message by
Message
Mini Batches
Large Volume
(over 1M)
Data Volume
Processing
28
Depends on whether an
intermediary XML format
is useful for other
processing (use ESB),
or if joining File data to
tabular RDB data is
required (use ODI)
u
(so
rce
u
(so
)
rce
DB
XML
ESB
ESB
ESB
File
ESB
ESB
depends
DB
ESB
depends
ODI
XML
File
DB
XML
depends
depends
ODI
File
depends
ODI
ODI
DB
ODI
ODI
ODI
XML
File
DB
XML
depends
ODI
ODI
File
ODI
ODI
ODI
DB
ODI
ODI
ODI
u
(so
)
rce
t)
File
Between 10-50MB
Depends on ho much
cross-referencing
among the data values
and rows is required
during transformation
the more there is, the
faster ODI will perform
relative to ESB
e
rg
(ta
XML
e
rg
(ta
e
rg
(ta
t)
t)
*caveat always benchmark if you are unsure and require best possible results
29
Oracle 10.2+/Linux
ODI Designer
Data Synchronization
Oracle 10.2+/Win
Hardware: Dual Core/2 GB RAM
Repositories
Agent
Performance Results
100k rows, 15 fields
Load: LKM DBLink 3s
Real-time synchronization (JKM DBLink)
Update 65k: 13s
Delete 30k: 8s
Summary
35
1 Best Performance
2 Productivity
DIFFERENTIATOR
Heterogeneous E-LT
Declarative Design
Real-time
3 Integration
Declarative CDC
4 Hot-Pluggable
Knowledge Modules
5 Future Proof
Reference Customers
37
Customer: Overstock.com
Solution: High-Volume Real-Time Data Transformation
Technology: Oracle Data Integrator, Oracle 9i & 10g RAC,
Dell Linux, IBM AIX, Teradata 8-node 54000
Oracle Data Integrator Solution:
Business Problem:
GoldenGate TDM
Transactional Management
Platforms:
IBM AIX, Dell Linux
38
Business Problem:
Flat Files
For more than 40 years, Sabre Holdings (NYSE: TSG) has transformed the airline industry
through technological advancement, the Company offers a portfolio of travel marketing,
distribution and technology solutions.
39
Customer: DHL
Solution: High-Volume Real-Time Data Transformation
Technology: Oracle Data Integrator, Oracle RDBMSs,
Teradata Data Warehouse, Cobol Flat Files
Oracle Data Integrator Solution:
Solution Architecture:
Business Problem:
Company: DHL
Flat Files
Platforms:
Linux, Cobol
For more than 35 years, DHL has built the world's premier global delivery network by
trailblazing express shipping in one country after another. Over 220 countries and territories
later, DHL is the global market leader of the international express and logistics industry.
40
Customer: iBasis
Solution: High-Volume Real-Time Data Transformation
Technology: Oracle Data Integrator, Oracle 10g, Netezza
PowerCenter NPS8350 Warehouse Appliance
Oracle Data Integrator Solution:
Business Problem:
Company: iBasis
Flat Files
Applications (future):
Call Billing, Network Monitoring
Founded in 1996, iBasis (NASDAQ: IBAS) is one of the largest carriers of international voice
traffic in the world and a leading provider of prepaid calling services.
41
Analysts Coverage
42
Gartner
Sunopsis (Oracle) has made strides in building
market awareness beyond its base in Europe.
Sunopsis has a range of capabilities, spanning ETL
and real-time messaging, and an architecture that
enables distribution of transformation workload
across data sources and targets.
Ted Friedman, Bill Gassman,
Magic Quadrant for Extraction, Transformation and Loading, 1H05,
May 11, 2005
43
Bloor Research
While there are many relatively young
vendors within the ETL market, Sunopsis has
undoubtedly made the biggest impression,
both in terms of the users that it has gained
and in the way that its approach has
influenced the market.
Philip Howard,
Bullseye Report - Extract, Transform & Load,
March 28, 2006
44
Gartner
By purchasing Sunopsis, Oracle has acquired a server-independent and
platform-independent data integration tool, which will be renamed Oracle Data
Integrator (ODI). OFM and Oracle Applications customers will welcome the
addition of the ODI's database independence. In particular, the acquisition could
provide needed new momentum for Fusion Middleware. Fusion Middleware
customers have heterogeneous IT environments, as do former PeopleSoft,
Siebel Systems and JD Edwards customers, who have an ongoing requirement
for integration with non- Oracle systems. The acquisition will provide OFM with
a data integration tool that is capable of deploying small-grained data services
within a service-oriented architecture (SOA) environment. This capability could
have a positive influence on Fusion Middleware - if Oracle leverages the
Sunopsis philosophy.
Mark A. Beyer, Ted Friedman
Sunopsis Data Integration May Fuel Oracle Fusion Middleware
October 23, 2006
45
Forrester Research
Oracle has recognized that its customers require
diverse data integration features without having to
integrate and manage products from many vendors.
Integrating Sunopsis heterogeneous extract, load,
transform (ELT) and event-driven CDC capabilities
within its middleware offerings is a great start.
Rob Karel
Oracle Makes Serious Move In Data Heterogeneity by Acquiring
Sunopsis
October 29, 2006
46