Академический Документы
Профессиональный Документы
Культура Документы
This document contains Confidential, Proprietary and Trade Secret Information (Confidential Information) of
Informatica Corporation and may not be copied, distributed, duplicated, or otherwise reproduced in any manner
without the prior written consent of Informatica.
While every attempt has been made to ensure that the information in this document is accurate and complete,
some typographical errors or technical inaccuracies may exist. Informatica does not accept responsibility for any
kind of loss resulting from the use of information contained in this document. The information contained in this
document is subject to change without notice.
The incorporation of the product attributes discussed in these materials into any release or upgrade of any
Informatica software productas well as the timing of any such release or upgradeis at the sole discretion of
Informatica.
Protected by one or more of the following U.S. Patents: 6,032,158; 5,794,246; 6,014,670; 6,339,775;
6,044,374; 6,208,990; 6,208,990; 6,850,947; 6,895,471; or by the following pending U.S. Patents:
09/644,280; 10/966,046; 10/727,700.
This edition published January 2006
White Paper
Table of Contents
Executive Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .2
Data Migration Challenges in SAP Implementations . . . . . . . . . . . . . . . . . .4
Identifying and Analyzing Source Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .4
Accessing Source Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .5
Addressing Data Quality of Legacy Applications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .6
Preparing and Loading Data into SAP . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .7
Supporting the Data Migration Lifecycle . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .9
Informatica and SAP: Working Together for Joint Customer Success . . . .18
Powered By NetWeaver Certification . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .18
Master Relationship Agreement . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .19
Track Record of Joint Customer Success . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .19
Executive Summary
Forrester Research recently conducted a survey showing investment in enterprise resource
applications (ERP) and enterprise applications in general remains the top IT spending priority for
2005. A major driver for many large companies is regulatory compliance initiatives, such as
Sarbanes-Oxley for public companies, Basel II in the banking industry, and FDA Part 11 for
biotechnology and pharmaceutical organizations.1
Other business drivers behind the decision to purchase and implement ERP applications like SAP
include:
Retirement of legacy systems to support new regulatory requirements such as Sarbanes-Oxley,
Basel II, and the Patriot Act
Standardization of enterprise business applications to support and unify business processes
that are changing and evolving
Need for providing accountability through operational transparency
Upgrade or consolidation of existing SAP systems due to new functionality or a company
merger or acquisition
Creation of a single view of the customer to cut order costs and increase customer satisfaction
ERP applications like SAP have an impact on business processes; therefore, the decision to
purchase and implement them is approached with considerable due diligence. While the vendor
selection and implementation decision gets much of the spotlight, there are critical project
phases upon which the success of an SAP implementation hinges. These project phases are not
always considered with the same degree of detail as the purchase and implementation decision.
Examples of such project phases include:
Business process reengineering. Software that touches and drives processes across the
enterprise cannot simply be installed and turned on. Current as is processes must be
understood and methodically mapped to the new SAP system and its to be capabilities.
Invariably, gaps are uncovered during business process reengineering which must be
considered and planned for.
Change management and user adoption. SAP implementations cannot rely on an if we build
it, they will come approach. The success of a new business application is ultimately
measured by its adoption by business users. Careful consideration must be given to executive
sponsorship and to business as well as technical user training.
Hamerman, Paul and R Ray Wang. ERP ApplicationsThe Technology And Industry Battle Heats Up.
Forrester Market Overview, June 9, 2005
White Paper
Research has shown that software implementations are put at risk when data migration is not
thoroughly considered and planned. According to recent research, more than 80 percent of
software implementation projects fail or overrun their budgets and schedules. Of the projects
that are overrun, half exceed timescales by 75 percent and two-thirds exceed the overall project
budgets. A major reason why these failure rates are so high is because data migration is
considered a minor, one-time event during the overall implementation.
The successful implementation of mission-critical SAP enterprise applications requires a missioncritical approach to data migration.
This white paper examines five common data migration challenges associated with SAP
implementations. It discusses the value of using a single, unified enterprise data integration
platform to address these challenges.
Figure 1: True Context of Data Revealed Only with Proper Analysis of Data
White Paper
This means the migration team is faced with the reality of data extraction across 50 sources.
Given the volume of source systems, the Migrating Chart of Accounts line item in the project
plan does not accurately reflect the amount of data migration work required. It is not a single
data migration unit of work. Moreover, the basic count of sourcing requirements does not take
into consideration the joining and reconciling of data between at the source application.
The diversity and volume of source systems can complicate and jeopardize the overall success of
the migration.
White Paper
Stored. Storing data across multiple business applications often leads to redundant and
inconsistent data. For example, various attributes of customer master entity information are
frequently stored in multiple business applications, such as customer relationship
management, sales force automation, and sales and marketing applications. Customer
information, such as names, titles, addresses, and purchase history, may be stored in different
formats or duplicated across different systems, preventing a single view of the customer.
Data migration teams need to understand and accept that there may be dirty data. To address
data quality issues when migrating legacy applications into SAP, data migration teams should
consider the datas:
Existence. Does the required data for the SAP solution exist? Does it exist within the
enterprise, or possibly in a partners or outsourcing vendors environment? If it doesnt exist,
what is the business rule to populate the required information in SAP?
Validity. Do data values fall within an acceptable range or domain? For example, if the legacy
applications have 73 U.S. state codes instead of 50, is this valid?
Consistency. Is the same data stored in multiple applications in a common format? For
example, is John Doe from Company XYZ the same as Mr. Jon Doe from the same
company?
Timeliness. Is the data that is required to support the SAP business processes available at the
optimal time?
Accuracy. Does the data correctly describe the properties of the object it is meant to model?
Relevance. Does the data meet and support the SAP business processes?
Data migration project teams commonly leverage custom code to support the data conversion
process required to address data quality issues. Custom code can initially offer some degree of
flexibility. However, as the number and complexity of integration touch points increase, custom
coding limitations in scale and maintenance are exposed.
SAP Interface
Technique
Read
Write
Description
SAP certified interface tailored for SAP data migration
Data Migration
Interface
(DMI)
Batch Input
Processing
Direct Input
Processing
Intermediate
Document
(IDOC)
Business
Application
Programming
Interface (BAPI)
Computer Aided
Testing Tool
(CATT)
Legacy System
Migration
Workbench
(LSMW)
White Paper
In most cases, if just a portion of the data being loaded into SAP does not pass the SAP
application validation, then SAP will reject the entire record. Examples of data validation
performed at the SAP application layer include:
Syntactical. Is the field length and data type of the material master number valid?
Semantic. What is the context of the data? Does this number identify a customer or vendor?
Structural. Does the purchase order header and line item meet proper parent/child
relationships or cardinality rules?
Dependency. Is this bill of material valid even if one of the referenced material master records
has not yet been created in SAP?
Data auditing has an enormous impact on data migration projects. Consider this example: A
developer is responsible for converting material master data. Despite assurance that a
conversion program reconciled and consolidated legacy material master data, an audit report
shows that the data loaded into SAP still contains duplicate material master data. Identifying
problems caused by the legacy extraction or the SAP conversion program requires cumbersome
and time-consuming code review sessions and exhaustive testing.
Often migration teams try to address auditing requirements by creating custom reports or spotchecking the data being migrated into SAP. These approaches are incomplete and do not identify
the root cause of any issues with the data conversion logic.
This section examines the role of a single, unified enterprise data integration platform in
addressing the challenges associated with migrating data into SAP.
The solution to address the challenges of migrating all enterprise data into an SAP solution is
Informatica PowerCenter. PowerCenter is a single, unified enterprise data integration platform
that enables companies and government organizations of all sizes to access, discover, and
integrate data from virtually any business system, in any format, and deliver that data throughout
the enterprise at any speed.
PowerCenter provides powerful capabilities to help overcome data migration challenges
associated with SAP implementations. These capabilities include:
Data profiling capabilities for identifying and analyzing source data
Universal data access capabilities for accessing source data
Built-in transformation and correction capabilities for addressing the quality of data in legacy
applications
Certified connectivity to SAP to prepare and load data into SAP
Single, unified data integration platform to support the data migration lifecycle
10
White Paper
Analysis
10%
Test
30%
Build
60%
Analysis
Test
30%
Analysis
40%
Build
Test
Build
30%
Figure 3: Proactive Analysis of Source Data Saves Both Time and Money
PowerCenters data profiling capabilities provide comprehensive, accurate information about the
content, quality, and structure of data in virtually any operational system. Organizations can
automatically assess the initial and ongoing quality of data regardless of its location or type.
With its comprehensive data profiling capabilities, PowerCenter:
Reduces data quality assessment time with easy-to-use wizards and pre-built metric-driven
reports that comprise a single interface for the entire profiling process
Addresses ongoing data quality in legacy applications with Web-based dashboards and
reports that illustrate changes in data content, quality, structure, and values over time
Ensures end user data confidence by automatically and accurately profiling any data
accessible to PowerCentervirtually any and all enterprise data formats
Figure 4 shows an example of a PowerCenter data profiling report. The report shows how
PowerCenter automatically infers the primary and foreign key relationships across three tables in
a legacy application.
Figure 4: PowerCenter Profiling Report Inferring Primary Key and Foreign Key Relationships between
Multiple Legacy Application Data Sources
11
PowerCenters data profiling capabilities enable migration teams to do much more thorough
analysis than manual profiling of the legacy systems. The platform provides the tools to
automatically scan all records across all columns and tables in a source system and dynamically
generate reports that make it easy to understand the true state of the data. These reports help
the migration teams help migration teams determine if the legacy data has quality issues and
how to properly address them.
Data Sources
Relational databases
89%
Flat files
81%
Mainframe/legacy systems
65%
Packaged application
39%
15%
EAI/messaging software
12%
Web
15%
XML
15%
Other
4%
20
40
60
80
100
2
3
12
Eckerson, Wayne and Colin White. Evaluating ETL and Data Integration Platforms. TDWI Report Series, 2003
Ibid
White Paper
PowerCenter provides universal data access, allowing the data migration team to source virtually
any and all enterprise data formats, including:
Mainframe data
Structured data
Unstructured data (e.g., Microsoft Word documents and Excel spreadsheets, email, binary
files, .pdf files, etc.)
Semi-structured data (e.g., industry-specific formats such as HL7, ACORD, FIXML, SWIFT, etc.)
Relational data (e.g., DB2, Oracle, Microsoft SQL Server, etc.)
ERP (e.g., SAP, PeopleSoft, Siebel, etc.) and file data
Message queues (e.g., Tibco, IBM MQ Series, JMS, MS MQ, etc.)
Figure 6 shows the breadth of PowerCenters data access capabilities.
Enterprise
Software Sources
Unstructured Data
PDF Word Excel
Vertical Standards
(e.g., HL7, SWIFT, ACORD)
Print Stream BLOBs
Any proprietary data
format/standard
Informatica
PowerCenter
Remote or Outsourced
Business Applications
With PowerCenter, SAP data migration teams can source directly from a mainframe application
as if it were a relational database. PowerCenters data access capabilities offer SAP migration
teams the flexibility to source these softer forms of data which traditionally would be left up to
manually interpretation and processingor worse, left unaccounted for in the migration process.
The flexibility to access all types of enterprise data in a single data integration platform offers
significant advantages over hand-coded data migration approaches, including:
Increased productivity. With the ability to centralize data access and management,
PowerCenter frees data migration teams from having to maintain and be dependent on a
cumbersome, time-consuming process where programs are developed to extract and stage
data for each source of legacy data.
Reduced risk. Sources of data for SAP implementations tend to be dynamic. Extracting data
from a client/server-based legacy application today does not insulate the team from future
requirementsfor example, having to migrate over mainframe and mid-range applications from
applications resulting from a corporate merger or acquisition. PowerCenter reduces the risk of
both current and future data migration efforts by providing access to a broad range of
enterprise data formats.
13
White Paper
Figure 7 shows a simple PowerCenter data migration mapping in which the customer master
data from multiple mainframe sources are being sourced and prepared in a valid SAP DMI
format.
Note how a single PowerCenter transformation within the mapping replaces the traditionally
coding-intensive effort for preparing a well-formed and valid file ready for loading into SAP. All of
the detail shown in the DMI customer master object is entirely imported directly from the SAP
application layer.
15
The ability to access and manage metadata makes PowerCenter unique. When using
PowerCenter for SAP data migration, the SAP metadata is automatically interpreted within the
transformation to validate all legacy data. This means that as the legacy data flows through the
DMI transformation, it is validated against rules inherent within the SAP metadata such as:
Mandatory vs. option segments
Cardinality
Minimum and maximum occurrence of segments
Field level data type and precision requirements
SAP NetWeaver
people integration
Informatica
Multi-channel access
Data cleansing
Mappings
Complex transformations
Application
integration
+
Businees
process
management
Data
integration
Business Process
User
Interaction
Business
portal
+
Collaboration
Portal
Collaboration
information integration
Bus. intelligence
Knowledge mgmt.
process integration
Integration
broker
Business
Process mgmt.
Lifecycle Management
PowerExchange
Application
Data
PowerCenter
application platform
J2EE
ABAP
DB and OS abstraction
Figure 8: Complementary Data Integration between SAP NetWeaver and Informatica PowerCenter
PowerCenter can access all non-SAP types of enterprise and legacy application data. It also
provides the application integration to prepare and move all non-SAP data in a bi-directional
manner with SAP.
SAP NetWeaver enables transactional based data integration requirements with the capabilities
offered with SAP XI. SAP XI also orchestrates all data integration related to business process
integration requirements, including SAP-to-SAP data integration (e.g., mySAP ERP to my SAP
APO). Finally, from an end user perspective, SAP Portals provides a proven front-end platform for
business intelligence and visibility across the enterprise.
White, Colin. Data Integration: using ETL, EAI, and EII Tools to Create an Integrated Enterprise TDWI Report
Series, November 2005
16
White Paper
FIREWALL
XML, Messaging,
and Web Services
Reusability/Team
Productivity
Analyze/
Profile
Extract/
Transform
Validate/
Lead
Packaged
Applications
Iterate
Relational and
Flat Files
Access source
systems/data
Access target/data
Execute
Migration
Target Application
9
Synchronize
Mainframe and
Midrange
10
Audit/Lineage
PowerCenters metadata management capabilities provide visibility across the entire data
migration processfrom sourcing legacy applications and cleansing the legacy data, to preparing
it in the format required for upload by SAP. PowerCenter enables data lineage problems to be
traced at a metadata level.
Eckerson, Wayne and Colin White. Evaluating ETL and Data Integration Platforms. TDWI Report Series, 2003
17
Figure 10 shows a PowerCenter data lineage diagram spanning multiple data migration
mappings, as well as each component responsible for sourcing, converting or targeting the
required data for SAP. This diagram shows the flow of migration logic. Once the suspect area of
data migration logic has been identified, users can drill into the object and make the appropriate
changes to the relevant data mapping object.
Figure 10: PowerCenter Data Lineage Diagram enables tracking and auditing of end-to-end migration from legacy applications to SAP
PowerCenter helps data migration teams trace and prove how data has been converted and
moved. The enhanced data visibility and tracking helps organizations comply with reporting
requirements. These capabilities also help with user adoption, instilling new SAP application
users with confidence that legacy application data has in fact been converted and moved.
Furthermore, PowerCenter alleviates the politics associated with data migration projects. Data
migration activities, whether related to legacy applications or the target SAP application, can be
centralized within a single, unified data integration platform. This promotes effective and
productive communication between legacy and SAP resources, and between technical and
functional resources.
18
White Paper
19
20
White Paper
The Data Migration Readiness Assessment is designed to help any SAP customer understanding
the challenges and risk in a data migration project:
1. Identify data risks early
2. Scope and plan migrations effectively
3. Deliver SAP implementation on time, on budget, and in scope
Figure 11 shows how the Data Migration Readiness Assessment works.
2
Identify candidate sources
Source
System
1
1
3
Source
System
2
SAP
Stage
Legacy
Stage
Source
System
3
SAP
4
Create mappings to SAP
Identify risks in mapping
Source
System
4
Figure 11: The Data Migration Readiness Assessment Jump Starts Data Migration Projects
21