Академический Документы
Профессиональный Документы
Культура Документы
Agenda
1. Overview of Cleansing Package Builder/Positioning 2. Targeted Personas 3. Impact of Cleansing Package Builder on Data Cleanse 4. Cleansing Package Builder Roles, Components, and Architecture 5. Cleansing Package Builder Requirements 6. Additional Cleansing Packages with Data Services 4.0 7. Cleansing Package Builder Workflow 8. Explore Cleansing Package Screen 9. Create a new Cleansing Package Wizard - Design Mode 10.Edit Existing Cleansing packages 11.Use Advanced Mode 12.Publish a Cleansing Package 13.Export and Import from LCM
A tool that creates a Cleansing Package that Data Services Data Cleanse transform uses parses, standardizes and cleanse business data
Such as account numbers, product codes, product descriptions, purchase dates, part numbers, SKUs, and so on.
2012 Utopia, Inc. All Rights Reserved.
Provides user interface that allows Data Steward to visualize how their data is parsed and standardized, and evaluate the impact of their customized changes
The primary drivers behind the creation of Cleansing Package Builder were:
Empower the data steward/subject matter expert to develop a data cleansing solution
2012 Utopia, Inc. All Rights Reserved.
Easily and quickly develop new data cleansing solutions for data domains SAP does
not provide out of the box
The data steward provides insight about how the data should be classified simply based on the
desired output Cleansing Package Builder automatically creates the data dictionary, rules, and patterns that make up a cleansing package, which is then consumed by Data Services Data Cleanse transforms
Parsed Output Product Category Size Material Glove Large Synthetic Leather Pro-Fit 2.3 Series Elastic Velcro Ultra-Grip
2012 Utopia, Inc. All Rights Reserved.
Input Data
Trademark
Glove ultra grip profit 2.3 large black synthetic leather elastic with Velcro Mechanix Wear
Color
Black
Vendor
Standard Description
Mechanix Wear
Glove Synthetic Leather, Black, size: Large, Cuff Style: Elastic Velcro, Ultra-Grip, Mechanix Wear
Target Personas
Line of Business wants a friendly environment to collaborate with IT Know what the data should look like
Business users have different expectations for user experience than IT Business users do not want to learn programming or scripting languages Business users want direct comparison of before and after data
Data Cleanse: Removal of Dictionary Menu Option Removal of Dictionary menu option in Data Services (DS) Designer:
Search Creating or deleting a dictionary Adding, editing, and deleting an Entry, Output, or Classification
Data Cleanse: Removal of Data Cleanse Tab in the View Data Removal of Data Cleanse tab in the View Data from Writer transform
BDB
Data Services
2012 Utopia, Inc. All Rights Reserved.
Sample
Sample Data
Source Data
Target Data
CMS Repository
BDB: Berkley Database (used Data Cleanse) BI: Business Intelligence CMS: Central Management Server CPB: Cleansing Package Builder DC: Data Cleanse
DQ: Data Quality FRS: File Repository Server IS: Information Steward UDC: Universal Data Cleanse
BDB
Data Services
2012 Utopia, Inc. All Rights Reserved.
Sample
Sample Data
Source Data
Target Data
CMS Repository
DC verifier will query the BI Platform to get the list of published Cleansing Package names (BOE InfoObjects). During runtime of the DS job DC will download the required BDB files (delta or full) from FRS: DC queries user specified publish CP name to retrieve version, number of files, and so on Checks the file system to see if the BDB file is already downloaded or not If the BDB file does not exist, it will download full BDB file (LE or BE depending on the OS) If the BDB file exist, then it will compare the published version string and download the delta file to synch
Cleansing Package Builder Requirements Cleansing Package Builder 4.0 requirements include (following IS requirements):
Browser and version
Internet Explorer 7 and 8 with Flash Player 9.0 or 10.0
Windows Server 2003 SP1 64-bit, SP2 64-bit, R2 64-bit (SP2) Windows Server 2008 SP1 64-bit, SP2 64-bit, R2 64-bit Solaris 10 (SPARC) 64-bit AIX 5.3 (p-series) 64-bit, AIX 6.1 (p-series) 64-bit HP-Itanium v11.31 64-bit Linux (RedHat 5) 64-bit Linux (Suse 10) 64-bit, (Suse 11) 64-bit SAP NW 7.2 WebLogic 9.2, 10 and 10.3 WebSphere 6.1 and 7 Tomcat 6.0 JBoss 4.2.3 and 5.0 WACS (aka Bobcat, BOBJs Tomcat)
Cleansing Package Builder Workflow: Set Up Data Cleanse Job in Data Services
Open Data Cleanse transform in Data Services
Options tab
Select Published CP from dropdown box
Contains data Contains rules
Cleansing Package Builder Workflow: Maintain Cleansing Packages Maintain Cleansing Package
Verify output data Change CP
Import more sample data
Create a job that includes the Data Cleanse transform Configure the transform option to refer to the cleansing package Run the job to cleanse your data
Save As
My Cleansing Package
Make a copy
Cleansing Package Task Screen: Migrate Data Cleanse 3.2 Dictionary and Rule Files Import Data Cleanse 3.2 Dictionary and Rule Files
More menu option
Custom Package
Saves the data entered in a normalized form. There are full-width and half-width Latin characters and the normalized form will be saved.
Select language to return suggested attributes, standard forms and variations to help build cleansing package.
Custom Package
Custom Package
Custom Package
Custom Package
Create a New Cleansing Package: Define Attributes, Standard Forms, and Variations
Custom Package
Edit an Existing Cleansing Package: Add Values to Standard Forms and Variations Custom Cleansing Package Design Screen Add Drag and Drop Import list Manually add
2012 Utopia, Inc. All Rights Reserved.
Edit an Existing Cleansing Package: Use Suggested Standard Forms and Variations Custom Cleansing Package Design Screen Use Suggestion list
Edit an Existing Cleansing Package: View Records Affected by Last User Action Custom Cleansing Package Design Screen Last User Action tab
Custom Cleansing Package Design mode Define context based on sample data
Generate Conflict
Edit an Existing Cleansing Package: Resolve Conflict - Wizard Custom Cleansing Package Design Screen Resolve Conflict
Edit an Existing Cleansing Package: Resolve Conflict Conflict Resolution Custom Cleansing Package Design Screen Resolve Conflict
Edit an Existing Cleansing Package: Add Additional Sample Rows Custom Cleansing Package Design Screen Add More Sample Rows
Delete an Row from the Input Sample Records Custom Cleansing Package Design Screen Delete a Input Row
Format Category
Format Category
Drag and Drop Attributes Remove Attributes Change Order Add Text
2012 Utopia, Inc. All Rights Reserved.
Advanced Mode
Modifying or creating a Person and Firm Cleansing Package will automatically open in Advanced mode, there is no Design mode option with Person and Firm Cleansing Package. Adding/modifying/deleting any data in Advanced mode will not automatically generate or change any rule files.
Advanced Mode: Manage Classifications and Entries Cleansing Package Advanced mode Manage Classifications and Entries
Cleansing Package Advanced mode Add/Edit/Delete Rule Auto Generated Rules Custom Rules
2012 Utopia, Inc. All Rights Reserved.
Advanced Mode: Rule Options Cleansing Package Advanced mode Rule - Options Rename Rule Note: Edit Description Modifying any of the rules, should only be done by an View History expert user. Changing the rules will affect how data is parsed and Create Copy Delete
standardized
Advanced Mode: Rules History Cleansing Package Advanced mode Rule - View History Revert Rule Pattern Definition Revert Rule Action
Advanced Mode: Edit Reference Data Cleansing Package Advanced mode Add/Edit/Delete Reference Data Phone, Email, User-defined
Advanced Mode: Edit Reference Data Cleansing Package Advanced mode Social Security Reference Data Import
Create a Data Cleanse ATL that can be used in Data Services Publish Cleansing Package Get Data Services ATL from More menu option Copy data in the text box and save with a .atl file extension
The atl will be a base atl based on the Cleansing Package settings including:
2012 Utopia, Inc. All Rights Reserved.
Questions
Thank you!