
Information Steward 4.

Cleansing Package Builder

Venkata Ramana Paidi

Agenda

1. Overview of Cleansing Package Builder/Positioning
2. Targeted Personas
3. Impact of Cleansing Package Builder on Data Cleanse
4. Cleansing Package Builder Roles, Components, and Architecture
5. Cleansing Package Builder Requirements
6. Additional Cleansing Packages with Data Services 4.0
7. Cleansing Package Builder Workflow
8. Explore Cleansing Package Screen
9. Create a New Cleansing Package Wizard - Design Mode
10. Edit Existing Cleansing Packages
11. Use Advanced Mode
12. Publish a Cleansing Package
13. Export and Import from LCM

2012 Utopia, Inc. All Rights Reserved.

Positioning: What is Cleansing Package Builder?

A tool that creates a cleansing package, which the Data Services Data Cleanse transform uses to parse, standardize, and cleanse business data such as account numbers, product codes, product descriptions, purchase dates, part numbers, and SKUs

Also creates cleansing packages that the Data Cleanse transform uses to parse, standardize, and cleanse party data such as names, firms, titles, emails, phone numbers, SSNs, and dates

Provides a user interface that lets data stewards visualize how their data is parsed and standardized, and evaluate the impact of their customized changes

Allows data stewards to customize standard forms based on the company's data and standards

Provides the ability to read and write Unicode data

Note: Cleansing Package Builder is delivered as part of Data Services, but is installed as a component of Information Steward

Positioning: Why Was Cleansing Package Builder Created?

The primary drivers behind the creation of Cleansing Package Builder were:

Allow the user to:

Easily and quickly develop new data cleansing solutions for data domains SAP does not provide out of the box, for example product data, pharmaceutical data, and financial data

Customize the cleansing packages SAP delivers to our customers

Empower the data steward/subject matter expert to develop a data cleansing solution:

The data steward provides insight about how the data should be classified, simply based on the desired output

Cleansing Package Builder automatically creates the data dictionary, rules, and patterns that make up a cleansing package, which is then consumed by Data Services Data Cleanse transforms

Positioning Cleansing Package Builder: Business Example

Input Data

Glove ultra grip profit 2.3 large black synthetic leather elastic with Velcro Mechanix Wear

Parsed Output

Product Category: Glove
Size: Large
Material: Synthetic Leather
Trademark: Pro-Fit 2.3 Series
Cuff Style: Elastic Velcro
Palm Type: Ultra-Grip
Color: Black
Vendor: Mechanix Wear

Standard Description

Glove Synthetic Leather, Black, size: Large, Cuff Style: Elastic Velcro, Ultra-Grip, Mechanix Wear
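
The parsing in this example can be pictured as a lookup from variations found in the input to attributes and their standard forms. The following is a minimal Python sketch of that idea, not the Data Cleanse engine itself: the `VARIATIONS` table, the `parse` function, and the longest-match-first strategy are all assumptions made for illustration.

```python
# Hypothetical variation -> (attribute, standard form) table.
# This is NOT the real cleansing package format; it only mirrors the slide.
VARIATIONS = {
    "glove": ("Product Category", "Glove"),
    "large": ("Size", "Large"),
    "synthetic leather": ("Material", "Synthetic Leather"),
    "profit 2.3": ("Trademark", "Pro-Fit 2.3 Series"),
    "elastic with velcro": ("Cuff Style", "Elastic Velcro"),
    "ultra grip": ("Palm Type", "Ultra-Grip"),
    "black": ("Color", "Black"),
    "mechanix wear": ("Vendor", "Mechanix Wear"),
}

# Longest variation, measured in whitespace-separated tokens.
MAX_TOKENS = max(len(key.split()) for key in VARIATIONS)


def parse(text):
    """Map an input string to {attribute: standard form} by longest match."""
    tokens = text.lower().split()
    parsed = {}
    i = 0
    while i < len(tokens):
        # Try the longest candidate phrase first, then shorter ones.
        for n in range(min(MAX_TOKENS, len(tokens) - i), 0, -1):
            phrase = " ".join(tokens[i:i + n])
            if phrase in VARIATIONS:
                attribute, standard_form = VARIATIONS[phrase]
                parsed[attribute] = standard_form
                i += n
                break
        else:
            i += 1  # token is not a known variation; skip it
    return parsed


INPUT = ("Glove ultra grip profit 2.3 large black synthetic leather "
         "elastic with Velcro Mechanix Wear")

for attribute, value in parse(INPUT).items():
    print(f"{attribute}: {value}")
```

In Cleansing Package Builder the data steward builds the equivalent of this table through the user interface rather than in code.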

Target Personas

Cleansing Package Builder is designed to target the following personas:


Data Stewards
Act as conduits between IT and the business portion of a company with both decision support and operational help
Line of Business wants a friendly environment to collaborate with IT

Subject-Matter / Domain Experts
Know what the data should look like
Business users have different expectations for user experience than IT
Business users do not want to learn programming or scripting languages
Business users want direct comparison of before and after data

Previous Cleansing Package Developers
Advanced mode

Cleansing Package Builder Customer Benefits

The major customer benefits of Cleansing Package Builder are:


Business-Oriented: Intuitive point-and-click and drag-and-drop user experience; no rules or languages to master
Data Agnostic: Set up to cleanse both product/operational and party data
Ease of Use: Wizard generates default starting points of attributes, data standards, and corresponding rules
Results Driven: Fine-tune through an iterative process based on actual output

Data Cleanse: Removal of Dictionary Menu Option

Removal of Dictionary menu option in Data Services (DS) Designer:
Search
Creating or deleting a dictionary
Adding, editing, and deleting an Entry, Output, or Classification

Data Cleanse: Transform Options Tab


Data Cleanse Transform - Options tab: Reference files and parsing dictionary all combined into one parameter
Cleansing package will include:
Dictionary data information/Parsing Dictionary
Reference files (rule, email, international phone, social security file, and user-defined pattern files)

Data Services 3.2:

Data Services 4.0:

Data Cleanse: Removal of Data Cleanse Tab in the View Data

Removal of Data Cleanse tab in the View Data from Writer transform

Cleansing Package Builder Integration with Data Services Engine


[Diagram. Components shown: BI Platform with Information Steward, Cleansing Package Builder, the CMS Repository, and the FRS (SQLite and BDB files; includes rules, reference data, output fields); Data Services with Data Cleanse / Universal Data Cleanse and BDB. Data shown: Sample Data, Source Data, Target Data.]

BDB: Berkeley Database (used by Data Cleanse)
BI: Business Intelligence
CMS: Central Management Server
CPB: Cleansing Package Builder
DC: Data Cleanse
DQ: Data Quality
FRS: File Repository Server
IS: Information Steward
UDC: Universal Data Cleanse

Cleansing Package Builder Integration with Data Services Engine



DC verifier will query the BI Platform to get the list of published cleansing package names (BOE InfoObjects).
During runtime of the DS job, DC will download the required BDB files (delta or full) from FRS:
DC queries the user-specified published CP name to retrieve version, number of files, and so on
Checks the file system to see whether the BDB file is already downloaded
If the BDB file does not exist, it downloads the full BDB file (LE or BE depending on the OS)
If the BDB file exists, it compares the published version string and downloads the delta file to sync
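
The runtime check described above can be sketched roughly as follows. This is an illustrative Python outline only: the function names, the file naming scheme, and the use of the machine's byte order to choose LE or BE are assumptions. The slide also does not say what happens when the local and published versions already match; the sketch assumes nothing is downloaded in that case.

```python
import sys
from pathlib import Path


def byte_order_suffix():
    # Assumption: the LE/BE variant follows the machine's byte order.
    return "LE" if sys.byteorder == "little" else "BE"


def local_bdb_path(cache_dir, package_name):
    # Hypothetical naming scheme for the locally downloaded BDB file.
    return Path(cache_dir) / f"{package_name}_{byte_order_suffix()}.bdb"


def choose_download(file_exists, local_version, published_version):
    """Return 'full', 'delta', or 'none' for a published cleansing package."""
    if not file_exists:
        return "full"    # no local copy: download the full BDB file
    if local_version != published_version:
        return "delta"   # local copy is out of date: download the delta
    return "none"        # assumption: versions match, nothing to download


def plan_for(cache_dir, package_name, local_version, published_version):
    # Check the file system, then decide which download is needed.
    exists = local_bdb_path(cache_dir, package_name).exists()
    return choose_download(exists, local_version, published_version)
```

Keeping the decision in a function that takes plain values makes the three cases easy to check without touching the file system.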

Cleansing Package Builder Integration with Information Steward


Information Steward has a Cleansing Package Builder tab


Setting Up CPB Users in CMC

The Administrator can:

Set up Cleansing Package Builder users in Central Management Console (CMC)
Reassign a cleansing package to another user in CMC
Delete a cleansing package in CMC
Run Cleansing Package Builder
Have all permissions and rights of a Cleansing Package Builder user

The Cleansing Package Builder user can:


Create new cleansing packages
Publish their own (private) cleansing packages
Create a copy (save as) of their own (private) cleansing packages
Browse and import published cleansing packages
Rename and delete their private cleansing packages

Cleansing Package Builder Requirements

Cleansing Package Builder 4.0 requirements include (following IS requirements):

Browser and version
Internet Explorer 7 and 8 with Flash Player 9.0 or 10.0

Supported platforms and versions
Windows Server 2003 SP1 64-bit, SP2 64-bit, R2 64-bit (SP2)
Windows Server 2008 SP1 64-bit, SP2 64-bit, R2 64-bit
Solaris 10 (SPARC) 64-bit
AIX 5.3 (p-series) 64-bit, AIX 6.1 (p-series) 64-bit
HP-Itanium v11.31 64-bit
Linux (RedHat 5) 64-bit
Linux (Suse 10) 64-bit, (Suse 11) 64-bit

Supported web application servers and versions
SAP NW 7.2
WebLogic 9.2, 10 and 10.3
WebSphere 6.1 and 7
Tomcat 6.0
JBoss 4.2.3 and 5.0
WACS (aka Bobcat, BOBJ's Tomcat)

Cleansing Package Builder Workflow: Set Up Data Cleanse Job in Data Services
Open Data Cleanse transform in Data Services
Options tab
Select Published CP from dropdown box
Contains data
Contains rules

Cleansing Package Builder Workflow: Review Standardized Output


Run Data Cleanse job in Data Services
View output data

Cleansing Package Builder Workflow: Maintain Cleansing Packages

Maintain Cleansing Package
Verify output data
Change CP
Import more sample data
Continually tweak and update CP
Re-publish CP
Run Data Cleanse Job

INFORMATION STEWARD CLEANSING PACKAGE BUILDER

Build or refine a cleansing package
Publish the cleansing package

DATA SERVICES

Create a job that includes the Data Cleanse transform
Configure the transform option to refer to the cleansing package
Run the job to cleanse your data

As necessary, refine the cleansing package

Cleansing Package Task Screen: New Cleansing Package

Create a New Cleansing Package


Custom Cleansing Package
Wizard
Sample input file
Parsing Strategy
Suggested Attributes
Quick Start

Person and Firm Cleansing Package


Wizard
Name, Description, Japanese Data, and Normalized Data


Cleansing Package Task Screen: Open an Existing Cleansing Package

Open an existing Cleansing Package


My Cleansing Packages
Opens in Design mode
Edit Cleansing Package
Design mode
Advanced mode

Cleansing Package Task Screen: Save As

Save As
My Cleansing Package
Make a copy


Cleansing Package Task Screen: Publish

Publish a Cleansing Package


My Cleansing Package
Publish your Cleansing Package
Moves to Published Cleansing Packages
Cleansing Package available for Data Cleanse

Cleansing Package Task Screen: Rename

Rename a Cleansing Package


My Cleansing Package
Change name


Cleansing Package Task Screen: Delete

Delete a Cleansing Package


My Cleansing Package


Cleansing Package Task Screen: Browse

Browse a Cleansing Package


Published Cleansing Package


Cleansing Package Task Screen: Import

Import a Cleansing Package


Published Cleansing Package


Cleansing Package Task Screen: Migrate Data Cleanse 3.2 Dictionary and Rule Files

Import Data Cleanse 3.2 Dictionary and Rule Files
More menu option

Cleansing Package Task Screen: Generate Data Cleanse ATL

Get Data Services ATL


Published Cleansing Packages


Cleansing Package Task Screen: Package Details

Hover on Cleansing Package name


Cleansing Package details
My Cleansing Packages
Published Cleansing Packages

Create a New Cleansing Package: Process

To create a new Cleansing Package:


Custom Cleansing Process - Wizard six-step process
1. Import sample data
2. Define sample data
3. Select rows to analyze
4. Determine which parsing strategy to use
5. Select any out-of-the-box suggestions that are provided based on your sample data
6. Assign additional Attributes, Standard Forms, and/or Variations to a category

Create a New Cleansing Package: Import Sample Data

Custom Package

Step 1 of 6 (Name and Data)

Enables the Japanese parsing engine

Saves the entered data in a normalized form. Latin characters can be full-width or half-width; the normalized form is what gets saved.
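
As a rough illustration of what width normalization does, the following Python sketch folds full-width Latin characters to their ordinary forms. The slide does not say which normalization Cleansing Package Builder applies; Unicode NFKC is used here purely as an assumption, and `normalize_width` is a hypothetical helper name.

```python
import unicodedata


def normalize_width(text):
    # Assumption: NFKC stands in for the product's normalization.
    # NFKC folds full-width Latin letters and digits to their ASCII forms,
    # and also rewrites other compatibility characters.
    return unicodedata.normalize("NFKC", text)


print(normalize_width("ＡＢＣ１２３"))  # prints ABC123
```

With normalized storage, a full-width and a half-width spelling of the same value compare as equal.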

Select a language to return suggested attributes, standard forms, and variations that help build the cleansing package.

The cleansing package name must be unique and may contain only letters, numbers, and underscores.
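
The naming rule can be expressed as a short check. This is a sketch under assumptions: it treats "letters" as ASCII letters and compares names exactly (case-sensitively), neither of which the slide specifies, and `is_valid_package_name` is a hypothetical helper rather than anything the product exposes.

```python
import re

# Assumption: only ASCII letters, digits, and underscores are allowed.
NAME_PATTERN = re.compile(r"[A-Za-z0-9_]+")


def is_valid_package_name(name, existing_names):
    """Return True if the name uses allowed characters and is not taken."""
    if NAME_PATTERN.fullmatch(name) is None:
        return False
    # Assumption: uniqueness is checked by exact string comparison.
    return name not in existing_names


existing = {"Person_Firm_EN"}
print(is_valid_package_name("Product_CP1", existing))     # True
print(is_valid_package_name("Product CP", existing))      # False (space)
print(is_valid_package_name("Person_Firm_EN", existing))  # False (taken)
```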

Create a New Cleansing Package: Define Sample Data

Custom Package

Step 2 of 6 (Sample Definition)


Create a New Cleansing Package: Select Rows from Sample Data

Custom Package

Step 3 of 6 (Select Rows)


Create a New Cleansing Package: Determine Which Parsing Strategy to Use

Custom Package

Step 4 of 6 (Parsing Strategy)


Create a New Cleansing Package: Assign Attributes to a Category

Custom Package

Step 5 of 6 (Suggested Attributes)


Create a New Cleansing Package: Define Attributes, Standard Forms, and Variations

Custom Package

Step 6 of 6 (Suggested Attributes)


Create a New Cleansing Package: Person and Firm

Person and Firm Cleansing Package


Edit an Existing Cleansing Package: Options

To edit an existing Custom Cleansing Package in Design mode:


1. Add Attributes
2. Add values to Standard Forms and Variations
3. Use suggested Standard Forms and Variations
4. View records affected by the last user action (Last User Action tab)
5. Use Search/Filter Panel tab
6. Define Context
7. Resolve Conflict
8. Add additional rows to sample input
9. Delete a row from the Input pane
10. Define Format for Category

Edit an Existing Cleansing Package: Add Attributes

Custom Cleansing Package Design mode
Add Unique Attribute name

Edit an Existing Cleansing Package: Add Values to Standard Forms and Variations

Custom Cleansing Package Design Screen
Add
Drag and Drop
Import list
Manually add

Edit an Existing Cleansing Package: Use Suggested Standard Forms and Variations

Custom Cleansing Package Design Screen
Use Suggestion list

Edit an Existing Cleansing Package: View Records Affected by Last User Action

Custom Cleansing Package Design Screen
Last User Action tab

Edit an Existing Cleansing Package: Use Search/Filter Panel

Custom Cleansing Package Design Screen
Search/Filter panel

Edit an Existing Cleansing Package: Define Context

Custom Cleansing Package Design mode
Define context based on sample data

Edit an Existing Cleansing Package: Define Context Format

Define format for context


Edit an Existing Cleansing Package: Resolve Conflict - Generating a Conflict

Generate Conflict

Edit an Existing Cleansing Package: Resolve Conflict - Wizard

Custom Cleansing Package Design Screen
Resolve Conflict

Edit an Existing Cleansing Package: Resolve Conflict - Conflict Resolution

Custom Cleansing Package Design Screen
Resolve Conflict

Edit an Existing Cleansing Package: Add Additional Sample Rows

Custom Cleansing Package Design Screen
Add More Sample Rows

Delete a Row from the Input Sample Records

Custom Cleansing Package Design Screen
Delete an Input Row

Edit an Existing Cleansing Package: Define Category Format

Format Category


Edit an Existing Cleansing Package: Define Category Format

Format Category

Drag and Drop Attributes
Remove Attributes
Change Order
Add Text
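
A category format can be thought of as an ordered mix of attributes and literal text. The Python sketch below renders the Standard Description from the earlier glove example under that reading. The `FORMAT` list, the `render` function, and the `parsed` values are illustrative assumptions, not the product's internal representation.

```python
# Hypothetical parsed record, copied from the glove business example.
parsed = {
    "Product Category": "Glove",
    "Size": "Large",
    "Material": "Synthetic Leather",
    "Cuff Style": "Elastic Velcro",
    "Palm Type": "Ultra-Grip",
    "Color": "Black",
    "Vendor": "Mechanix Wear",
}

# Ordered format: ("attr", name) inserts a value, ("text", s) inserts text.
FORMAT = [
    ("attr", "Product Category"), ("text", " "),
    ("attr", "Material"), ("text", ", "),
    ("attr", "Color"), ("text", ", size: "),
    ("attr", "Size"), ("text", ", Cuff Style: "),
    ("attr", "Cuff Style"), ("text", ", "),
    ("attr", "Palm Type"), ("text", ", "),
    ("attr", "Vendor"),
]


def render(record, fmt):
    """Build the formatted string; missing attributes render as empty."""
    parts = []
    for kind, value in fmt:
        parts.append(record.get(value, "") if kind == "attr" else value)
    return "".join(parts)


print(render(parsed, FORMAT))
```

Reordering, removing, or adding entries in `FORMAT` corresponds to the Change Order, Remove Attributes, and Add Text actions on the slide.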

Edit an Existing Cleansing Package: View Category Format

View Category Format


Advanced Mode

Advanced mode enables you to:


Search for values
Manage Classifications and Entries
Edit Rules
Edit Reference Data

Note: A Person and Firm cleansing package always opens in Advanced mode when you create or modify it; there is no Design mode option for Person and Firm cleansing packages. Adding, modifying, or deleting data in Advanced mode does not automatically generate or change any rule files.

Advanced Mode: Search for Values

Cleansing Package Advanced mode Search



Advanced Mode: Manage Classifications and Entries

Cleansing Package Advanced mode
Manage Classifications and Entries

Advanced Mode: Rules Files

Cleansing Package Advanced mode
Add/Edit/Delete Rule
Auto Generated Rules
Custom Rules

Advanced Mode: Rule Options

Cleansing Package Advanced mode
Rule - Options
Rename Rule
Edit Description
View History
Create Copy
Delete

Note: Rules should be modified only by an expert user. Changing the rules affects how data is parsed and standardized.

Advanced Mode: Rules History

Cleansing Package Advanced mode
Rule - View History
Revert Rule Pattern Definition
Revert Rule Action

Advanced Mode: Edit Reference Data

Cleansing Package Advanced mode
Add/Edit/Delete Reference Data
Phone, Email, User-defined

Advanced Mode: Edit Reference Data

Cleansing Package Advanced mode
Social Security Reference Data
Import

Publish a Cleansing Package: Process

To publish a Cleansing Package:


1. Navigate to the Project Task screen
2. Select Cleansing Package (under My Cleansing Package)
3. Click Publish
4. Enter Published Cleansing Package name
5. View published Cleansing Package in Data Services

Creating a Data Cleanse Transform

Create a Data Cleanse ATL that can be used in Data Services:
Publish Cleansing Package
Get Data Services ATL from More menu option
Copy data in the text box and save with a .atl file extension

The ATL is a base ATL built from the cleansing package settings, including:
Cleansing Package name
Japanese engine enabled
Whitespace only

Life Cycle Management Console

AMERICAS | EUROPE | MIDDLE EAST | ASIA PACIFIC | INDIA


Questions


Thank you!
