Marc Haber
Functional Architect, InfoSphere Metadata Tools
Disclaimer
Copyright IBM Corporation 2010. All rights reserved.
U.S. Government Users Restricted Rights - Use, duplication or disclosure restricted by GSA ADP
Schedule Contract with IBM Corp.
THE INFORMATION CONTAINED IN THIS PRESENTATION IS PROVIDED FOR INFORMATIONAL
PURPOSES ONLY. WHILE EFFORTS WERE MADE TO VERIFY THE COMPLETENESS AND ACCURACY
OF THE INFORMATION CONTAINED IN THIS PRESENTATION, IT IS PROVIDED AS IS WITHOUT
WARRANTY OF ANY KIND, EXPRESS OR IMPLIED. IN ADDITION, THIS INFORMATION IS BASED ON
IBM'S CURRENT PRODUCT PLANS AND STRATEGY, WHICH ARE SUBJECT TO CHANGE BY IBM
WITHOUT NOTICE. IBM SHALL NOT BE RESPONSIBLE FOR ANY DAMAGES ARISING OUT OF THE USE
OF, OR OTHERWISE RELATED TO, THIS PRESENTATION OR ANY OTHER DOCUMENTATION. NOTHING
CONTAINED IN THIS PRESENTATION IS INTENDED TO, NOR SHALL HAVE THE EFFECT OF, CREATING
ANY WARRANTIES OR REPRESENTATIONS FROM IBM (OR ITS SUPPLIERS OR LICENSORS), OR
ALTERING THE TERMS AND CONDITIONS OF ANY AGREEMENT OR LICENSE GOVERNING THE USE OF
IBM PRODUCTS AND/OR SOFTWARE.
IBM, the IBM logo, ibm.com, and InfoSphere are trademarks or registered trademarks of International Business
Machines Corporation in the United States, other countries, or both. If these and other IBM trademarked terms are
marked on their first occurrence in this information with a trademark symbol (® or ™), these symbols indicate U.S.
registered or common law trademarks owned by IBM at the time this information was published. Such trademarks
may also be registered or common law trademarks in other countries. A current list of IBM trademarks is available
on the Web at "Copyright and trademark information" at www.ibm.com/legal/copytrade.shtml
Agenda
Introduction
InfoSphere Information Server
InfoSphere Foundation Tools
Metadata Primer
Getting Started
Goals
Architecture
Administration Tasks
Product Demonstration
Import, Manage and Deliver
Introduction
InfoSphere Vision
An Industry Unique Information Platform
Discover, model, define, and govern information structure and content
Standardize, merge, and correct information
Combine and restructure information for new uses
Synchronize, virtualize and move information for in-line delivery
Business Glossary: Manage Business Terms
Information Analyzer: Assess Data Quality
FastTrack: Capture Design Specifications
Discovery: Understand Data Relationships
Data Architect: Design Enterprise Models
Metadata Workbench: Monitor Data Flows
Design trusted information structures for business optimization
Data De-identification
Data Quality
Data Integration
Master Data Management
Data Warehousing
Manage Business Terms
Discover Data Relationships
Design Enterprise Models
Capture Design Specifications
Assess, Monitor, Manage Data Quality
Business Glossary
New Discovery
Data Architect
FastTrack
Information Analyzer
Metadata Workbench
Discover
Design
Govern
Compliance
Standards
Change
Understanding and reacting to the impact of
changes to data sources and structures
Metadata Primer
Literally, data about data
helps to describe a company's information from
business, technical, and operational perspectives
Technical Metadata
Audience: Specific tool users (BI, Data Integration, Profiling, Modeling)
Purpose: Defines source and target systems, table and field/attribute
structures, derivations and dependencies
Operational Metadata
Audience: Operations, Management
Purpose: Information about application runs: frequency, record counts,
component by component analysis, other statistics
Metrics
Understand the cardinality, range, valid values, frequency of a concept
Usage
Trace the data flow through systems and applications, understand
what processes and logic are involved in moving, transforming or
otherwise aggregating data
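Tracing a data flow back through systems and applications can be pictured as walking a graph of source-to-target steps. The following is a minimal sketch, not the Metadata Workbench implementation: the asset names, job names, and the `upstream_lineage` function are all hypothetical.

```python
from collections import deque

# Hypothetical flows: each edge is (source asset, process, target asset)
FLOWS = [
    ("crm.customers", "job_load_staging", "staging.customers"),
    ("staging.customers", "job_build_dim", "warehouse.dim_customer"),
    ("warehouse.dim_customer", "report_query", "bi.customer_report"),
    ("erp.orders", "job_load_orders", "warehouse.fact_orders"),
]

def upstream_lineage(asset, flows):
    """Return the (source, process, target) steps that feed `asset`, nearest first."""
    # Index the edges by target so each lookup finds the steps writing to an asset
    by_target = {}
    for source, process, target in flows:
        by_target.setdefault(target, []).append((source, process, target))

    steps, seen, queue = [], {asset}, deque([asset])
    while queue:
        current = queue.popleft()
        for step in by_target.get(current, []):
            steps.append(step)
            if step[0] not in seen:  # guard against cycles in the flow
                seen.add(step[0])
                queue.append(step[0])
    return steps

for source, process, target in upstream_lineage("bi.customer_report", FLOWS):
    print(f"{target} <- {process} <- {source}")
```

Each returned step names the process involved, which is the part of the trace that shows what logic moved or transformed the data.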
Features
Benefits
IT Developers
Administrators
Project Managers
& DBAs
Data Lineage
Business Lineage
Business-oriented view of the
Data Lineage Analysis report
Asset Details
display asset information, including
relationships and usage details
Asset Information
display base information, including
description, container and relationships
Asset Usage
understand consumption by ETL Jobs or Mappings,
the meaning defined in Business Glossary,
the Data Steward, Mapping Specification
requirements from FastTrack, or analysis
and profiling data from Information
Analyzer
Homepage
quickly search, display or query
Information Assets
Query Results
formatted as a spreadsheet, for
easier understanding and readability
Results
Formatted as a spreadsheet, for easier understanding and readability
Grouped according to Type
Ability to save as Spreadsheet or Text File
Query Construction
Getting Started
Design Specification
Design Document
Abstract definition and
specification that govern the
flow of information from source
systems to Reporting, OLAP
and Mining deliverables.
Governance and Auditing
requirements dictate the need
for Data Lineage reporting
analysis.
1. System Application
2. Data File
3. Database Warehouse & Mart
4. BI Reports
5. DataStage Jobs
6. Data Scripts
Goals
Data Lineage
Ability to view Data Flow, validate Systems of Record, validate
Business Logic
Data Reporting
Ensure compliance and data re-use, understand data consumption
Data Terminology
Ensure standardized language, descriptions and methodology
Data Consistency
Ensure proper Data Formatting, Data Type and Value Range
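The Data Consistency goal (formatting, data type, value range) can be illustrated with a small rule check. This is a sketch under stated assumptions: the column names, the `RULES` table, and `check_value` are hypothetical and are not part of any InfoSphere tool.

```python
import re

# Hypothetical rules per column: expected type, format pattern, and value range
RULES = {
    "customer_id": {"type": int, "min": 1, "max": 999_999},
    "postal_code": {"type": str, "pattern": r"\d{5}"},
}

def check_value(column, value, rules=RULES):
    """Return a list of rule violations for one value (empty list means consistent)."""
    rule = rules[column]
    problems = []
    if not isinstance(value, rule["type"]):
        problems.append(f"expected {rule['type'].__name__}")
        return problems  # the remaining checks assume the right type
    if "pattern" in rule and not re.fullmatch(rule["pattern"], value):
        problems.append("bad format")
    if "min" in rule and value < rule["min"]:
        problems.append("below range")
    if "max" in rule and value > rule["max"]:
        problems.append("above range")
    return problems

print(check_value("customer_id", 0))
print(check_value("postal_code", "1234"))
```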
Metadata Preparation
Import metadata about Database Tables and Files that are used
in Job Design and Production
Did you know? Design metadata for DataStage and QualityStage jobs is automatically stored
in the metadata repository, as is metadata from all other suite tools.
BUSINESS GLOSSARY
FASTTRACK
METADATA WORKBENCH
INFORMATION ANALYZER
MANAGE CONTENT
INFOSPHERE DATA ARCHITECT
Metadata Workbench
METADATA SERVER
Data Structure
Lineage
Operational
Business
ETL Design
Technical
IMPORT/EXPORT MANAGER OR
DATASTAGE CONNECTORS
ETL Operational
BI Structure
Querying
IT ASSETS
Understanding
Benefits
Leverage common metadata exchange environment for application development consistency
IT Developers
IT Administrators
Data Source
import and maintain application,
procedure or file definitions from
spreadsheets
Data Flow
import and maintain source to target
mappings, their business logic and
function from spreadsheets
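To make the idea of source-to-target mappings kept in a spreadsheet concrete, here is a minimal sketch that reads such rows from a CSV export. The column names (`source`, `target`, `rule`, `function`) and the sample rows are assumptions for illustration only; they are not the import format expected by the Metadata Workbench.

```python
import csv
import io

# Hypothetical CSV export of a mapping spreadsheet; the column names are assumptions
SAMPLE = """source,target,rule,function
crm.customers.name,warehouse.dim_customer.full_name,Trim and uppercase,Cleanse
crm.customers.dob,warehouse.dim_customer.birth_date,Convert to ISO date,Convert
"""

def read_mappings(text):
    """Parse mapping rows and index them by target column."""
    by_target = {}
    for row in csv.DictReader(io.StringIO(text)):
        by_target.setdefault(row["target"], []).append(
            {"source": row["source"], "rule": row["rule"], "function": row["function"]}
        )
    return by_target

mappings = read_mappings(SAMPLE)
for target, sources in mappings.items():
    for m in sources:
        print(f"{m['source']} -> {target} ({m['rule']})")
```

Indexing by target keeps every source that feeds a column together, which is the question a lineage report usually starts from.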
IT Administrators
IT Administrators
Custom Attributes
extend the properties of a mapping to
record specific and proprietary
information, including runtime data,
specification or organizational data
Create or Import
create Extended Data Flow Mapping
documents within the Metadata
Workbench or import from a file
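Custom attributes on a mapping can be thought of as named, free-form properties attached to the base mapping record. The sketch below assumes a simple in-memory model; the `ExtendedMapping` class, its fields, and the attribute names are hypothetical and do not reflect the product's internal data model.

```python
from dataclasses import dataclass, field

@dataclass
class ExtendedMapping:
    """One source-to-target mapping plus free-form custom attributes."""
    source: str
    target: str
    rule: str = ""
    custom: dict = field(default_factory=dict)

    def set_attribute(self, name, value):
        # Custom attributes are stored by name; values are kept as given
        self.custom[name] = value
        return self

# Hypothetical mapping with runtime and organizational attributes
mapping = ExtendedMapping("erp.orders.amount", "warehouse.fact_orders.amount", "Convert to USD")
mapping.set_attribute("owner_team", "Finance IT").set_attribute("last_run_rows", 120_000)
print(mapping.custom)
```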
Database Connectors
Job Parameters and Environment Variables
Load Column information from Shared Table
Supported DataStage Stage Types
DataStage Common Connector Stages
Build SQL vs. User Defined SQL
DB2 Native: Server Name, Schema Name, Table Name
RDBMS Native: Server Name, Schema Name, Table Name
MSOLE Native, MS OLEDB (S): Server Name, Schema Name, Table Name
MSSQL Native: Server Name, Schema Name, Table Name
Oracle Native: Server Name, Schema Name, Table Name
Sybase Native: Server Name, Schema Name, Table Name
ODBC, ODBC (S), ODBC Connector (P), ODBC Enterprise (P): Server Name, Schema Name, Table Name
Teradata: Server Name, Table Name
File Name
File Name
Hash File
File Name
Sequential File
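The stage types above identify a table by server, schema, and table name, so a job stage can be related to imported table metadata by comparing those names. The following is a minimal sketch of that comparison; the `TableRef` class, the `match_stage_to_table` helper, and the case-insensitive matching are assumptions of this example, not documented product behavior.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass(frozen=True)
class TableRef:
    """Identity of a database table as referenced by a job stage."""
    server: str
    table: str
    schema: Optional[str] = None  # some stage types list no schema name

    def key(self):
        # Case-insensitive, whitespace-trimmed comparison is an assumption of this sketch
        parts = [self.server, self.schema or "", self.table]
        return tuple(p.strip().upper() for p in parts)

def match_stage_to_table(stage_ref, imported_tables):
    """Return the imported table whose identity matches the stage reference, if any."""
    index = {t.key(): t for t in imported_tables}
    return index.get(stage_ref.key())

# Hypothetical imported table and a stage that references it with different casing
imported = [TableRef("dbsrv1", "CUSTOMER", "SALES")]
print(match_stage_to_table(TableRef("DBSRV1", "customer", "sales"), imported))
```

When a parameter or environment variable supplies one of these names, it would need to be resolved to its value before a comparison like this can succeed.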
Product Demonstration
Summary
Step 1: Understanding the objectives
Step 2: Defining the Tasks
Step 3: IBM InfoSphere Delivering Lineage and Understanding
Thank You!
Your Feedback is Important to Us
Fri - May 21
Thu May 20