Course Contents
Module 01: Introduction
Module 02: Deployment
Module 03: Administering DataStage
Module 04: DataStage Designer
Module 05: Repository Functions
Module 06: DataStage Utilities
Unit objectives
Figure: component diagram with the following labels: WebSphere Information Services Director, WebSphere Business Glossary, WebSphere Information Analyzer, WebSphere DataStage, WebSphere QualityStage, WebSphere Federation Server, Metadata Access Services, Metadata Analysis Services, Metadata Server, Information Server Console
What's Different?
Sequential files
Database
DataStage Administrator
DataStage Designer
DataStage Director
Unit Summary
Unit objectives
Figure: deployment diagram with the following labels: Metadata Server backbone, DataStage Server, Clients
Deployment: DataStage on a separate machine
Here the domain is split between two machines:
- DataStage server
- Metadata server and DB2 repository
Figure: deployment diagram with the following labels: DataStage Server, DB2 instance with repository, Clients
Figure: deployment diagram with the following labels: Metadata Server backbone, Clients, DataStage server, DB2 instances with repository
Checkpoint
1. What application components make up a domain?
2. Can a domain contain multiple DataStage servers?
3. Do the DB2 instance and the repository database need to be on the same machine as the application server?
4. Suppose DataStage is on a separate machine from the application server. What two components need to be running before you log on to DataStage?
Unit Summary
Having completed this unit, you should be able to:
Identify the components of Information Server that need to be installed
Describe what a deployment domain consists of
Describe different domain deployment options
Describe the installation process
Start the Information Server
Unit objectives
DataStage Administrator
Logging on to Administrator
- Host name and port number of the application server
- DataStage administrator ID and password
- Name or IP address of the DataStage server machine
Permissions Tab
Checkpoint
1. Authorization can be assigned to what two items?
2. What two types of authorization roles can be assigned to a user or group?
Unit Summary
Having completed this unit, you should be able to:
Open the administrative console
Create new users and groups
Assign suite roles and product roles to users and groups
Give users DataStage credentials
Log on to DataStage Administrator
Add a DataStage administrator
Add a DataStage user on the Permissions tab and specify the user's role
Unit Objectives
After completing this unit, you should be able to:
Log on to DataStage
Navigate around DataStage Designer
Create a Parameter Set
Build a Range Lookup job
Import and export DataStage objects to a file
Parameter Sets
Store a collection of parameters in a named object
One or more values files can be named and specified
- A values file stores values for specified parameters
- Values are picked up at runtime
Parameter sets can be added to the job parameters specified on the Parameters tab in the job properties
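The relationship between a parameter set and its values files can be pictured with a small Python sketch. This is only an illustration of the idea described on the slide: the `ParameterSet` class, the `resolve` method, and the parameter names and paths are hypothetical and are not part of DataStage or its file formats.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class ParameterSet:
    """Hypothetical stand-in for a named collection of job parameters."""
    name: str
    defaults: Dict[str, str]
    # Each values file maps some of the parameter names to override values.
    values_files: Dict[str, Dict[str, str]] = field(default_factory=dict)

    def resolve(self, values_file: Optional[str] = None) -> Dict[str, str]:
        """Return the values a run would see, optionally using a values file."""
        resolved = dict(self.defaults)
        if values_file is not None:
            overrides = self.values_files[values_file]
            unknown = set(overrides) - set(self.defaults)
            if unknown:
                raise KeyError(f"values file sets unknown parameters: {sorted(unknown)}")
            resolved.update(overrides)
        return resolved


# Assumed example data: names and paths are invented for illustration.
run_defaults = ParameterSet(
    name="RunDefaults",
    defaults={"SourceDir": "/data/in", "TargetDir": "/data/out"},
    values_files={"test": {"SourceDir": "/data/test/in"}},
)

print(run_defaults.resolve())        # defaults only
print(run_defaults.resolve("test"))  # defaults overlaid with the "test" values file
```

The sketch models the run-time pickup as a simple overlay: values named in the chosen values file replace the defaults, and everything else keeps its default.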
Parameters Tab
Values Tab
Repository Window
Export Window
Import Options
Checkpoint
1. True or false: The directory to which you export is on the DataStage client machine, not on the DataStage server machine.
Checkpoint Solution
1. True
Unit Summary
Having completed this unit, you should be able to:
Log on to DataStage
Navigate around DataStage Designer
Create a Parameter Set
Build a Range Lookup job
Import and Export DataStage objects to a file
Unit Objectives
Quick Find
Found Results
Impact Analysis
Comparison Results
Checkpoint
1. You can compare the differences between what two kinds of objects?
2. What wildcard characters can be used in a Find?
3. You have a job whose name begins with abc. You can't remember the rest of the name or where the job is located. What would be the fastest way to export the job to a file?
4. Name three filters you can use in an Advanced Find.
Checkpoint Solutions
1. Jobs and table definitions.
2. Asterisk (*). It stands for zero or more characters.
3. Do a Find for objects matching abc*. Filter by type Job. Locate the job in the result set. Right-click it, and then click Export.
4. Type of object, creation date range, last modified date range, where used, dependencies of, and other options including case sensitivity and search within last result set.
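The asterisk wildcard and the type filter can be illustrated outside DataStage with Python's standard `fnmatch` module. This is a sketch only: the repository contents and the `find` helper are hypothetical, and `fnmatch` also interprets `?` and `[...]`, which goes beyond the single wildcard described here.

```python
from fnmatch import fnmatchcase

# Hypothetical repository contents as (name, type) pairs.
repository = [
    ("abcLoadCustomers", "job"),
    ("abcCustomers", "table definition"),
    ("LoadOrders", "job"),
    ("abc", "job"),
]


def find(pattern, object_type=None):
    """Return names matching the pattern, optionally filtered by object type."""
    return [
        name
        for name, kind in repository
        if fnmatchcase(name, pattern) and (object_type is None or kind == object_type)
    ]


# "abc*" matches "abc" followed by zero or more characters.
matches = find("abc*", object_type="job")
print(matches)  # ['abcLoadCustomers', 'abc']
```

Because the asterisk matches zero characters as well, the job named exactly `abc` is included in the result.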
Unit Summary
Having completed this unit, you should be able to:
Perform a simple Find
Perform an Advanced Find
Perform an impact analysis
Compare the differences between two table definitions
Compare the differences between two jobs
Unit Objectives
After completing this unit, you should be able to:
Analyze the performance of a job
Estimate the resources needed by a job
Performance Analyzer
Visualization tool that provides deeper insight into job run-time behavior
Offers several categories of visualizations:
- Record throughput (rows/sec)
- CPU utilization
- Job timing
- Job memory utilization
- Physical machine utilization
Performance data to be visualized can be:
- Filtered in selected ways, including
  - Hide startup processes
  - Hide license operators
  - Hide inserted operators
- Isolated to selected stages (operators), partitions, and phases
Charts can be saved and printed
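To make the record throughput metric and the "Hide inserted operators" filter concrete, here is a minimal Python sketch. It does not read real Performance Analyzer data: the `StageSample` type, the helper functions, and all measurements are invented for illustration.

```python
from typing import List, NamedTuple


class StageSample(NamedTuple):
    stage: str              # stage (operator) name
    rows: int               # records processed
    elapsed_seconds: float  # time the stage was active
    inserted: bool          # True if the operator was not placed by the job designer


def record_throughput(rows: int, elapsed_seconds: float) -> float:
    """Records per second for one stage."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return rows / elapsed_seconds


def visible(samples: List[StageSample], hide_inserted: bool = True) -> List[StageSample]:
    """Apply a filter analogous to 'Hide inserted operators'."""
    return [s for s in samples if not (hide_inserted and s.inserted)]


# Assumed measurements, for illustration only.
samples = [
    StageSample("Read_Customers", 1_000_000, 40.0, False),
    StageSample("inserted_sort", 1_000_000, 50.0, True),
    StageSample("Write_Target", 1_000_000, 80.0, False),
]

for s in visible(samples):
    print(f"{s.stage}: {record_throughput(s.rows, s.elapsed_seconds):.0f} rows/sec")
```

Comparing rows/sec across the visible stages is one way to spot where a job slows down; in this invented data the write stage runs at half the rate of the read stage.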
Example Job
Filters
Resource Estimator
Resource Estimation
To start:
- Open the job
- Click the Resource Estimation icon in the Designer toolbar
- Click Run to build statistics based on a job run
Generate models
- Static model: computed worst-case scenarios of resource usage
- Dynamic model: computed from a sampling of data
- View resource estimates by stage
- Compare model resource estimates
Generate projections
- View projection resource estimates
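The general idea of projecting resource needs from a sampled run can be sketched in a few lines of Python. This is a simplified illustration that assumes usage grows linearly with row count; it is not a description of how the Resource Estimator actually computes its models or projections, and the function name, resource names, and figures are hypothetical.

```python
from typing import Dict


def project_linear(sample_rows: int, sample_usage: Dict[str, float], target_rows: int) -> Dict[str, float]:
    """Scale usage measured on a sample up to a target row count (linear assumption)."""
    if sample_rows <= 0:
        raise ValueError("sample_rows must be positive")
    return {
        resource: amount * target_rows / sample_rows
        for resource, amount in sample_usage.items()
    }


# Assumed sample run: 10,000 rows used 5 MB of disk and 2 CPU seconds.
sample_usage = {"disk_mb": 5.0, "cpu_seconds": 2.0}
projection = project_linear(10_000, sample_usage, 1_000_000)
print(projection)  # {'disk_mb': 500.0, 'cpu_seconds': 200.0}
```

A worst-case (static) estimate would instead be derived without running on data, so the two kinds of model can disagree; comparing them is the purpose of the model comparison step listed above.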
Example Job
Model Tab
Projection Tab
Projection Estimates
Checkpoint
1. What are the five types of visualization that can be created by the Performance Analyzer?
2. How do you enable the collection of performance data?
3. Describe the two types of models that can be generated by the Resource Estimator.
Checkpoint Solutions
1. Record throughput (rows/sec), CPU utilization, job timing, job memory utilization, and physical machine utilization.
2. Click the Execution tab in Job Properties. Select Record job performance data.
3. Static model: computed worst-case scenarios of resource usage. Dynamic model: computed from a sampling of the data.
Unit Summary
Having completed this unit, you should be able to:
Analyze the performance of a job
Estimate the resources needed by a job