Академический Документы
Профессиональный Документы
Культура Документы
-Prepared By
Himanshu Thakkar
Contents
1
2
3
4
5
6
7
Contd
Contd
Node
It is a logical processing unit.
Each node in a configuration file is distinguished by a virtual name and
defines a number and speed of CPUs, memory availability, page and swap
space, network connectivity details, etc.
Within a configuration file, the number of processing nodes defines the
degree of parallelism and resources that a particular job will use to run.
A configuration file with a larger number of nodes generates a larger
number of processes that use more memory (and perhaps more disk
activity) than a configuration file with a smaller number of nodes.
While the DataStage documentation suggests creating half the number of
nodes as physical CPUs, this is a conservative starting point that is highly
dependent on system configuration, resource availability, job design, and
other applications sharing the server hardware.
10
Contd
Fastname
The fastname is the physical node name that stages use to open
connections for high volume data transfers.
Typically, you can get this name by using Unix command uname -n.
In SMP , it is the principal node name as all nodes uses same fastname .
Pool
Based on the characteristics of the processing nodes you can group
nodes into set of pools.
A pool can be associated with many nodes and a node can be part of
many pools.
A node belongs to the default pool unless you explicitly specify a pools
list for it, and omit the default pool name () from the list.
11
Contd
Resource disk :
Here a disk path is defined. The data files of the dataset that are
accessible to each nodes are stored in the resource disk.
Resource scratch disk :
Here also a path to folder is defined. This path is used by the parallel job
stages for buffering of the data when the parallel job runs.
12
13
Thank You