Академический Документы
Профессиональный Документы
Культура Документы
The Sequential File stage is a file stage that allows you to read data from or write
data one or more flat files.
The stage can have a single input link or a single output link, and a single rejects
link.
The stage executes in parallel mode if reading multiple files but executes
sequentially if it is only reading one file. By default a complete file will be read by a
single node (although each node might read more than one file). For fixed-width
files, however, you can configure the stage to behave differently:
You can specify that single files can be read by multiple nodes. This can improve
performance on cluster systems.
You can specify that a number of readers run on a single node. This means, for
example, that a single file can be partitioned as it is read (even though the stage is
constrained to running sequentially on the conductor node).
RCP:
RCP does stand for runtime column propagation. Its purpose is to eliminate the need
for a developer to specifically name any column that does not need to be named in
the design (for example because it is being used in a transformation) and yet have
the data in that column automatically propagate to the stage's output link.