

51. What are the difficulties faced in using DataStage? Or, what are the constraints in using
DataStage?

52. Have you ever been involved in updating DS versions, such as DS 5.X? If so, tell us some of the steps
you have

53. What are XML files, how do you read data from XML files, and which stage should be used?

54. How do you track performance statistics and improve performance?

55. What are the types of views in DataStage Director?


There are 3 types of views in DataStage Director: a) Job View - dates the jobs were compiled. b) Status
View - status of the last job run. c) Log View - warning messages, event messages, program-generated
messages.

56. What is the default cache size? How do you change the cache size if needed?
The default cache size is 256 MB. We can increase it by going into DataStage Administrator,
selecting the Tunables tab, and specifying the cache size there.

57. How do you pass the parameter to the job sequence if the job is running at night?

58. How do you catch bad rows from the OCI stage?

59. What are QualityStage and ProfileStage?

60. What is the use and advantage of a procedure in DataStage?

61. What are the important considerations when using the Join stage instead of lookups?

62. How do you implement a Type 2 slowly changing dimension in DataStage? Give an example.

63. How do you implement the Type 2 slowly changing dimension in DataStage?

64. What are static hash files and dynamic hash files?

65. What is the difference between DataStage server jobs and DataStage parallel jobs?

66. What is 'insert for update' in DataStage?

67. How did you connect to DB2 in your last project?


Using DB2 ODBC drivers.

68. How do you merge two files in DS?


Either use the copy command as a before-job subroutine if the metadata of the 2 files is the same, or
create a job to concatenate the 2 files into one if the metadata is different.

69. What is the order of execution done internally in the Transformer, with the stage editor having
input links on the left-hand side and output links?

70. How will you call an external function or subroutine from DataStage?

71. What happens if the job fails at night?

72. Types of Parallel Processing?


Parallel processing is broadly classified into 2 types: a) SMP - Symmetric Multiprocessing. b)
MPP - Massively Parallel Processing.

73. What is DS Administrator used for - did you use it?

74. How do you do an Oracle 4-way inner join if there are 4 Oracle input files?

75. How do you pass filename as the parameter for a job?

76. How do you populate source files?

77. How do you handle date conversions in DataStage? Convert a mm/dd/yyyy format to yyyy-dd-mm?
We use a) the "Iconv" function - internal conversion, and b) the "Oconv" function - external conversion.
The function to convert mm/dd/yyyy format to yyyy-dd-mm is Oconv(Iconv(Fieldname,"D/M
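
The two-step idea behind nesting Oconv around Iconv can be sketched outside DataStage. The Python below is only an analogy, not DataStage BASIC: it first parses the external mm/dd/yyyy text into an internal date value (the role Iconv plays) and then formats that value as yyyy-dd-mm (the role Oconv plays). The function name convert_mdy_to_ydm is hypothetical.

```python
from datetime import datetime


def convert_mdy_to_ydm(text: str) -> str:
    """Convert 'mm/dd/yyyy' text to 'yyyy-dd-mm' via an intermediate date value."""
    # Step 1 (analogous to Iconv): parse the external text into an internal date.
    internal = datetime.strptime(text, "%m/%d/%Y").date()
    # Step 2 (analogous to Oconv): format the internal date for output.
    return internal.strftime("%Y-%d-%m")


if __name__ == "__main__":
    print(convert_mdy_to_ydm("02/06/2006"))  # prints 2006-06-02
```

Going through an internal value rather than rearranging substrings means an invalid input such as 13/40/2006 is rejected at the parsing step instead of being passed through silently.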

78. How do you execute a DataStage job from the command line?
Use the "dsjob" command as follows: dsjob -run -jobstatus projectname jobname
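
As a rough sketch of how the dsjob call above might be wrapped for scripting, the Python below builds the same argument list and optionally adds job parameters. It assumes dsjob is on the PATH of the DataStage server and that parameters are passed as "-param name=value"; the helper names and the SourceFile parameter are hypothetical.

```python
import subprocess
from typing import Dict, List, Optional


def build_dsjob_command(project: str, job: str,
                        params: Optional[Dict[str, str]] = None) -> List[str]:
    """Build the argument list for: dsjob -run -jobstatus project job."""
    command = ["dsjob", "-run", "-jobstatus"]
    # Assumption: each job parameter is supplied as "-param name=value".
    for name, value in (params or {}).items():
        command += ["-param", f"{name}={value}"]
    command += [project, job]
    return command


def run_job(project: str, job: str,
            params: Optional[Dict[str, str]] = None) -> int:
    """Run the job and return the exit code reported by dsjob."""
    return subprocess.run(build_dsjob_command(project, job, params)).returncode


if __name__ == "__main__":
    # Only print the command here; run_job() needs a DataStage installation.
    print(" ".join(build_dsjob_command("projectname", "jobname",
                                       {"SourceFile": "/data/in/sales.txt"})))
```

Building the command as a list rather than a single string avoids shell quoting problems when a parameter value, such as a file name, contains spaces.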

79. Differentiate Primary Key and Partition Key?


A primary key is a combination of unique and not null. It can be a collection of key values, called
a composite primary key. A partition key is just a part of the primary key. There are several
methods of

80. How do you install and configure DataStage EE on Sun Microsystems multi-processor hardware
running the Solaris 9 operating system?
Asked by: Kapil Jayne

81. What are all the third party tools used in DataStage?

82. How do you eliminate duplicate rows?

83. What is the difference between a routine, a transform, and a function?

84. Do you know about INTEGRITY/QUALITY stage?

85. How do you attach an .mtr file (MapTrace) to an email? The MapTrace is used to record all the
map execution errors.

86. Is it possible to calculate a hash total for an EBCDIC file and have the hash total stored as
EBCDIC using DataStage?
Currently, the total is converted to ASCII, even though the individual records are stored as EBCDIC.

87. If you're running 4-way parallel and you have 10 stages on the canvas, how many processes
does DataStage create?

88. Explain the differences between Oracle8i/9i?

89. How will you pass the parameter to the job schedule if the job is running at night? What
happens if one job fails in the night?

90. What is an environment variable?

91. How do you find duplicate records using the Transformer stage in the server edition?

92. What is a phantom error in DataStage?

93. How can we increment the surrogate key value for every insert into the target database?

94. What is the use of environment variables?

95. How can we run a batch from the command line?

96. What is a fact load?

97. Explain a specific scenario where we would use range partitioning.

98. What is job commit in DataStage?

99. Hi, what are the disadvantages of a staging area? Thanks, Jagan

100. How do you configure api_dump?

102. Does the type of partitioning change for SMP and MPP systems?

103. What is the difference between RELEASE THE JOB and KILL THE JOB?

104. Can you convert a snowflake schema into a star schema?

105. What is the repository?

106. What is fact loading, and how do you do it?

107. What is the alternative way to do job control?

108. Where can we use the Link Partitioner, Link Collector, and Inter Process (OCI) stages:
in server jobs or in parallel jobs? And is SMP parallel or server?

109. Where can you output data using the Peek stage?

110. Do you know about MetaStage?

111. In which situations do we use the Runtime Column Propagation option?

112. What is the difference between DataStage and DataStage TX?

113. 1. Difference between a hash file and a sequential file? What is modulus?
2. What are the Iconv and Oconv functions?
3. How can we join one Oracle source and a sequential file?
4. How can we implement slowly changing dimensions in DataStage?
5. How can we implement a lookup in DataStage server jobs?
6. What are all the third-party tools used in DataStage?
7. What is the difference between a routine, a transform, and a function?
8. What are the job parameters?
9. Plug-in?
10. How can we improv

114. Is it possible to query a hash file? Justify your answer...

115. How do you enable the DataStage engine?

116. How can I convert server jobs into parallel jobs?

117. Suppose you have a table "sample" with three columns:

Cola  Colb  Colc
1     10    100
2     20    200
3     30    300

Assume Cola is the primary key. How will you fetch the record with the maximum Cola value into
the target system using DataStage?

118. How do you parameterize a field in a sequential file? I am using DataStage as the ETL
tool, with a sequential file as the source.

119. What is TX and what is its use in DataStage? As far as I know, TX stands for Transformer
Extender, but I don't know how it works or where it is used.

120. What is the difference between the Merge stage and the Lookup stage?

121. Importance of surrogate keys in data warehousing?
A surrogate key is a primary key for a dimension table. The main importance of using it is that it is
independent of the underlying database, i.e. the surrogate key is not affected by the changes going on
with a databas

122. What is the difference between symmetric multiprocessing and massively parallel
processing?

123. What is the difference between the Dynamic RDBMS stage and the Static RDBMS stage?

124. How do you run a job from the command line?

125. What is user activity in DataStage?

126. How can we improve job performance?

127. How can we create a rank in DataStage, as in Informatica?


128. What is the use of job control?

129. What does # indicate in environment variables?

130. What are the two types of hash files?

131. What are the different types of star schema?

132. What are the different types of file formats?

133. What are the different dimension tables in your project? Please explain with an example.

134. What is the difference between buildops and subroutines?

135. How can we improve performance in the Aggregator stage?

136. What is SQL tuning? How do you do it?

137. What is the use of tunables?

138. How do you distinguish the surrogate key in different dimension tables? How can we assign it for
different dimension tables?

139. How can we load the source into an ODS?

140. What is the difference between a sequential file and a dataset? When do you use the Copy stage?

141. How do you eliminate duplicate rows in DataStage?

142. What is a complex stage? In which situations do we use it?

143. What is the Sequencer stage?

144. Where are the flat files actually stored? What is the path?

145. What are the different types of lookups in DataStage?

146. What are the most important aspects that a beginner must consider when doing his first DS project?

147. How do you find errors in a job sequence?

148. Is it possible for two users to access the same job at the same time in DataStage?

149. How do you kill a job in DataStage?

150. How do you find the process ID? Explain with steps.

151. What is a job sequence used for? What are batches? What is the difference between job sequences
and batches?

152. What is integration and unit testing in DataStage?

153. What are the Iconv and Oconv functions?

154. For what purpose is the stage variable mainly used?

155. What is the purpose of using a key, and what is the difference between surrogate keys and natural keys?

156. How do you read data from Excel files? My problem is that my data file has some commas in the
data, but the delimiter we are using is |. How do you read the data? Explain with steps.

157. How can I schedule the cleaning of the file &PH& by dsjob?

158. Hot fix for the ODBC stage for AS400 V5R4 in DataStage 7.1

159. What is the DataStage engine? What is its purpose?

160. What is the difference between a transform and a routine in DataStage?

161. What is the meaning of the following? 1) If an input file has an excessive number of rows and
can be split up, then use standard logic to run jobs in parallel. 2) Tuning should occur on a
job-by-job basis. 3) Use the power of the DBMS.

162. Why is a hash file faster than a sequential file and the ODBC stage?

163. Hello, can both the source system (Oracle, SQL Server, etc.) and the target data warehouse (maybe
Oracle, SQL Server, etc.) be in a Windows environment, or should one of the systems be in a
UNIX/Linux environment? Thanks, Jagan

164. How do you write and execute routines for PX jobs in C++?

165. What is a routine?

166. How do you distinguish the surrogate key in different dimension tables?

167. How can we generate a surrogate key in server/parallel jobs?

168. What is NLS in DataStage? How do we use NLS in DataStage? What are its advantages? At the
time of installation I did not choose the NLS option; now I want to use it. What can I
do? Do I have to reinstall DataStage, or first uninstall and then install once again?

169. How do you read data from Excel files? Explain with steps.

170. What is the meaning of a performance tuning technique? Give an example.

171. Differentiate between pipeline and partition parallelism.

172. What is the use of a hash file? Instead of a hash file, why can't we use a sequential file itself?

173. What is the Pivot stage? Why do you use it? For what purpose is that stage used?

174. How did you handle reject data?

175. Hi, what is the difference between ETL and ELT?

176. How can we create environment variables in DataStage?

177. What is the difference between static hash files and dynamic hash files?

178. How can we test the jobs?

179. What is the difference between a reference link and a straight link?

180. What are the command line functions that import and export the DS jobs?

181. What is the size of the flat file?

182. What is the difference between an operational data store (ODS) and a data warehouse?

183. I have a few questions. 1. What are the various processes that start when the DataStage engine
starts? 2. What changes need to be made on the database side if I have to use the DB2 stage? 3. Is the
DataStage engine responsible for compilation, execution, or both?

184. Could anyone please tell me the full details of DataStage certification? What is the title of the
certification? What is the fee for the certification test? Where can we get tutorials for the
certification? Who conducts the certification exam? Is there any training institute or person
for guidance? I would be very pleased if anyone could enlighten me about the above. Suresh

185. How do you use rank and update strategy in DataStage?

186. What is ad hoc access? What is the difference between managed query and ad hoc
access?

187. What is Runtime Column Propagation and how do you use it?

188. How do we use the DataStage Director and its run-time engine to schedule running the solution,
test and debug its components, and monitor the resulting executable versions on an ad
hoc or scheduled basis?

189. What is the difference between the OCI stage and the ODBC stage?

190. Is there any difference between Ascential DataStage and DataStage?

191. How do you remove duplicates without using the Remove Duplicates stage?

192. If we are using two sources with the same metadata, how do we check whether the data in the two
sources is the same or not? And if the data is not the same, I want to abort the job. How can we do this?

193. If a DataStage job aborts after, say, 1000 records, how do you continue the job from the 1000th
record after fixing the error?

194. Can you tell me for what purpose .dsx files are used in DataStage?

195. How do you clean the DataStage repository?

196. Give one real-time situation where the Link Partitioner stage is used.

197. What are environment variables? What is their use?

198. How do you call procedures in DataStage?

199. How do you remove duplicates in a server job?

200. What is the exact difference between the Join, Merge, and Lookup stages?

202. What are the new features of DataStage 7.1 compared with DataStage 6.1?

203. How do you run a job from the command prompt in UNIX?

204. How do you know the number of records in a sequential file before running a server job?

205. Other than round robin, what algorithm is used in the Link Collector? Also explain how it
works.

206. How do you drop the index before loading data into the target, and how do you rebuild it in DataStage?

207. How can we ETL an Excel file to a data mart?

208. What are the transaction size and array size in the OCI stage? How can these be used?

209. What is job control? How is it developed? Explain with steps.

210. My requirement is like this. Here is the codification suggested:

SALE_HEADER_XXXXX_YYYYMMDD.PSV
SALE_LINE_XXXXX_YYYYMMDD.PSV
XXXXX = LVM sequence to ensure uniqueness and continuity of file exchanges. Caution: there will be an increment to implement.
YYYYMMDD = LVM date of file creation

COMPRESSION AND DELIVERY TO:
SALE_HEADER_XXXXX_YYYYMMDD.ZIP AND SALE_LINE_XXXXX_YYYYMMDD.ZIP

If we run that job, the target file names are like this: sale_header_1_20060206 & sale_line_1_20060206.
If we run next time means the

211. What is the purpose of the exception activity in DataStage 7.5?

212. How do you implement slowly changing dimensions in DataStage?

213. What does the separation option in a static hash file mean?

214. How do you improve the performance of a hash file?

215. Actually, my requirement is like this. Here is the codification suggested:

SALE_HEADER_XXXXX_YYYYMMDD.PSV
SALE_LINE_XXXXX_YYYYMMDD.PSV
XXXXX = LVM sequence to ensure uniqueness and continuity of file exchanges. Caution: there will be an increment to implement.
YYYYMMDD = LVM date of file creation

COMPRESSION AND DELIVERY TO:
SALE_HEADER_XXXXX_YYYYMMDD.ZIP AND SALE_LINE_XXXXX_YYYYMMDD.ZIP

If we run that job, the target file names are like this: sale_header_1_20060206 & sale_line_1_20060206.
If we run next

216. How do you check the consistency and integrity of the model and repository?

217. How can we call a routine in a DataStage job? Explain with steps.

218. What is job control? How can it be used? Explain with steps.

219. How do you find the number of rows in a sequential file?

220. If the size of the hash file exceeds 2 GB, what happens? Does it overwrite the current rows?

221. Where do we use the Link Partitioner in a DataStage job? Explain with an example.

222. How do I create a DataStage engine stop/start script? My idea is as below.

#!/bin/bash
dsadm - user
su - root
password (encrypt)
DSHOMEBIN=/Ascential/DataStage/home/dsadm/Ascential/DataStage/DSEngine/bin
if check ps -ef | grep DataStage (client connection is there) { kill -9 PID (client connection) }
uv -admin -stop > /dev/null
uv -admin -start > /dev/null
verify process
check the connection
echo "Started properly"
run it as dsadm

223. Can we use a shared container as a lookup in DataStage server jobs?

224. What is the meaning of an instance in DataStage? Explain with examples.

225. What is the difference between "validated OK" and "compiled" in DataStage?

226. Hi all, what are AuditStage, ProfileStage, and QualityStage in DataStage? Please explain in detail.

227. What are ProfileStage, QualityStage, and AuditStage in DataStage? Please explain
in detail. Thanks in advance.

228. What are the environment variables in DataStage? Give some examples.

229. What is the difference between the Merge stage and the Join stage?

230. Hi, can anyone explain what the DB2 UDB utilities are?

231. What is the difference between the DRS and ODBC stages?

232. Will DataStage consider the second constraint in the Transformer once the first condition
is satisfied (if the link ordering is given)?

233. How do you do usage analysis in DataStage?

234. 1) How can you implement slowly changing dimensions in DataStage? Explain. 2) Can you join a flat
file and a database in DataStage? How?

235. How can you implement complex jobs in DataStage?

236. DataStage from staging to MDW is only running at 1 row per second! What do we do to
remedy this?

237. What is meant by the following? Try to have the constraints in the 'Selection' criteria of the jobs itself. This
will eliminate the unnecessary records even getting in before joins are made.

238. What are constraints and derivations? Explain the process of taking a backup in
DataStage. What are the different types of lookups available in DataStage?

239. How does DataStage handle user security?

240. What are the steps involved in the development of a job in DataStage?

241. What is a project? Specify its various components.

242. What does a config file in Parallel Extender consist of?
The config file consists of the following: a) number of processes or nodes; b) actual disk storage
location.

243. How do you implement Type 2 slowly changing dimensions in DataStage? Explain with an example.

244. How large would the database be in DataStage? What is the difference between
in-process and inter-process?

245. Briefly describe the various client components?

246. What are the Orabulk and BCP stages?

247. What is DS Director used for - did you use it?

248. What is the meaning of file extender in DataStage server jobs? Can we run one DataStage job from
another job? Where is that file data stored, and what is the file extender in DS jobs?

249. What is the max capacity of a hash file in DataStage?

250. What is merge and how can it be done? Please explain with a simple example taking 2 tables.

251. Is it possible to run parallel jobs in server jobs?

252. What enhancements were made in DataStage 7.5 compared with 7.0?

253. If I add a new environment variable in Windows, how can I access it in DataStage?

254. What is OCI?

255. Is it possible to move data from an Oracle warehouse to an SAP warehouse using the
DataStage tool?

256. How can we create containers?

257. What is a data set? And what is a file set?

258. How can I extract data from DB2 (on IBM iSeries) to the data warehouse via DataStage as
the ETL tool? I mean, do I first need to use ODBC to create connectivity and use an adapter for
the extraction and transformation of data? Thanks so much if anybody could provide an answer.

259. Is it possible to call one job from another job in server jobs?

260. How can we pass parameters to a job by using a file?

261. How can we implement a lookup in DataStage server jobs?

262. What is the User Variables activity? When is it used, how is it used, and where is it used? Give a real example.

263. Did you parameterize the job or hard-code the values in the jobs?
Always parameterized the job. Either the values come from Job Properties or from a
'Parameter Manager' - a third-party tool. There is no way you will hard-code some parameters in
your jobs. The o

264. What is a hashing algorithm? Explain briefly how it works.

265. What happens if the output of a hash file is connected to a Transformer? What error does it throw?

266. What is merge, and how do you use merge? Merge is nothing but filter conditions that have
been used for filtering.

267. What will you do in a situation where somebody wants to send you a file and use that file as
an input or reference and then run the job?

268. What is the DataStage NLS equivalent of the Oracle NLS code American_America.US7ASCII?

269. Why do you use SQL*Loader or the OCI stage?

270. What about system variables?

271. What are the differences between DataStage 7.0 and 7.5 in server jobs?

272. How does the hash file do a lookup in server jobs? How does it compare the key values?

273. How do you handle rejected rows in DataStage?

274. How is DataStage 4.0 functionally different from the Enterprise Edition now? What are the
exact changes?

275. What is the Hash File stage and what is it used for?
It is used for lookups. It is like a reference table. It is also used in place of ODBC and OCI tables for
better performance.

276. What is the utility you use to schedule jobs on a UNIX server other than using Ascential
Director?
Use the crontab utility along with the dsexecute() function, with proper parameters passed.

277. How can I connect my DB2 database on AS400 to DataStage? Do I need to use ODBC first
to open the database connectivity and then use an adapter just for connecting between the two?
Thanks a lot for any replies.

278. What is OCI, and how do you use the ETL tools?
OCI stands for Oracle Call Interface. When the client has bulk data and its retrieval time is high,
Orabulk is used so that the data is divided and retrieved. Asked by: ramanamv

279. What is the difference between server jobs and parallel jobs?

280. What is the difference between DataStage and DataStage TX?

281. Hi! Can anyone tell me how to extract data from more than one heterogeneous source, for
example one sequential file, Sybase, and Oracle, in a single job?

282. How can we improve the performance of DataStage jobs?

283. How good are you with your PL/SQL?


On a scale of 1-10, say 8.5-9.

284. What are the OConv() and IConv() functions and where are they used?
IConv() - converts a string to an internal storage format.
OConv() - converts an expression to an output format.

285. If data is partitioned in your job on key 1 and then you aggregate on key 2, what issues
could arise?

286. How can I specify a filter command for processing data while defining sequential file output
data?

287. There are three different types of user-created stages available for PX. What are they?
Which would you use? What are the disadvantages of using each type?

288. What is DS Manager used for - did you use it?

289. What are sequencers?
Sequencers are job control programs that execute other jobs with preset job parameters.

290. Functionality of Link Partitioner and Link Collector?

291. Containers: usage and types?
A container is a collection of stages used for the purpose of reusability. There are 2 types of
containers: a) Local container: job specific. b) Shared container: used in any job within a
project.

292. Does Enterprise Edition only add parallel processing for better performance? Are any
stages/transformations available in the Enterprise Edition only?

293. What validations do you perform after creating jobs in Designer? What are the different types of
errors you faced during loading and how did you solve them?

294. How can you do an incremental load in DataStage?

295. How do we use the NLS function in DataStage? What are the advantages of the NLS function? Where
can we use it? Explain briefly.

296. Dimension modelling types along with their significance
Data modelling is broadly classified into 2 types: a) E-R diagrams (entity-relationships). b)
Dimensional modelling.

297. Did you work in a UNIX environment?
Yes. It is one of the most important requirements.

298. What other ETL tools have you worked with?
Informatica, and also DataJunction if it is present in your resume.

299. What is APT_CONFIG in DataStage?

300. Is the BibhudataStage Oracle plug-in better than the OCI plug-in that comes with DataStage?
What are the BibhudataStage extra functions?

301. How do we automate dsjobs?

302. What is troubleshooting in server jobs? What are the different kinds of errors encountered while
running any job?

303. What are DataStage multi-byte and single-byte file conversions? How do we use those conversions in
DataStage?

304. What other performance tuning have you done in your last project to increase the
performance of slowly running jobs?
Staged the data coming from ODBC/OCI/DB2UDB stages or any database on the server using
hash/sequential files for optimum performance and also for data recovery in case the job aborts. Tuned
the OCI stage for '

305. What are DataStage multi-byte and single-byte file conversions in mainframe jobs? What is UTF-8?
What is the use of UTF-8?

306. What happens if RCP is disabled?

307. What are routines, where/how are they written, and have you written any routines
before?
Routines are stored in the Routines branch of the DataStage Repository, where you can create,
view, or edit them. The following are the different types of routines: 1) Transform functions

308. What is version control?

309. Hi, what are the repository tables in DataStage?

310. I want to process 3 files sequentially, one by one. How can I do that? While processing, the
files should be fetched automatically.

311. Where does a DataStage UNIX script execute: on the client machine or on the server? Suppose
it executes on the server; then will it execute?

312. Please list the versions of the DataStage parallel and server editions and the years in which they were
released.

313. What are job parameters?

314. Default nodes for DataStage Parallel Edition

315. Orchestrate vs. DataStage Parallel Extender?

316. Dimensional modelling is again subdivided into 2 types.
a) Star schema - simple and much faster. Denormalized form. b) Snowflake schema - complex,
with more granularity. More normalized form.

317. Tell me the environment in your last projects.
Give the OS of the server and the OS of the client of your most recent project.

318. How can we join one Oracle source and a sequential file?

319. What are modulus and splitting in a dynamic hashed file?
In a dynamic hashed file, the size of the file keeps changing as data is added and removed. The
modulus is the number of groups in the file. When the file grows, groups are split, which is called
"splitting" and increases the modulus.

320. Scenario-based question: Suppose 4 jobs are controlled by a sequencer (job 1, job
2, job 3, job 4). Job 1 has 10,000 rows, but after the run only 5,000 rows have been loaded into the
target table; the remaining rows are not loaded and the job aborts. How can you sort out
the problem?
Suppose the job sequencer synchronizes or controls 4 jobs but job 1 has a problem. In this condition you
should go to Director and check what type of problem it is showing: a data type problem, a warning
message, job Asked by: Mukesh Kumar Madhav

321. What is a batch program and how can it be generated?
A batch program is a program generated at run time and maintained by DataStage itself, but you
can easily change it on the basis of your requirements (extraction, transformation, loading).
Batch progr

322. In how many places can you call routines?
You can call them in four places: (i) Transform of routine: (A) Date transformation, (B) Upstring
transformation; (ii) Transform of the before and after subroutines; (iii) XML
transformation; (iv) Web base t Asked by: Mukesh Kumar Madhav

323. How many jobs have you created in your last project?
100+ jobs every 6 months if you are in development; if you are in testing, 40 jobs every 6
months, although it need not be the same number for everybody.

324. What is the difference between DataStage developers and DataStage designers? What
skills are required for each?

325. Could you please help me with a set of questions on Parallel Extender?

326. What is the difference between DataStage and Informatica?

327. Suppose there are a million records: did you use OCI? If not, which stage do you prefer?

328. What are the types of hashed file?

329. How do you eliminate duplicate rows?

330. What is DS Designer used for - did you use it?

331. Compare and Contrast ODBC and Plug-In stages?


ODBC: a) Poor performance. b) Can be used for a variety of databases. c) Can handle stored
procedures.
Plug-in: a) Good performance. b) Database specific (only one database). c) Cannot
handle Stored Pr

332. What is project life cycle and how do you implement it?

333. Explain your last project and your role in it.?

334. What are the often-used stages, or the stages you worked with, in your last project?
Transformer, ORAOCI8/9, ODBC, Link Partitioner, Link Collector, Hash,
Aggregator, Sort.

335. Have you ever been involved in updating DS versions, such as DS 5.X? If so, tell us some of the steps
you have taken in doing so.
Yes. The following are some of the steps I have taken in doing so: 1) Definitely take a backup of
the whole project(s) by exporting the project as a .dsx file. 2) See that you are using the same
parent

336. What versions of DS have you worked with?


DS 7.0.2/6.0/5.2

337. If you worked with DS 6.0 and later versions, what are Link Partitioner and Link Collector used
for?
Link Partitioner - used for partitioning the data.
Link Collector - used for collecting the partitioned data.
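
The partition-then-collect idea can be illustrated with a small Python sketch. It is not DataStage code: it assumes a round-robin algorithm on both sides (round robin is the algorithm named in question 205), and the function names are hypothetical.

```python
from itertools import zip_longest
from typing import List, Sequence, TypeVar

T = TypeVar("T")
_MISSING = object()  # sentinel marking exhausted partitions while collecting


def partition_round_robin(rows: Sequence[T], n_partitions: int) -> List[List[T]]:
    """Deal rows out to n_partitions lists in turn, like a partitioner."""
    if n_partitions < 1:
        raise ValueError("n_partitions must be at least 1")
    partitions: List[List[T]] = [[] for _ in range(n_partitions)]
    for index, row in enumerate(rows):
        partitions[index % n_partitions].append(row)
    return partitions


def collect_round_robin(partitions: Sequence[Sequence[T]]) -> List[T]:
    """Read one row from each partition in turn, like a collector."""
    collected: List[T] = []
    for group in zip_longest(*partitions, fillvalue=_MISSING):
        collected.extend(row for row in group if row is not _MISSING)
    return collected


if __name__ == "__main__":
    parts = partition_round_robin(list(range(1, 8)), 3)
    print(parts)                       # [[1, 4, 7], [2, 5], [3, 6]]
    print(collect_round_robin(parts))  # [1, 2, 3, 4, 5, 6, 7]
```

In a real job the work done between the two steps runs on each partition independently, which is where the performance gain comes from.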

338. How did you handle an 'Aborted' sequencer?


In almost all cases we have to delete the data inserted by this from DB manually and fix the job
and then run the job again.

339. How did you connect with DB2 in your last project?
Most of the time the data was sent to us in the form of flat files. The data is dumped and sent to
us. In some cases where we needed to connect to DB2 for lookups, for instance, we used
ODBC drive

340. Read the String functions in DS
Functions like [] -> sub-string function and ':'

341. How would you call an external Java function that is not supported by DataStage?
Starting from DS 6.0 we have the ability to call external Java functions using a Java package
from Ascential. In this case we can even use the command line to invoke the Java function and
write the re

342. The above might raise another question: why do we have to load the dimension tables first,
then the fact tables?
As we load the dimension tables, the (primary) keys are generated, and these (primary) keys are
foreign keys in the fact tables.
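
The load-order dependency can be sketched in a few lines of Python. This is a simplified illustration, not a DataStage job: the dimension load generates a key per natural key, and the fact load can only resolve its foreign keys once that mapping exists. The function names and the customer/amount columns are hypothetical.

```python
from typing import Dict, Iterable, List, Tuple


def load_dimension(natural_keys: Iterable[str], start: int = 1) -> Dict[str, int]:
    """Assign a generated (primary) key to each distinct natural key."""
    key_map: Dict[str, int] = {}
    next_key = start
    for natural_key in natural_keys:
        if natural_key not in key_map:
            key_map[natural_key] = next_key
            next_key += 1
    return key_map


def load_facts(source_rows: Iterable[Tuple[str, float]],
               dimension: Dict[str, int]
               ) -> Tuple[List[Tuple[int, float]], List[Tuple[str, float]]]:
    """Swap each row's natural key for the dimension key; reject unknown keys."""
    loaded: List[Tuple[int, float]] = []
    rejected: List[Tuple[str, float]] = []
    for natural_key, amount in source_rows:
        if natural_key in dimension:
            loaded.append((dimension[natural_key], amount))
        else:
            rejected.append((natural_key, amount))
    return loaded, rejected


if __name__ == "__main__":
    customer_dim = load_dimension(["C10", "C20", "C10"])
    facts = [("C20", 5.0), ("C10", 7.5), ("C99", 1.0)]
    print(load_facts(facts, customer_dim))
    # ([(2, 5.0), (1, 7.5)], [('C99', 1.0)])
```

If the fact load ran first, every row would land in the rejected list because no dimension keys would exist yet.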

343. Tell me one situation from your last project where you faced a problem, and how did you
solve it?
A. The jobs in which data is read directly from OCI stages were running extremely slowly. I had to
stage the data before sending it to the Transformer to make the jobs run faster. B. The job aborts

344. Does the selection of 'Clear the table and Insert rows' in the ODBC stage send a TRUNCATE
statement to the DB or does it do some kind of delete logic?
There is no TRUNCATE on ODBC stages. It is 'Clear the table ...', and that is a DELETE FROM
statement. On an OCI stage such as Oracle, you do have both Clear and Truncate options. They
are radically di

345. How do you rename all of the jobs to support your new file-naming conventions?
Create an Excel spreadsheet with the new and old names. Export the whole project as a .dsx file. Write a Perl
program that does a simple rename of the strings by looking up the Excel file.
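
The rename step can be sketched as follows. The answer above mentions a Perl program; this sketch uses Python instead and assumes the spreadsheet has been saved as CSV with the hypothetical column headers old_name and new_name. The sample export text in the example is made up for illustration and is not a complete .dsx file.

```python
import csv
import io
import re
from typing import Dict


def load_rename_map(csv_text: str) -> Dict[str, str]:
    """Read old/new job names from CSV text with old_name,new_name columns."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return {row["old_name"]: row["new_name"] for row in reader}


def rename_jobs(dsx_text: str, rename_map: Dict[str, str]) -> str:
    """Replace whole-name occurrences of each old job name in the export text."""
    if not rename_map:
        return dsx_text
    # Longest names first, so "LoadCustAddr" is not matched as "LoadCust".
    names = sorted(rename_map, key=len, reverse=True)
    pattern = re.compile(
        r"(?<![A-Za-z0-9_])("
        + "|".join(re.escape(name) for name in names)
        + r")(?![A-Za-z0-9_])"
    )
    return pattern.sub(lambda match: rename_map[match.group(1)], dsx_text)


if __name__ == "__main__":
    mapping = load_rename_map(
        "old_name,new_name\nLoadCust,DIM_LoadCust\nLoadCustAddr,DIM_LoadCustAddr\n"
    )
    sample = 'Identifier "LoadCust"\nIdentifier "LoadCustAddr"\n'
    print(rename_jobs(sample, mapping))
```

Doing all replacements in a single pass keeps a new name from being renamed a second time when it happens to contain an old name.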
346. When should we use an ODS?
DWHs are typically read-only and batch-updated on a schedule. ODSs are maintained in closer to real
time, trickle-fed constantly.

347. How do you create batches in DataStage from the command prompt?