
Infosys Interview Questions

1. Explain your project architecture.
2. What is a staging area, and why are you using one in your project?
3. If a source system contains records for three months (Jan, Feb, March) and I want to load
the data into three targets (Jan, Feb, Mar), draw the mapping.
4. What is the difference between a static lookup and a dynamic lookup?
5. What are the different types of joins in the Joiner transformation?
6. In a Filter transformation, if I set the filter condition to 4, what happens?
7. In an Aggregator transformation, if I enable the Sorted Input option but don't pass sorted data, what happens?
8. Explain the Normalizer transformation. Give one example of how it converts a single input record into multiple
output records.
9. What is verbose initialization?

HP Interview Questions and Answers


1. What is meant by an inline view?
Ans: An inline view is a SELECT statement in the FROM clause of another SELECT statement.
Inline views are commonly used to simplify complex queries by removing join operations
and condensing several separate queries into a single query.
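A minimal sketch of an inline view, run against an in-memory SQLite database so it is self-contained. The `emp` table, its columns, and the sample rows are assumptions made up for illustration; the same FROM-clause pattern applies in Oracle.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE emp (empno INTEGER, deptno INTEGER, sal INTEGER);
    INSERT INTO emp VALUES (1, 10, 1000), (2, 10, 2000), (3, 20, 3000);
""")

# The SELECT inside FROM (...) is the inline view: it computes the average
# salary per department, and the outer query filters on that result.
rows = conn.execute("""
    SELECT d.deptno, d.avg_sal
    FROM (SELECT deptno, AVG(sal) AS avg_sal
          FROM emp
          GROUP BY deptno) d
    WHERE d.avg_sal >= 2000
    ORDER BY d.deptno
""").fetchall()
print(rows)
```

Only department 20 passes the filter here, because department 10 averages 1500.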
2. What is a conformed dimension?
Ans: A dimension table that can be shared by more than one fact table is known as a conformed
dimension.
Ex: date dimension.
3. What is a factless fact table?
Ans: A factless fact table captures many-to-many relationships between dimensions, and it doesn't
contain any numeric measures.
Ex: student attendance tracking;
stores that did not sell a product over a period of time.
4. I have 3 flat files with the same structure. How do I combine them without using a Joiner
transformation?
Ans: By using the file list concept, with the source file type set to Indirect.
5. How do you delete duplicate records in Unix?
Ans: sort filename.txt | uniq (or sort -u filename.txt). Note that uniq -u prints only the lines that are never repeated, so it drops every copy of a duplicated record instead of keeping one.
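The difference between keeping one copy of each record and keeping only never-repeated records can be sketched in Python. This is an illustration of the two behaviours, not a replacement for the shell commands; the sample `lines` list is made up.

```python
from collections import Counter

lines = ["b", "a", "b", "c", "a"]

# Same result as `sort | uniq`: one copy of every distinct line.
deduplicated = sorted(set(lines))

# Same result as `sort | uniq -u`: only the lines that occur exactly once.
counts = Counter(lines)
never_repeated = sorted(line for line, n in counts.items() if n == 1)

print(deduplicated)
print(never_repeated)
```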
6. What is a surrogate key?
Ans: A surrogate key is a system-generated sequence number used as a primary key.
7. What is the difference between a surrogate key and a primary key?
Ans:
Surrogate key                                 Primary key
1. Artificial key                             1. Natural key
2. Generated by a system sequence number      2. Generated by the database

8. What is a subquery and a correlated subquery?
Ans: Subquery: A query nested inside a SELECT statement is known as a subquery
and is an alternative to complex join statements.
Correlated subquery: A subquery (a query nested inside another query)
that uses values from the outer query in its WHERE clause. The subquery
is evaluated once for each row processed by the outer query.
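A short sketch of a correlated subquery, using an in-memory SQLite database. The `emp` table and its rows are assumptions for illustration. The inner query refers to `e.deptno` from the outer query, so it is logically re-evaluated for each outer row.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE emp (ename TEXT, deptno INTEGER, sal INTEGER);
    INSERT INTO emp VALUES ('ram', 10, 1000), ('sam', 10, 3000), ('tom', 20, 2000);
""")

# Employees earning more than the average of their own department.
# The inner WHERE clause uses e.deptno, a value from the outer query.
above_avg = conn.execute("""
    SELECT e.ename
    FROM emp e
    WHERE e.sal > (SELECT AVG(i.sal) FROM emp i WHERE i.deptno = e.deptno)
    ORDER BY e.ename
""").fetchall()
print(above_avg)
```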

Accenture Telephonic Interview Questions (6-06-2011)

1. What is the difference between 7.x and 8.x?
2. What is the SQL transformation? Explain.
3. What are a domain and a node?
4. What is the use of the associated port in a Lookup transformation?
5. What are the differences between a relational (SQL) Source Qualifier and a flat file Source Qualifier?
6. I have 1000 flat files and have already loaded 700 of them. How do you load the remaining 300 flat
files?
7. I have to load 1 lakh (100,000) records into the production system, and the session fails midway.
What can you do (you can't restart, and the session is not configured for recovery mode)?
8. How do you migrate code from the development environment to the production environment?
9. Which type of SCD2 do you use in your project?
10. If your source data contains no duplicates, you don't want to perform incremental
aggregation, and there is no need to sort the data, what is the use of a Sorter transformation before the Aggregator?
11. If my parameter file doesn't contain any values, how does the Integration Service process
the parameter file?
12. If you are using a mapping variable and the session fails, what value is stored
in the repository?
13. Which reusable tasks have you used?

TCS (Whitefield, Bangalore) Interview Questions (4-06-2011)

1. Explain your project.
2. How do you display duplicate records in a table by using a SQL query?
Ex:
100 aaa
100 aaa
100 aaa
101 bbb
102 ccc
I want to display:
100 aaa
100 aaa
100 aaa
3. Explain SCD Type 2. I have the data
d1 1 mum
d2 1 kol
d3 1 hyd
How will it be inserted?
4. What is a mapplet? Explain where you can use a mapplet in real time.
5. Given
emp
boss
sales transaction
hra a/c
which of these are fact tables and which are dimension tables?
6. Explain mapping parameters and variables.
7. What is a factless fact table? Explain where you can use it in real time.
8. What is the difference between IN and EXISTS (subqueries)?
9. I have 100 records in my target table. The next day I want to insert 5 records
and update 10 records. What is the total number of records in my target table (using
SCD2)?
10. What is the difference between a connected and an unconnected lookup?
11. Which transformation can you use to capture rejected records, and where are they
stored?
12. What are a shortcut and a reusable transformation?
13. My table has 1000 records and I want to get 100 records. What is the SQL query?
14. What is the difference between a Source Qualifier and a Filter?
15. What are abort and stop, and where can we use abort?
16. What are the types of tracing levels, and what is verbose initialization?
17. What is the difference between Union and Joiner?
18. What are reusable and non-reusable tasks?

HEADSTRONG INTERVIEW QUESTIONS


1. How do you update records in the target without using an Update Strategy?
Without a primary key we can't update records. You can write a SQL query at the target
level: the target table properties have an Update Override option.
Alternatively, in the session properties set the "Treat Source Rows As" option to Update, and in the target
properties select the Update option.
2. Can we look up a table from a Source Qualifier transformation?
Yes, we can look up a table from the Source Qualifier by using a SQL override.
3. How can we stop a batch?
By using Server Manager or pmcmd.
4. Can you start a batch within a batch?
You cannot. If you want to start a batch that resides in another batch, create a new independent batch
and copy the necessary sessions into the new batch.
5. Can you start a session inside a batch individually?
We can start an individual session only in a sequential batch. In a concurrent
batch we can't do this.
6. How do you connect to a remote server?
Using FTP, or through Telnet/PuTTY/command prompt.
7. Which command is used to run a batch?
pmcmd is a command-line program. It can be used to stop, run, and abort a session.
8. If I make any modifications to my table in the back end, are they reflected in the Informatica Warehouse
Designer, Mapping Designer, or Source Analyzer?
No. Informatica is not concerned with the back-end database. It displays the
information that is stored in the repository. To reflect back-end changes in the Informatica screens, you
have to import again from the back end into Informatica over a valid connection, and replace the existing files with the imported
files.
9. In how many ways can you create ports?
1. Drag the port from another transformation.
2. Click the Add button on the Ports tab.
10. Which is better among incremental load, normal load, and bulk load?
It depends on the requirement. Otherwise, incremental load can be better, as it takes
only the data that is not already available in the target.
11. What is the difference between the Informatica Server, the Repository Server, and the PowerCenter
Server?
Informatica Server and PowerCenter Server mean the same thing. It is concerned with
running sessions and workflows.
The Repository Server (or Informatica Rep Server) is concerned with connecting to the
repository through the Rep Agent.
One Repository Server can connect to many repositories at a time.
For the PowerCenter Server to be up and working, the Repository Server
must be running.
12. My source table has different date formats, and I want to load the data into my target in a single
date format. How can this be achieved?
By using to_date(datecolumn,'MM/DD/YYYY'). A single format string converts only one input format, so when several formats are mixed, each one has to be detected and converted with its own format string.
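As a rough illustration of normalizing mixed date formats, the Python sketch below tries a list of candidate formats in turn and writes every value out in one format. The `KNOWN_FORMATS` list and the function name are assumptions for the example; in a mapping the same idea would be expressed with one conversion per detected format.

```python
from datetime import datetime

# Hypothetical list of formats present in the source column.
KNOWN_FORMATS = ["%m/%d/%Y", "%Y-%m-%d", "%d.%m.%Y"]


def to_single_format(text, out_format="%m/%d/%Y"):
    """Parse text with the first matching format and re-emit it in out_format."""
    for fmt in KNOWN_FORMATS:
        try:
            return datetime.strptime(text, fmt).strftime(out_format)
        except ValueError:
            continue  # not this format, try the next one
    raise ValueError(f"unrecognized date format: {text!r}")


converted = [to_single_format(s) for s in ["03/15/2011", "2011-03-15", "15.03.2011"]]
print(converted)
```

The order of `KNOWN_FORMATS` matters when a value is ambiguous (for example 01/02/2011), so the list should reflect what the source system actually sends.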

13. While extracting data from mainframe sources, which transformation should we use?
Mainframe files are used as VSAM files in Informatica, read by using the Normalizer
transformation.
14. I have 7 flat files and I want to join them. How many Joiner transformations are required?
6 (n-1).
15. What is pushdown optimization?
Pushdown optimization is a way of balancing load among servers in order to achieve optimal
performance.
Veteran ETL developers often come across issues when they need to determine the
appropriate place to perform ETL logic.
Suppose some ETL logic needs to filter out data based on a condition. One can either do it
in the database, by using a WHERE condition
in the SQL query, or inside Informatica, by using a Filter transformation. Sometimes
we can even "push" some transformation logic
to the target database instead of doing it on the source side (especially in the case of ELT
rather than ETL).
Such optimization is crucial for overall ETL performance.
16. Explain the persistent lookup cache.
If the lookup source does not change between sessions, configure the Lookup transformation
to use a persistent lookup cache.
The Integration Service then saves and reuses cache files from session to session,
eliminating the time required to read the lookup source.
17. I want to send an email only when no data is loaded into the target. How?
After your session, create an Email task. Connect the session and the Email task with the link
condition
TgtSuccessRows = 0 and PrevTaskStatus = SUCCEEDED.

HP recent interview questions which I have faced

1. What is a degenerate dimension?
Ans: A degenerate dimension is data that is dimensional in nature but stored in a fact table.
2. What is a conformed dimension?
Ans: A dimension table that can be shared by multiple fact tables is known as a conformed
dimension.
3. What is normalization? Explain third normal form.
The process of organizing data to minimize redundancy is called normalization.
Normalization usually involves dividing large tables into smaller (and less redundant) tables
and defining relationships between them.
Third Normal Form (3NF): No duplicate information is permitted. Formally, a table is in 3NF when it is in second normal form and no non-key column depends on another non-key column. So, for example, if two
tables
both require a birthdate field, the birthdate information would be separated into a separate
table,
and the two other tables would then access the birthdate information via an index field in
the birthdate table.
Any change to a birthdate would automatically be reflected in all tables that link to the
birthdate table.
4. What is denormalization?
Denormalization is the process of attempting to optimize the read performance of a database
by adding redundant data or by grouping data.
5. How can you receive a mapping specification document in Word or Excel format?
6. How can you identify an index on a column in a table?
7. How often should you contact your client?

1. How can we generate a sequence of key values in the target without using the Sequence Generator
transformation?
Take an Expression transformation, double-click it, and add two new ports:
1. SEQ
2. SEQ_OUT

1. For the SEQ port, check V (variable port) and enter the expression SEQ+1.
2. For the SEQ_OUT port, check O (output port) and enter the
expression IIF(ISNULL(SEQ),0,SEQ).
This will give the correct result.
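The variable-port technique above amounts to a counter that keeps its value from one row to the next. The sketch below mimics that in Python; it assumes the counter starts at 0 before the first row, and the function and variable names are made up for illustration.

```python
def add_sequence(rows, start=0):
    """Attach a running key to each row, like SEQ / SEQ_OUT in the Expression."""
    seq = start              # plays the role of the SEQ variable port (assumed to start at 0)
    for row in rows:
        seq = seq + 1        # SEQ = SEQ + 1, evaluated once per row
        yield (seq, row)     # SEQ_OUT passes the current value to the target


keyed = list(add_sequence(["ram", "sam", "tom"]))
print(keyed)
```

Unlike a Sequence Generator, a counter like this restarts on every run unless the last value is stored somewhere and passed back in as `start`.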

2. Why is the Normalizer transformation not allowed in a mapplet?

A mapplet is reusable logic that you can use across different mappings. The Normalizer is a
dynamic transformation
that converts rows to columns or vice versa, so it is dependent on the input to the
Normalizer; it is not
fixed logic that you can reuse in other mappings.
3. I want to load data into two targets, one a dimension table and the other a fact
table. How can I load them at the same time?
Using a dynamic lookup, you can load both simultaneously.
4. What is a shortcut in Informatica? What is the difference between a shortcut and a reusable object?

A shortcut is a referenced object. We cannot edit shortcut objects. Shortcut objects are
present under a
shared folder. Mostly sources and targets will be the shortcut objects. A reusable object can
be used multiple
times in a mapping. Shortcuts can be used in all the folders available in that repository, but
reusable objects
are limited to their own folder.

5. Can we update records in the target using an Update Strategy without a primary
key? Explain.
No. Using an Update Strategy without primary keys, an update is not possible. Try it and read the
session log file once;
it will display a message that updates are not supported without primary keys. The update override in the
target overrides
the update performed by the Update Strategy transformation, and it updates only non-primary-key columns
such as dname and loc, but not deptno.
6. What is the exact difference between IN and EXISTS in Oracle?
EXISTS is used as a conditional operator with a subquery. IN is also used as a
conditional operator, to
which we give a list of elements.
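The two operators can be compared side by side with a small in-memory SQLite example. The `dept` and `emp` tables and their rows are assumptions for illustration; the query shapes are the same in Oracle.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE dept (deptno INTEGER, dname TEXT);
    CREATE TABLE emp (ename TEXT, deptno INTEGER);
    INSERT INTO dept VALUES (10, 'ACCOUNTING'), (20, 'RESEARCH'), (30, 'SALES');
    INSERT INTO emp VALUES ('ram', 10), ('sam', 20);
""")

# IN: the column is compared against a list of elements.
with_in = conn.execute(
    "SELECT dname FROM dept WHERE deptno IN (10, 20) ORDER BY dname"
).fetchall()

# EXISTS: the row qualifies if the subquery returns at least one row.
with_exists = conn.execute("""
    SELECT d.dname
    FROM dept d
    WHERE EXISTS (SELECT 1 FROM emp e WHERE e.deptno = d.deptno)
    ORDER BY d.dname
""").fetchall()

print(with_in)
print(with_exists)
```

With this sample data both queries return the same departments; IN can also take a subquery instead of a literal list.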

7. What is a threshold error in Informatica?

When the target is loaded through an Update Strategy using DD_REJECT or DD_UPDATE and a limit
count is set, then if the number
of rejected records exceeds that count, the session ends with a failed status. This error is
called a threshold error.

8. If the values in a row are 1,2,3,3,4,4,5, how does a Rank transformation rank these
values?
Ans: 1,2,3,3,5,5,7
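The answer follows the ranking scheme in which tied values share a rank and the next rank is skipped. A small Python sketch of that scheme, assuming the values are ranked in ascending order (the function name is made up for illustration):

```python
def competition_rank(values):
    """Give tied values the same rank and skip the ranks the ties consumed."""
    ordered = sorted(values)
    first_position = {}
    for position, value in enumerate(ordered, start=1):
        # Remember only the first position at which each value appears.
        first_position.setdefault(value, position)
    return [first_position[v] for v in values]


ranks = competition_rank([1, 2, 3, 3, 4, 4, 5])
print(ranks)
```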
9. In a Sequence Generator transformation, the maximum limit is reached. After that,
how will you insert the data?
When a Sequence Generator transformation reaches its maximum value and the sequence is not
configured to cycle, the session fails.
Either cycle the sequence or increase the end value.

10. I have 2 flat files; one contains duplicates and the other does not. I don't want
duplicates in the target. How do I do this?
Place a Sorter transformation after the Source Qualifier of the flat file that has duplicates and enable the
Distinct property.

11. What is meant by data driven, and in which scenario do we use it?

Data driven is available at the session level. It applies when we are using an Update Strategy transformation and determines how
the Integration Service
fetches the data and how it updates/inserts rows in the database.
Or:
You can select the Data Driven option, where the Integration Service follows the instructions
coded into the Update Strategy
transformations within the session mapping.

12. Can we look up a table from a Source Qualifier transformation, i.e. as an unconnected lookup?
You cannot look up from a Source Qualifier directly. However, you can override the SQL in the
Source Qualifier to join with the lookup table to perform the lookup.

13. How can we stop a batch?

By using Server Manager or pmcmd.

14. Batches: a batch provides a way to group sessions for either serial or parallel execution by
the Informatica Server. There are two types of batches: sequential and concurrent.

15. Can you start a batch within a batch?

17. Does the Sequence Generator transformation use a cache? If so, what type of cache is it?
The Sequence Generator transformation uses an index cache for the sequential range of numbers for the generated keys.

18. How do you connect to a remote server?

We can connect to remote servers using FTP, or through Telnet/PuTTY/command prompt.
19. Which command is used to run a batch?
pmcmd is a command-line program. It can be used to stop, run, and abort a session.

20. If I make any modifications to my table in the back end, are they reflected in the Informatica
Warehouse Designer, Mapping Designer, or Source Analyzer?
No. Informatica is not concerned with the back-end database. It displays the
information that is
stored in the repository. To reflect back-end changes in the Informatica screens, you
have to import again from the back end into
Informatica over a valid connection, and replace the existing files with the imported
files.

21. How many types of repositories can we create in Informatica?

2 types of repositories can be created:
1. Standalone
2. Shared or global repository

22. In which situations do we go for a persistent lookup cache, and in
which situations for a shared lookup cache?

If the lookup source does not change between sessions, you can configure the transformation to use
a persistent
lookup cache. When you run the session, the Integration Service rebuilds the persistent
cache if any cache file
is missing or invalid.

23. What are the data movement modes in Informatica?

The data movement mode determines how the Informatica Server handles character data. You
choose the data movement mode in
the Informatica Server configuration settings. Two data movement modes are available in
Informatica:
ASCII mode
Unicode mode

25. How do you create a single Lookup transformation using multiple tables?
The Lookup transformation has a lookup override query. Use the SQL query to
join the tables
you look up on. This is similar to what you do in the Source Qualifier.

26. Discuss which is better among incremental load, normal load, and bulk load.

It depends on the requirement. Otherwise, incremental load can be better, as it takes
only the data that is not already available in the target.

In terms of performance, bulk is better than normal.

But both have some conditions on the source data.
The conditions are:
1) The data does not contain any constraints.
2) Don't use the double datatype; if it is necessary, use it as the last row of the table.
3) It does not support
28. How can we access mainframe tables in Informatica as a source?
We get the mainframe source as VSAM (Virtual Storage Access Method) files. Mainframe
developers
convert these files to flat files so that the data can be read into Informatica conveniently. The
bottom line is that
the mainframe data files are .dat files (flat files).
Interview Questions & Answers - 2

1. What are the advantages of converting stored procedures into Informatica mappings?

Ans: Informatica is an intermediate tool that talks to the database. If you use a stored procedure,
Informatica needs to talk to the database for each record, since Informatica processes data record by
record. Making a call to the database every time is costly. It is good to avoid stored procedures as much
as possible unless they are really necessary. If you write the same logic in Informatica, the
Informatica server executes it.
2. How do you list the top 10 salaries without using a Rank transformation?
Ans: By using a Sorter transformation with SAL as the sort port, followed by a Filter
transformation to get the first 10 records.
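The sort-then-filter idea can be sketched in Python as follows. This assumes the sort is descending on salary and that the filter keeps rows by their position after sorting; the sample salaries and the function name are made up.

```python
def top_n(salaries, n=10):
    """Sort salaries in descending order, then keep only the first n rows."""
    ordered = sorted(salaries, reverse=True)        # Sorter step: SAL descending
    return [
        sal
        for row_number, sal in enumerate(ordered, start=1)
        if row_number <= n                          # Filter step: row number <= n
    ]


salaries = list(range(1000, 16000, 1000))           # 1000, 2000, ..., 15000
top_ten = top_n(salaries)
print(top_ten)
```

In a mapping, the row number would have to come from a counter placed between the Sorter and the Filter.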

3. I want to load the data into the target without the 1st and last records.
Use the FIRST function to identify the first record. To identify the last record,
simply use an Aggregator transformation without any group-by port;
by default it returns the last record.
Or
use a Rank transformation and, in its properties, set:
No. of top ranks: 1
No. of bottom ranks: last record
4. Can you apply SCD2 on a flat file target? If yes, what is the procedure?
No, you can't apply SCD2 on flat files.

5. We have 6 records in the source. I need the 2nd record in one target and the 5th record in another target,
or the 2nd and 5th records in the same target.
One way is to use a Rank transformation and load data to the target by
filtering rows 2 and 5.
Or
we can use a Router transformation
and create as many conditions as needed.
Say, if rownum=2, send the row to target 1;
if rownum=5, send it to another target.
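The routing logic can be sketched in Python as a loop that numbers the rows and sends rows 2 and 5 to separate outputs. The function name and sample rows are assumptions for illustration.

```python
def route(rows):
    """Send the 2nd row to one target and the 5th row to another."""
    target_1, target_2 = [], []
    for row_number, row in enumerate(rows, start=1):
        if row_number == 2:        # first router group condition
            target_1.append(row)
        elif row_number == 5:      # second router group condition
            target_2.append(row)
    return target_1, target_2


second, fifth = route(["r1", "r2", "r3", "r4", "r5", "r6"])
print(second, fifth)
```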
6. If you give different parameter file names in the workflow and in the session for a single
mapping parameter, which one will be used: the one given in the workflow or the
one given in the session?

The workflow parameter file.

7. How do you update records in the target without using an Update Strategy?
Without a primary key we can't update records. You can write a SQL query at the target level:
the target table properties have an Update Override option.
Alternatively, in the session properties set the "Treat Source Rows As" option to
Update, and in the target properties select the Update option.
8. A table contains some null values. How do you get "Not Applicable" (NA)
in place of those null values in the target?
In the Ports tab, select the particular column and enter N/A
in the Default Value text box for that column.
9. How do you load a time dimension?
Run the procedure that loads the time dimension. It is not loaded frequently, only
once or twice a year.
10. What is the main data object present between source and target?
The main data object present between source and target is the staging layer.
The staging layer eliminates inconsistent data and provides the resulting data
object.
11. Can we use an unconnected lookup as a dynamic lookup?
No.
An unconnected lookup returns only one port, but a dynamic lookup returns
more than one port, and it updates and inserts into the target while the session runs.
12. Is a lookup condition nothing but a join condition? What type of join
is it by default? How many types of
relational conditions can we build with the lookup condition?
A lookup always behaves like a left outer join. It gives you all matched
records as well as the unmatched records that are not present in the lookup table, and
for those unmatched records the lookup result is NULL in the case of an unconnected Lookup transformation.
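The left-outer-join behaviour described above can be shown with a small SQLite example. The `src` and `lkp_dept` tables are made up for illustration, and the query is an analogy for how a lookup treats unmatched rows, not Informatica's internal implementation.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE src (empno INTEGER, deptno INTEGER);
    CREATE TABLE lkp_dept (deptno INTEGER, dname TEXT);
    INSERT INTO src VALUES (1, 10), (2, 99);
    INSERT INTO lkp_dept VALUES (10, 'ACCOUNTING');
""")

# Every source row is kept; rows with no match in the lookup table
# come back with NULL (None in Python) for the looked-up column.
looked_up = conn.execute("""
    SELECT s.empno, d.dname
    FROM src s
    LEFT OUTER JOIN lkp_dept d ON d.deptno = s.deptno
    ORDER BY s.empno
""").fetchall()
print(looked_up)
```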
13. I have two flat files containing the same type of data and I want to load them
into the DWH. How many Source Qualifiers do I need?
If the 2 flat files have the same structure, we can use the file list
concept in Informatica.
Only one Source Qualifier is needed, and the source definition can be either of the
flat files.
14. When do we use unconnected and connected lookups? How does it affect the
performance of the mapping?
Connected Lookup
- Receives input values directly from the pipeline.
- Can use a dynamic or static cache.
- Supports user-defined default values.
Unconnected Lookup
- Receives input values from the result of a :LKP expression in another transformation.
- Can use a static cache only.
- Does not support user-defined default values.
Comparing the two, a connected lookup can return multiple values
and an unconnected lookup returns one value. A connected lookup is in the same pipeline as the
source and
accepts dynamic caching. An unconnected lookup doesn't have that facility, but
in some special cases we can use it: if the output of one lookup goes
in as the input of another lookup, unconnected lookups are favourable. Moreover, if the
mapping requires multiple lookups using the same lookup conditions, it is better
to use an unconnected lookup and call it wherever required. This also
prevents the mapping from becoming complex.
15. How do you use a sequence created in Oracle in Informatica?
Explain with a simple example.
Using a Stored Procedure transformation, we can call the Oracle sequence.
16. In which situations do you go for a Sequence Generator?
We use the Sequence Generator transformation in the following
situations:
1) Creating primary key values
2) Replacing missing key values
3) Cycling through a sequential range of
numbers
17. Write a query to eliminate duplicate rows
without using DISTINCT.
DELETE FROM EMP WHERE ROWID NOT IN (SELECT MAX(ROWID) FROM
EMP GROUP BY EMPNO);
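The same delete can be tried out in an in-memory SQLite database, which also exposes a `rowid` for ordinary tables. The sample `emp` rows are made up; Oracle's ROWID is a different type, but the query shape is the same.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE emp (empno INTEGER, ename TEXT);
    INSERT INTO emp VALUES (101, 'ram'), (101, 'ram'), (102, 'sam'), (101, 'ram');
""")

# Keep the row with the highest rowid for each empno and delete the rest.
conn.execute("""
    DELETE FROM emp
    WHERE rowid NOT IN (SELECT MAX(rowid) FROM emp GROUP BY empno)
""")

remaining = conn.execute("SELECT empno, ename FROM emp ORDER BY empno").fetchall()
print(remaining)
```

Grouping by `empno` alone treats rows as duplicates when the key matches, even if other columns differ.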
18. The source is a flat file:
empname, empno, sal
ram, 101, 1,000
sam, 102, 2,000
My target needs sal to be loaded as 1000 and 2000, excluding the commas:
empname, empno, sal
ram, 101, 1000
sam, 102, 2000
How do I implement this?
Open the source file, choose Edit > Replace All, and replace "," with nothing.
Use fixed-width length while importing the source and .....
19. What is CDC? How do you use it in creating mappings?
CDC stands for change data capture.
With CDC, whenever data is changed in the OLTP systems, only
the changed data is captured and loaded into the target.
SCDs work internally based on CDC logic.
CDC is basically implemented by using an effective date.

20. How can we store previous session logs?

Go to the session, right-click, and select Edit Task. Then go to the Config Object tab
and set the properties:
Save Session Log By: Runs
Save Session Log for These Runs: the number of historical session logs you want

21. How do you read rejected or bad data from the bad file and reload it to the target?

Correct the rejected data and send it to the target relational tables using the load order utility. Find
the rejected data by using the column indicator and row indicator.
21. My source table has 1000 records. I want to load records 501 to 1000 into
my target table.
select * from tab_name where rownum<=1000
minus
select * from tab_name where rownum<=500;
Or
connect a Sequence Generator next to the Source Qualifier, into an Expression. Use a Filter
transformation to keep only the rows where NEXTVAL > 500.
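The second approach (number the rows, then filter on the number) can be sketched in Python like this. The counter stands in for the generated NEXTVAL; the function name and the sample input are assumptions for illustration.

```python
def rows_after(rows, skip=500):
    """Number the rows from 1 and pass through only those numbered above skip."""
    next_val = 0
    for row in rows:
        next_val += 1            # stands in for the generated NEXTVAL
        if next_val > skip:      # filter condition: NEXTVAL > 500
            yield row


loaded = list(rows_after(range(1, 1001)))
print(len(loaded), loaded[0], loaded[-1])
```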
22. I have a COBOL program with a subprogram. How can I find out whether it is a dynamic call or
a static call?
First check whether the statement is CALL 'sub-name' or CALL ws-name.
If it is CALL 'sub-name' (a literal), it is a STATIC CALL.
Or
if it is CALL ws-name (a working-storage variable), it is a DYNAMIC CALL.
