ETL Interview Questions & Answers - Learning Mode

ETL is important because it is the way data actually gets loaded into the warehouse. ETL can also be used for integration with legacy systems. ETL implementations usually store an audit trail of both successful and failed process runs.

Question: How can we use mapping variables in Informatica? Where do we use them?

Answer: Yes, we can use mapping variables in Informatica.

The Informatica server saves the value of a mapping variable to the repository at the end of a session run and uses that value the next time we run the session.
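
The save-and-reuse behaviour can be pictured with a small Python sketch. This is only an analogy, not Informatica's API: a JSON file stands in for the repository, and the names load_variable, save_variable, run_session and $$LastRunId are hypothetical.

```python
import json
import os
import tempfile

def load_variable(store_path, name, default):
    """Return the value saved by the previous run, or the default on the first run."""
    if not os.path.exists(store_path):
        return default
    with open(store_path) as f:
        return json.load(f).get(name, default)

def save_variable(store_path, name, value):
    """Persist the final value so the next run can start from it."""
    values = {}
    if os.path.exists(store_path):
        with open(store_path) as f:
            values = json.load(f)
    values[name] = value
    with open(store_path, "w") as f:
        json.dump(values, f)

def run_session(store_path, row_ids):
    """Process only ids above the saved value, then save the new maximum."""
    last_id = load_variable(store_path, "$$LastRunId", 0)
    processed = [r for r in row_ids if r > last_id]
    if processed:
        save_variable(store_path, "$$LastRunId", max(processed))
    return processed

if __name__ == "__main__":
    # The JSON file plays the role of the repository (an assumption of this sketch).
    store = os.path.join(tempfile.mkdtemp(), "repository.json")
    print(run_session(store, [1, 2, 3]))     # first run sees every row
    print(run_session(store, [1, 2, 3, 4]))  # second run starts after the saved value
```

The second call only picks up ids above the value saved by the first call, which is the effect a persisted mapping variable gives between session runs.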

Question: What are the different versions of Informatica?

Answer: Here are some popular versions of Informatica.

Informatica PowerCenter 4.1, Informatica PowerCenter 5.1, Informatica PowerCenter 6.1.2, Informatica PowerCenter 7.1.2, Informatica PowerCenter 8.1, Informatica PowerCenter 8.5, Informatica PowerCenter 8.6.

Question: What is the difference between PowerCenter and PowerMart?

Answer: PowerCenter - ability to organize repositories into a data mart domain and share metadata across repositories.

PowerMart - only a local repository can be created.

Question: Is there any way to read MS Excel data directly into Informatica? Is it possible to use an Excel file as a target?

Answer: We cannot import the Excel file into Informatica directly.
We have to define the Microsoft Excel ODBC driver on our system, and define names in the Excel sheet by defining ranges.
Then, in Informatica, open the folder and use Sources -> Import from Database -> select the Excel ODBC driver -> Connect -> select the Excel sheet name.

Question: Can Informatica load heterogeneous targets from heterogeneous sources?

Answer: No in Informatica 5.2; yes in Informatica 6.1 and later.

Yes, you can use heterogeneous sources and targets in a single mapping. However, to join data from heterogeneous sources you have to use a Joiner transformation.

Question: Can we override a native sql query within Informatica? Where do we do it? How do we do it?

Answer: Yes, we can override a native SQL query in the Source Qualifier and Lookup transformations.

In the Lookup transformation, use the "Sql override" option in the lookup properties.

In the Source Qualifier transformation, edit the transformation and, on the Properties tab, select the User Query or SQL option and change the query as needed.

Question: Can we look up a table from a Source Qualifier transformation, i.e. as an unconnected lookup?

Answer: You cannot look up from a Source Qualifier directly. However, you can override the SQL in the Source Qualifier to join with the lookup table to perform the lookup.
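
As a rough illustration of the join-based approach, the sketch below uses Python's sqlite3 module in place of a real source database. The tables orders and customers and their columns are hypothetical; the query text is the kind of statement one might place in the SQL override.

```python
import sqlite3

# Hypothetical override query: the source table is joined to the lookup table
# so the looked-up value arrives together with each source row.
OVERRIDE_SQL = """
SELECT o.order_id, o.customer_id, c.customer_name
FROM orders o
LEFT OUTER JOIN customers c ON c.customer_id = o.customer_id
ORDER BY o.order_id
"""

def build_demo_db():
    """Create an in-memory database with a source table and a lookup table."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE orders (order_id INTEGER, customer_id INTEGER)")
    conn.execute("CREATE TABLE customers (customer_id INTEGER, customer_name TEXT)")
    conn.executemany("INSERT INTO orders VALUES (?, ?)", [(1, 10), (2, 20), (3, 99)])
    conn.executemany("INSERT INTO customers VALUES (?, ?)", [(10, "Acme"), (20, "Globex")])
    return conn

def fetch_with_lookup(conn):
    """Run the override query and return all rows."""
    return conn.execute(OVERRIDE_SQL).fetchall()

if __name__ == "__main__":
    for row in fetch_with_lookup(build_demo_db()):
        print(row)
```

The left outer join keeps source rows that have no match in the lookup table and returns NULL for the looked-up column, which is the closest equivalent to a lookup that finds nothing.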

Question: What is an ODS (Operational Data Store)?

Answer: ODS - Operational Data Store.

The ODS sits between the staging area and the data warehouse. The data in the ODS is kept at a low level of granularity.

Once data has been populated in the ODS, aggregated data is loaded into the EDW through the ODS.

Question: What is Full load & Incremental or Refresh load?

Answer: Full Load: completely erasing the contents of one or more tables and reloading with fresh data.

Incremental Load: applying ongoing changes to one or more tables based on a predefined schedule.
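
The difference between the two load types can be sketched in Python with an in-memory SQLite table. The table name target, its columns, and the helper names are hypothetical, and using INSERT OR REPLACE to apply changes is just one possible way to do it.

```python
import sqlite3

def make_target():
    """Create an empty target table keyed by id."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE target (id INTEGER PRIMARY KEY, value TEXT)")
    return conn

def full_load(conn, rows):
    """Erase the table contents completely and reload with fresh data."""
    conn.execute("DELETE FROM target")
    conn.executemany("INSERT INTO target (id, value) VALUES (?, ?)", rows)
    conn.commit()

def incremental_load(conn, changed_rows):
    """Apply only the changes: new ids are inserted, existing ids are overwritten."""
    conn.executemany(
        "INSERT OR REPLACE INTO target (id, value) VALUES (?, ?)", changed_rows
    )
    conn.commit()

def snapshot(conn):
    """Return the current table contents ordered by id."""
    return conn.execute("SELECT id, value FROM target ORDER BY id").fetchall()

if __name__ == "__main__":
    conn = make_target()
    full_load(conn, [(1, "a"), (2, "b")])
    incremental_load(conn, [(2, "b2"), (3, "c")])
    print(snapshot(conn))
```

After the incremental load, row 1 is untouched, row 2 carries the changed value and row 3 is new; a further full load would discard all three.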

Question: How do we call shell scripts from Informatica?

Answer: Specify the full path of the shell script in the "Post session properties of session/workflow".

Question: What are active and passive transformations?

Answer: An active transformation can change the number of rows that pass through it (it can decrease or increase the number of rows).

A passive transformation cannot change the number of rows that pass through it.
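
The distinction can be shown with plain Python functions over lists of rows. These are not Informatica transformations; filter_rows, split_rows and apply_expression are hypothetical names chosen only to show row counts going down, going up, or staying the same.

```python
def filter_rows(rows, predicate):
    """Active: may emit fewer rows than it receives."""
    return [row for row in rows if predicate(row)]

def split_rows(rows, list_field):
    """Active: may emit more rows than it receives (one per list element)."""
    out = []
    for row in rows:
        for item in row[list_field]:
            new_row = dict(row)
            new_row[list_field] = item
            out.append(new_row)
    return out

def apply_expression(rows, func):
    """Passive: emits exactly one output row for each input row."""
    return [func(row) for row in rows]

if __name__ == "__main__":
    rows = [
        {"id": 1, "amount": 5, "tags": ["a", "b"]},
        {"id": 2, "amount": 50, "tags": ["c"]},
    ]
    print(len(filter_rows(rows, lambda r: r["amount"] > 10)))
    print(len(split_rows(rows, "tags")))
    print(len(apply_expression(rows, lambda r: {**r, "amount": r["amount"] * 2})))
```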

Question: Can we use procedural logic inside Informatica? If yes, how? If not, how can we use external procedural logic in Informatica?

Answer: Yes, you can use the Advanced External Procedure transformation. For more detail, refer to the section on that transformation in the Informatica Transformation Guide. You can use C++ on UNIX, and C++, VB or VC++ on Windows servers.

Question: I am new to SAS. Can anybody explain the process of extracting data from source systems, storing it in an ODS, and how data modelling is done?

Answer: There are various ways of extracting data from source systems. For example, you can use a DATA step or an import process. It depends on the style of your input data and on what kind of file or database it resides in. Storing your data in an ODS can be done through an ODS statement, an export statement or a FILE statement, which again depends on the file and data format you want your output to be in.

Question: What are the various test procedures used to check whether the data is loaded in the backend, the performance of the mapping, and the quality of the data loaded in Informatica?

Answer: The best procedure is to use the debugger, where we can monitor each step of the mapping and see how data is loaded, based on conditional breakpoints.

Question: What are the various methods of getting incremental records or delta records from the source systems?

Answer: One method is to maintain a field called 'Last Extraction Date' and then impose a condition in the code saying 'current_extraction_date > last_extraction_date', so that only records changed since the previous extraction are picked up.
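
A minimal Python sketch of this idea follows. It assumes that each source row carries a modification timestamp, here called modified_at, which the answer above does not state; the function name run_extraction is also hypothetical.

```python
from datetime import datetime

def run_extraction(source_rows, last_extraction_date, current_extraction_date):
    """Return the rows changed since the last extraction, plus the new watermark.

    Rows are selected when last_extraction_date < modified_at <= current_extraction_date,
    so consecutive runs neither skip nor repeat a row.
    """
    delta = [
        row for row in source_rows
        if last_extraction_date < row["modified_at"] <= current_extraction_date
    ]
    # The caller stores the returned date as the next run's 'Last Extraction Date'.
    return delta, current_extraction_date

if __name__ == "__main__":
    rows = [
        {"id": 1, "modified_at": datetime(2024, 1, 1)},
        {"id": 2, "modified_at": datetime(2024, 1, 5)},
        {"id": 3, "modified_at": datetime(2024, 1, 9)},
    ]
    delta, watermark = run_extraction(rows, datetime(2024, 1, 3), datetime(2024, 1, 7))
    print([row["id"] for row in delta], watermark)
```

Bounding the window at the current extraction date, rather than selecting everything newer than the last one, is a design choice of this sketch: rows modified while the extraction is running are left for the next run instead of being picked up twice.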

Question: What is partitioning? What are the types of partitioning?

Answer: If you use PowerCenter, you can increase the number of partitions in a pipeline to improve session performance. Increasing the number of partitions allows the Informatica Server to create multiple connections to sources and process partitions of source data concurrently.

When you create a session, the Workflow Manager validates each pipeline in the mapping for partitioning. You can specify multiple partitions in a pipeline if the Informatica Server can maintain data consistency when it processes the partitioned data.
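
The general idea of processing partitions of source data concurrently can be sketched in Python with a thread pool. This is not how the Informatica Server is implemented; the round-robin split, the doubling step and all function names are assumptions made for illustration.

```python
from concurrent.futures import ThreadPoolExecutor

def split_into_partitions(rows, partition_count):
    """Distribute the rows round-robin into partition_count lists."""
    return [rows[i::partition_count] for i in range(partition_count)]

def process_partition(partition):
    """Stand-in for the per-partition work (here: double each value)."""
    return [value * 2 for value in partition]

def run_partitioned(rows, partition_count):
    """Process every partition concurrently and merge the results."""
    partitions = split_into_partitions(rows, partition_count)
    with ThreadPoolExecutor(max_workers=partition_count) as pool:
        # pool.map returns results in the same order as the input partitions.
        results = list(pool.map(process_partition, partitions))
    return [value for part in results for value in part]

if __name__ == "__main__":
    print(sorted(run_partitioned(list(range(10)), 3)))
```

The merged output is grouped by partition rather than in source order, which is why the usage line sorts it; any step that depends on row order has to account for that.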

Question: Techniques of Error Handling - ignoring, rejecting bad records to a flat file, loading the records and reviewing them (default values)

Answer: Records can be rejected either by the database, because of a constraint or key violation, or by the Informatica server when it writes data into the target table. The rejected records can be found in the badfiles folder, where a reject file is created for the session, and there we can check why a record has been rejected. In this bad file the first column is a row indicator and the second column is a column indicator.
The column indicators are of four types:
D - valid data,
O - overflowed data,
N - null data,
T - truncated data.

Question: What is ETL?

Answer: ETL stands for extraction, transformation and loading.

ETL tools provide developers with an interface for designing source-to-target mappings, transformations and job control parameters.

Extraction: Take data from an external source and move it to the warehouse pre-processor database.

Transformation: The transform data task allows point-to-point generating, modifying and transforming of data.

Question: What are parameter files? Where do we use them?

Answer: A parameter file defines the values for the parameters and variables used in a workflow, worklet or session.
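
To give a feel for what such a file holds, the Python sketch below parses a small sample. The sample layout (a bracketed heading naming a folder, workflow and session, followed by name=value lines) is an assumption about the file format and should be checked against the product documentation; the folder, workflow, session and parameter names are all made up.

```python
# Hypothetical sample: heading syntax and names are assumptions for illustration.
SAMPLE = """\
[MyFolder.WF:wf_daily_load.ST:s_load_orders]
$$LastExtractionDate=2024-01-01
$$Region=EMEA
"""

def parse_parameter_text(text):
    """Parse '[heading]' lines and 'name=value' lines into nested dictionaries."""
    sections = {}
    current = None
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith("[") and line.endswith("]"):
            current = line[1:-1]
            sections[current] = {}
        elif "=" in line and current is not None:
            # Split on the first '=' only, so values may contain '=' themselves.
            name, value = line.split("=", 1)
            sections[current][name.strip()] = value.strip()
    return sections

if __name__ == "__main__":
    for heading, values in parse_parameter_text(SAMPLE).items():
        print(heading, values)
```

Each heading scopes its values to one workflow, worklet or session, so the same mapping can run with different values simply by pointing it at a different file.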

