ETL Developer Interview Questions


Are you a recruiter or hiring manager hunting for a highly talented ETL developer?

We all know that business users and senior executives rely on data warehouse information to make important business decisions. Getting the best out of the company’s unstructured data is the biggest hurdle. To overcome this challenge, hiring a top-notch ETL developer is a must. And believe me, that is even more challenging!

But don’t worry, I can help you with this!

In this article, along with proven ETL interview questions, I have written down the process to follow in order to find the right talent. It mainly consists of identifying the required responsibilities and skills, screening the candidates & then interviewing the right ones. Let’s discuss these steps in detail.

Step 1: Look for Skills & Responsibilities of an ETL Developer:

1. The Responsibilities of an ETL Developer:

Before proceeding further, let’s understand what exactly ETL is.

ETL (Extract, Transform & Load) is a process performed in a data warehouse environment. It involves extracting data from multiple data sources, transforming it into a suitable format & then loading the transformed data into the destination system. In simple words, if information is oil, then the pipeline built to carry that oil is the ETL process, & this is what ETL developers do! They create & strengthen this pipeline to extract & transfer data from one place to another.


I. Extracting the Data:

An ETL developer is responsible for developing & managing extraction tools, i.e. the tools that pull homogeneous or heterogeneous data from sources such as relational databases, SaaS services, mobile applications, flat files, etc. The size of the extracted data can range from hundreds of kilobytes up to gigabytes. Extraction is often the toughest & most time-consuming of the three tasks. Therefore, the three tasks are usually run in tandem: transformation & loading start on the data already extracted instead of waiting for the whole extraction to finish.

II. Transformation/Transportation of Data:

An ETL developer is also responsible for transforming the source data into a suitable format that meets the specification of the destination database. This involves data mapping & writing efficient code to apply the business logic.

III. Loading the Data:

Loading is the last stage & the reverse of extraction. It focuses on how effectively & efficiently the transformed data is loaded into the target system (a database, a file, or another data warehouse).
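
The three stages described above can be sketched in a few lines of Python. This is only a minimal illustration, not how any particular ETL tool works: the sample CSV data, the `orders` table and the function names are all assumptions made up for the example, and for simplicity the stages run one after another rather than in tandem.

```python
import csv
import io
import sqlite3

# Hypothetical source data standing in for a flat-file extract.
SOURCE_CSV = """order_id,customer,amount
1, alice ,10.50
2,BOB,20.00
3,carol,not_a_number
"""


def extract(text):
    """Extract: read raw rows from a CSV source."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: clean names, cast amounts, skip rows that fail validation."""
    clean = []
    for row in rows:
        try:
            amount = float(row["amount"])
        except ValueError:
            continue  # a real pipeline would log or quarantine rejected rows
        clean.append((int(row["order_id"]), row["customer"].strip().title(), amount))
    return clean


def load(rows, conn):
    """Load: write the transformed rows into the target table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS orders "
        "(order_id INTEGER PRIMARY KEY, customer TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO orders VALUES (?, ?, ?)", rows)
    conn.commit()


def run_pipeline(text, conn):
    """Run extract, transform and load in sequence and return the loaded rows."""
    load(transform(extract(text)), conn)
    return conn.execute(
        "SELECT order_id, customer, amount FROM orders ORDER BY order_id"
    ).fetchall()


if __name__ == "__main__":
    target = sqlite3.connect(":memory:")  # in-memory stand-in for the destination
    print(run_pipeline(SOURCE_CSV, target))
```

A candidate who can explain where each stage begins and ends in a sketch like this, and what should happen to the rejected third row, usually understands the pipeline idea well.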

2. ETL Developer Skills Required:

  1. Knowledge of one or more programming languages or frameworks: Python, Java, R, Scala, MapReduce, etc.
  2. Big data platforms & stores: Hadoop, MongoDB, Spark, etc.
  3. Hands-on experience with ETL tools: DataStage, Informatica, etc.
  4. Data Modeling: The technique used for defining the detailed business requirements for a specific database in order to design the best possible ETL process.
  5. Testing Knowledge: Should be able to perform unit testing on the generated code.
  6. Analytical & Logical Thinking: A good ETL developer/data engineer should be able to analyze the given data & judge all the possible outcomes.
  7. Communication skills: An ETL developer should have good verbal & written communication skills.
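
To make the testing skill in the list above more concrete, here is a small sketch of what unit testing a transformation rule can look like. The rule itself (`normalize_country`) and its alias mapping are hypothetical, invented only for this example; the tests are plain functions with assertions, in the style that test runners such as pytest can collect.

```python
def normalize_country(code):
    """Hypothetical transformation rule: trim, upper-case, map known aliases."""
    aliases = {"UK": "GB"}  # illustrative mapping only
    cleaned = code.strip().upper()
    return aliases.get(cleaned, cleaned)


def test_trims_and_uppercases():
    assert normalize_country(" us ") == "US"


def test_maps_known_alias():
    assert normalize_country("uk") == "GB"


def test_leaves_unknown_codes_unchanged():
    assert normalize_country("IN") == "IN"


if __name__ == "__main__":
    # Run the tests directly when no test runner is available.
    for test in (
        test_trims_and_uppercases,
        test_maps_known_alias,
        test_leaves_unknown_codes_unchanged,
    ):
        test()
    print("all tests passed")
```

In an interview, asking a candidate which extra cases they would test (empty strings, missing values, unexpected codes) tells you a lot about their testing habits.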

Step 2: Assess Candidates Before The Interview:

If you think that resumes always tell the truth, you are in big trouble! Blindly trusting resumes & inviting candidates directly to the interview is risky, time-consuming & tedious. In an interview, you can gauge a candidate’s personality: whether he/she is dedicated, passionate & a cultural fit. But what about their skills? As an employer, you should opt for a scientific approach to validate the candidate’s technical expertise. A skill assessment test is one of the best ways to gauge the practical knowledge of candidates.

Before inviting candidates to the interview, assess their working skills with Interview Mocha’s valid & reliable skill tests.

For more details, you can visit Interview Mocha, an applicant testing software. Employers can also ask for customized assessments. Our content team will be happy to provide a custom test as per your specific needs! All our pre-employment assessment tests are developed by global subject matter experts.

Step 3: Interview Round:

After screening the qualified candidates, invite them to the interview. Here, I have compiled proven ETL interview questions that will help you assess the ETL skills of applicants.

ETL Interview Questions to Assess & Hire ETL Developers:

  1. Models such as budgeting, financial reporting, allocations, etc. are heavy on calculations and do not translate well into SQL. Among Multi-dimensional, Real Time and Relational OLAP, which type of OLAP is not suitable when computations have to be carried out on these models, and why?
  2. What do you know about the operational data store in data warehousing?
  3. How is a distributed data warehouse different from a virtual data warehouse?
  4. Do you know about a data cube? What is the significance of data cubes?
  5. You have a product dimension and a sales fact table in your data warehouse. A new product is created in the OLTP system, sales transactions happen for that product, and then you extract from the OLTP system. As a result, only the sales transactions reach the staging environment and not the product. Why did this happen?
  6. Can you explain the difference between parametric and nonparametric models of prediction? Also give one example of each.
  7. Shown is a graph database model skeleton. Let N be the nodes (representing entities) and E be the edges (representing relationships among entities). How do you represent entity attributes, plus metadata, in such a data model?
    [Image: graph database model skeleton]
  8. What can you say about a database column that has high cardinality?
  9. Between a fact table and a dimension table, which one remains in denormalized form?
  10. A business model has employee details, company details, and salary details. The model (schema) has only one foreign key i.e. Employee_ID for all the dimensions which are necessary. What type of model is being referred to here?
  11. Do you know about association rule mining? Explain it with one practical use case.
  12. Suppose i1, i2, i3, i4, i5, i6, i7, i8 are the items sold at a grocery store. Customers can buy any combination of these items as per their requirements. After collecting the data from all transactions, it was found that i5, i6, i7 is the most frequently bought itemset, while i2, i4, i5 is the least frequently bought. What can you say about the purchase behavior of the i5i6 itemset and the i2i4i5i7 itemset?
  13. What do you think is the primary role of a data mining engine in any data mining system?
  14. Below are the steps of a typical data mining process, listed out of order:
    i) Perform ETL onto the data warehouse system. ii) Provide data access to analysts. iii) Store and manage the data in a multidimensional DB system. iv) Analysis and presentation of data.
    Can you rearrange these steps into the correct order?
  15. Consider data mining techniques like Clustering, Associations, Link analysis, and Deviation detection, which one would you use for finding a relationship between two entities?
  16. Suppose you're asked to reduce an existing data model using IBM DataStage ETL (assume you are not using Parallel Extender). Can you do this as percentage-wise reductions on the rows in the data source? If yes, how? If no, what is the workaround?
  17. Between a hash file and a sequential file, which one is faster to search for data, and why?
  18. When is a sparse lookup in DataStage faster than a normal lookup, and why?
  19. You have two time-type field values in a DataStage transformer. In one record the value is 00:59:59, while in the other record the value is 00:10:10. If you want to add these two values so as to get the output 01:10:09, how would you do that?
  20. Can you explain the merge stage in DataStage in the context of jobs? Also, explain how the merge is performed and what is mandatory when the merge is performed on a key field.
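
For question 19, it helps to have the expected arithmetic at hand when evaluating answers: 00:59:59 plus 00:10:10 is 70 minutes and 9 seconds, i.e. 01:10:09. The sketch below checks this in plain Python. It is not DataStage transformer syntax, and the helper names `to_timedelta` and `add_times` are hypothetical; it only shows the convert, add, convert back idea that a good answer would normally follow.

```python
from datetime import timedelta


def to_timedelta(value):
    """Parse an HH:MM:SS string into a timedelta."""
    hours, minutes, seconds = (int(part) for part in value.split(":"))
    return timedelta(hours=hours, minutes=minutes, seconds=seconds)


def add_times(first, second):
    """Add two HH:MM:SS values and return the sum as HH:MM:SS.

    Hours are not wrapped at 24, so a sum of a day or more would need
    an explicit rule (wrap around or carry into days).
    """
    total = int((to_timedelta(first) + to_timedelta(second)).total_seconds())
    hours, remainder = divmod(total, 3600)
    minutes, seconds = divmod(remainder, 60)
    return f"{hours:02d}:{minutes:02d}:{seconds:02d}"


if __name__ == "__main__":
    print(add_times("00:59:59", "00:10:10"))  # 01:10:09
```

A strong candidate will mention the same steps (convert both values to seconds, add, convert back) and will ask what should happen when the sum crosses 24 hours.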




ETL is a never-ending process & so are the techniques & methods used in it! Though I have covered many aspects, roles & responsibilities of ETL developers, there may be more depending on the job role assigned. But this guide, along with our skill assessment software, will help you streamline your hiring process & pick up the few small but essential things that make a big difference in getting the right hire!

Our pre-employment testing software provides a great way to evaluate candidates on their knowledge and skills. With features like cheating prevention, quick report generation, and integration with Applicant Tracking Systems (ATS), we provide the best pre-employment skill testing experience.

If you have any queries or other approaches that worked really well for you, we would love to hear about them. Please feel free to share them in the comments below.

Sanghamitra Sarate
A passionate Digital Marketer with expertise in SEO & paid advertising. When not busy with campaigns, I like to stay updated on the latest technology trends that are useful for recruitment & content production.

Topics: Technical Hiring
