Give three reasons why you think ETL functions are most challenging in a data warehouse environment.
Name any five types of activities that are part of the ETL process. Which of these are time-consuming?
The tremendous diversity of the source systems is the primary reason for their complexity. Do you agree? If so, explain briefly why.
What are the two general categories of data stored in source operational systems? Give two examples for each.
Name five types of the major transformation tasks. Give an example for each.
Describe briefly the entity identification problem in data integration and consolidation. How do you resolve this problem?
What is key restructuring? Explain why it is needed.
Define initial load, incremental load, and full refresh.
Explain the difference between destructive merge and constructive merge for applying data to the data warehouse repository. When do you use these modes?
When is a full data refresh preferable to an incremental load? Can you think of an example?