17.7. EXERCISES

  1. Match the columns:

    1. knowledge discovery process

    2. OLAP

    3. cluster detection

    4. decision trees

    5. link analysis

    6. hidden layers

    7. genetic algorithms

    8. data warehouse

    9. MBR

    10. banking application

    1. reveals reasons for the discovery

    2. neural networks

    3. distance function

    4. feeds data for mining

    5. data-driven

    6. fraud detection

    7. user-driven

    8. forms groups

    9. highly iterative

    10. associations discovery

  2. As a data mining consultant, you are hired by a large commercial bank that provides many financial services. The bank already has a data warehouse that it rolled out two years ago. The management wants to find the existing customers who are most likely to respond to a marketing campaign offering new services. Outline the knowledge discovery process, list the phases, and indicate the activities in each phase.

  3. Describe how decision trees work. Choose an example and explain how this knowledge discovery process works.

  4. What are the basic principles of genetic algorithms? Give an example. Use the example to describe how this technique works.

  5. In your project you are responsible for analyzing the requirements and selecting a toolset for data mining. Make a list of the criteria you will use for the toolset selection. Briefly explain why each criterion is necessary.

Get DATA WAREHOUSING FUNDAMENTALS: A Comprehensive Guide for IT Professionals now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.