376Solving Operational Business Intelligence with InfoSphere Warehouse Advanced Edition
10.2.2 Data preparation
It has long been true that one of the most time-consuming stages in a data
mining project is data preparation. This step involves several activities to make
the source data suitable for data mining processing, including but not limited to
the following tasks:
Integrate or consolidate data from multiple sources into a single data set
suitable for data mining
Transfer data values or calculate new data values for inclusion in the data
mining solution
Align granularity (for example, transaction level versus daily summary) of data
from different sources
Eliminate or correct “bad” data values in the source data, such as null values ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month, and much more.