Chapter 19
Attribute selection
19.1 Introduction
Unlike the classification, regression, or clustering tasks, the attribute selection task is not a data mining task per se, as—contrary to the former—it is not concerned with delivering models representing generalizations of predictively useful relationships discovered in the data. While the results of attribute selection—taking the form of a subset of attributes—can be interesting and insightful on their own—there is always another task (usually classification, regression, or clustering) for which attribute selection serves as preprocessing, and which provides the motivation, context, and quality criteria therefore. This puts attribute selection in the same category as data transformation techniques discussed in Chapter 17. What distinguishes attribute selection from the latter is that it often requires much more refined and computationally demanding algorithms to be adequately performed, and its impact on the final model quality may be even more substantial. It cannot be therefore reduced to something purely technical and trivial by any means, and definitely deserves more interest than it usually receives.
Get Data Mining Algorithms: Explained Using R now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.