Chapter 19

Attribute selection

19.1 Introduction

Unlike the classification, regression, or clustering tasks, the attribute selection task is not a data mining task per se, as—contrary to the former—it is not concerned with delivering models representing generalizations of predictively useful relationships discovered in the data. While the results of attribute selection—taking the form of a subset of attributes—can be interesting and insightful on their own—there is always another task (usually classification, regression, or clustering) for which attribute selection serves as preprocessing, and which provides the motivation, context, and quality criteria therefore. This puts attribute selection in the same category as data transformation techniques discussed in Chapter 17. What distinguishes attribute selection from the latter is that it often requires much more refined and computationally demanding algorithms to be adequately performed, and its impact on the final model quality may be even more substantial. It cannot be therefore reduced to something purely technical and trivial by any means, and definitely deserves more interest than it usually receives.

Get Data Mining Algorithms: Explained Using R now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.

Start your free trial

Data Mining Algorithms: Explained Using R by Pawel Cichosz

Chapter 19

Attribute selection

19.1 Introduction

Don’t leave empty-handed

It’s yours, free.

Check it out now on O’Reilly