Chapter 10

Feature Selection and Dimensionality Reduction

Contents

Preamble

Feature selection techniques remove features that have low information content, clarifying the data by identifying the more important features. Similarly, dimensionality reduction techniques extract new features from the data by combining two or more existing features. A simple example of a dimensionality reduction strategy is to replace the factors “length” and “width” with their product, creating the new factor “area.” Area has information elements related to length and width, in addition to elements ...
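As a rough illustration of the two ideas, here is a minimal Python sketch. The toy data, the column names, and the helper functions `select_by_variance` and `combine_to_area` are all hypothetical, and treating "zero variance" as a stand-in for "low information content" is an assumption made for this example, not a method taken from the text.

```python
from statistics import pvariance

# Hypothetical toy data: each row is (length, width, constant_flag).
rows = [
    (2.0, 3.0, 1.0),
    (4.0, 1.5, 1.0),
    (5.0, 2.0, 1.0),
    (3.0, 4.0, 1.0),
]
names = ["length", "width", "constant_flag"]


def select_by_variance(rows, names, threshold=0.0):
    """Feature selection (assumed criterion): keep columns whose
    population variance is strictly greater than `threshold`."""
    columns = list(zip(*rows))
    keep = [i for i, col in enumerate(columns) if pvariance(col) > threshold]
    kept_names = [names[i] for i in keep]
    kept_rows = [tuple(row[i] for i in keep) for row in rows]
    return kept_rows, kept_names


def combine_to_area(rows, names):
    """Dimensionality reduction: replace "length" and "width" with their
    product, "area", leaving any other columns in place."""
    li, wi = names.index("length"), names.index("width")
    others = [i for i in range(len(names)) if i not in (li, wi)]
    new_names = [names[i] for i in others] + ["area"]
    new_rows = [
        tuple(row[i] for i in others) + (row[li] * row[wi],) for row in rows
    ]
    return new_rows, new_names


# Step 1: drop the column that never varies.
selected_rows, selected_names = select_by_variance(rows, names)

# Step 2: merge the two remaining columns into one derived column.
area_rows, area_names = combine_to_area(selected_rows, selected_names)

print(selected_names)  # ['length', 'width']
print(area_names)      # ['area']
print(area_rows)       # [(6.0,), (6.0,), (10.0,), (12.0,)]
```

In this toy data the first two rows have different lengths and widths but the same area, which shows that the combined feature summarizes the originals without preserving everything about them.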
