Feature selection and engineering are important steps in a machine learning pipeline and involves all the techniques adopted to reduce their dimensionality. Most of the time, these steps come after cleaning the dataset.
Most algorithms have strong assumptions about the input data, and their performance can be negatively affected when raw datasets are used. Moreover, the data is seldom isotropic; there are often features that determine the general behavior of a sample, while others that are correlated don’t provide any additional ...