Building new features
Sometimes you'll find yourself in a situation where the features and the target variable are not really related. In this case, you can modify the input dataset by applying linear or nonlinear transformations that may improve the accuracy of the system. This is a very important step in the overall process, and it depends entirely on the skills of the data scientist, who is responsible for artificially changing the dataset and shaping the input data to better fit the learning model. Although this step intuitively just adds complexity, it often boosts the performance of the learner; that's why it is used by bleeding-edge techniques, such as deep learning.
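To make the idea concrete, here is a minimal sketch on synthetic data (the data, the helper name r_squared, and the choice of a squared term are all assumptions made for illustration, not taken from the book). The target depends on the square of the input, so a linear model fitted on the raw feature explains very little, while the same model fitted with an added nonlinear feature should fit far better.

```python
import numpy as np


def r_squared(features, target):
    """Fit ordinary least squares with an intercept and return in-sample R^2."""
    design = np.column_stack([np.ones(len(target)), features])
    coef, *_ = np.linalg.lstsq(design, target, rcond=None)
    residuals = target - design @ coef
    ss_res = np.sum(residuals ** 2)
    ss_tot = np.sum((target - target.mean()) ** 2)
    return 1.0 - ss_res / ss_tot


# Synthetic data (assumption): the target is quadratic in x, plus noise.
# Because x is symmetric around zero, x and y are almost linearly unrelated.
rng = np.random.default_rng(0)
x = rng.uniform(-3, 3, size=500)
y = x ** 2 + rng.normal(scale=0.5, size=500)

# Linear model on the raw feature only.
r2_raw = r_squared(x, y)

# Same linear model, with a hand-built nonlinear feature (x squared) added.
r2_engineered = r_squared(np.column_stack([x, x ** 2]), y)

print(f"R^2 with raw feature:        {r2_raw:.3f}")
print(f"R^2 with engineered feature: {r2_engineered:.3f}")
```

The learner is unchanged in both fits; only the input representation differs. In a real project, whether a given transformation helps has to be checked on held-out data rather than on the training sample used here.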
For example, if you're trying to predict the value ...