12. Feature Engineering
Overview
By the end of this chapter, you will be able to merge multiple datasets together; bin categorical and numerical variables; perform aggregation on data; and manipulate dates using pandas.
This chapter will introduce you to some of the key techniques for creating new variables on an existing dataset.
Introduction
In the previous chapters, we learned how to analyze and prepare a dataset in order to increase its level of quality. In this chapter, we will introduce you to another interesting topic: creating new features, also known as feature engineering. You already saw some of these concepts in Chapter 3, Binary Classification, but we will dive a bit deeper into it in this chapter.
The objective of feature ...
Get The Data Science Workshop - Second Edition now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.