Chapter 3Feature Engineering

In the previous chapter, we discussed why it's important to have quality data. We covered some of the ways to get quality data such as removing missing values, removing outliers, and using data normalization. In this chapter, we will cover feature‐engineering techniques. We will also cover encoded structured data type, class imbalance, feature cross, and TensorFlow Transform to perform feature engineering.

Feature engineering is the process of transforming raw data coming from various sources such as log files and weather readings to useful features for model training. The features can be numerical or categorical. Let's understand the primary reasons for data transformation.

  • Transforming for Data Compatibility Data consists of both numeric features ...

Get Official Google Cloud Certified Professional Machine Learning Engineer Study Guide now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.