Data enrichment
Once we have our cleaned data in the format we need for our analysis, we may find the need to enrich the data a bit. Data enrichment (which will be covered in Chapter 4, Aggregating Pandas DataFrames) improves the quality of the data by adding to it in one way or another. This process becomes very important in modeling and in machine learning, where it forms part of the feature engineering process (which we will touch on in Chapter 10, Making Better Predictions – Optimizing Models).
When we're looking to enrich the data, we can either merge new data with the original data (by appending new rows or columns) or use the original data to create new data. The following are ways to enhance our data using the original data:
- Adding ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access