April 2021
Intermediate to advanced
360 pages
6h 24m
English
This chapter focuses on exploratory data analysis (EDA), which is the first step in processing any dataset. The objective of EDA is to load data into data structures most suitable for further analysis to identify and rectify any wrong/bad data and get basic insight into the data—the types of fields there are; whether they are categorical or not; how many missing values there are; how the fields are related; and so on.
These are the main topics discussed in this chapter:
The Python code used in this chapter is available in the Chapter02/eda.ipynb notebook in the book's code repository.
EDA is the process ...