Chapter 2: Exploratory Data Analysis in Python

This chapter focuses on exploratory data analysis (EDA), which is the first step in processing any dataset. The objective of EDA is to load data into data structures most suitable for further analysis to identify and rectify any wrong/bad data and get basic insight into the data—the types of fields there are; whether they are categorical or not; how many missing values there are; how the fields are related; and so on.

These are the main topics discussed in this chapter:

  • Introduction to EDA
  • Special Python libraries for EDA

Technical requirements

The Python code used in this chapter is available in the Chapter02/eda.ipynb notebook in the book's code repository.

Introduction to EDA

EDA is the process ...

Get Hands-On Financial Trading with Python now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.