Chapter 3. Exploratory Data Analysis in Soccer
In the previous chapter, you built a toolkit of essential Python skills. Now, it’s time to put those skills to work on real soccer data.
This chapter introduces exploratory data analysis (EDA), the initial, open-ended process of inspecting a dataset to understand its structure, find patterns, and identify anomalies. Just as a coach studies match footage before deciding on tactics, EDA is a structured investigation of the data. It’s how analysts gain their first understanding of what has been recorded, how often, and where the gaps or oddities lie. Instead of jumping straight to predictive models, we start by asking simple questions to build intuition. This crucial first step prevents us from drawing conclusions on shaky ground and lays the foundation for all subsequent analysis.
As we perform EDA, we’ll see how to adapt general-purpose data science tools into soccer-specific analyses. Rather than just counting rows or calculating averages, we’ll ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access