1. Pandas DataFrame Basics
1.1 Introduction
Pandas is an open source Python library for data analysis. It gives Python the ability to work with spreadsheet-like data for fast data loading, manipulating, aligning, and merging, among other functions. To give Python these enhanced features, Pandas introduces two new data types to Python: Series and DataFrame. The DataFrame represents your entire spreadsheet or rectangular data, whereas the Series is a single column of the DataFrame. A Pandas DataFrame can also be thought of as a dictionary or collection of Series objects.
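To make the "dictionary of Series" idea concrete, here is a minimal sketch. The column names and values are made up for illustration and are not taken from the book's datasets; it only assumes that Pandas is installed and imported under its conventional alias pd.

```python
import pandas as pd

# Hypothetical example data: two Series that will become two columns.
names = pd.Series(["Ada", "Grace", "Alan"])
born = pd.Series([1815, 1906, 1912])

# Build a DataFrame from a dictionary mapping column labels to Series.
df = pd.DataFrame({"name": names, "born": born})

# Selecting a single column by its label gives back a Series.
column = df["born"]

print(type(df).__name__)      # DataFrame
print(type(column).__name__)  # Series
print(df.shape)               # (3, 2): three rows, two columns

# Like a dictionary, a DataFrame can be iterated as (label, Series) pairs.
for label, series in df.items():
    print(label, len(series))
```

Pulling a column out with df["born"] mirrors a dictionary lookup by key, which is why the dictionary analogy is a useful way to picture how a DataFrame is organized.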
Why should you use a programming language like Python and a tool like Pandas to work with data? It boils down to automation and reproducibility. If a particular set of analyses ...