Complex operations

We've seen how to load, select, filter, and operate on data with pandas. In this section, we will show more complex manipulations that are typically done on full-blown databases based on SQL.



Structured Query Language is a domain-specific language widely used to manage data in relational database management systems (RDBMS). pandas is somewhat inspired by SQL, which is familiar to many data analysts. Additionally, pandas can connect to SQL databases. You will find more information about the links between pandas and SQL at

Let's first import our NYC taxi dataset as in the previous sections.

In [1]: import numpy as np
        import pandas as pd
 import matplotlib.pyplot ...

Get Learning IPython for Interactive Computing and Data Visualization - Second Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.