Skip to Content
Data Science with Python and Dask
book

Data Science with Python and Dask

by Jesse Daniel
July 2019
Intermediate to advanced content levelIntermediate to advanced
296 pages
9h 1m
English
Manning Publications
Content preview from Data Science with Python and Dask

6 Summarizing and analyzing DataFrames

This chapter covers

  • Producing descriptive statistics for a Dask Series
  • Aggregating/grouping data using Dask’s built-in aggregate functions
  • Creating your own custom aggregation functions
  • Analyzing time series data with rolling window functions

At the end of the previous chapter we arrived at a dataset ready for us to start digging in and analyzing. However, we didn’t perform an exhaustive search for every possible issue with the data. In reality, the data cleaning and preparation process can take a far longer time to complete. It’s a common adage among data scientists that data cleaning can take 80% or more of the total time spent on a project. With the skills you learned in the previous chapter, you ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Practical Data Science with Python

Practical Data Science with Python

Nathan George
Python: End-to-end Data Analysis

Python: End-to-end Data Analysis

Phuong Vothihong, Martin Czygan, Ivan Idris, Magnus Vilhelm Persson, Luiz Felipe Martins

Publisher Resources

ISBN: 9781617295607OtherSupplemental ContentPublisher SupportPublisher WebsiteSupplemental ContentPurchase Link