5

Data Wrangling and Manipulation

It is not an exaggeration to say that the majority of work done by the average data analyst revolves around preparing data for use. A large part of this is cleaning the data, as covered in the previous chapter, but it is more than just dealing with things that will cause errors or introduce bias. You will often have to get the data into a specific shape or format before you can use it. This step is often called data wrangling or manipulation. To be clear, when we use the word “manipulation,” we do not mean we are changing the outcome in any way; we are using it in the literal sense of handling and managing the data in a skillful way.

In this chapter, we will go over some of the most important skills in the data-wrangling ...

Get CompTIA Data+: DAO-001 Certification Guide now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.