Performing data filtering activities

This is a bit of a recap recipe. In the workflow proposed here, we will sum up the tricks and knowledge gained throughout the book in order to perform a data-filtering activity.

Data filtering includes all the activities performed on a dataset to make it ready for further analysis.

Isn't it the same as data cleansing?

Well, in a sense… yes. However, not exactly the same, since data filtering usually refers to some specific techniques and not to others, while data cleansing can be considered a more comprehensive concept.

That said, here we will make tests for our data frame, performing subsequent filtering activities and reporting about these activities. The following diagram shows the flow:

As you can see in the ...

Get RStudio for R Statistical Computing Cookbook now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.