Performing data filtering activities

This is a bit of a recap recipe. In the workflow proposed here, we will sum up the tricks and knowledge gained throughout the book in order to perform a data-filtering activity.

Data filtering includes all the activities performed on a dataset to make it ready for further analysis.

Isn't it the same as data cleansing?

Well, in a sense… yes. However, not exactly the same, since data filtering usually refers to some specific techniques and not to others, while data cleansing can be considered a more comprehensive concept.

That said, here we will make tests for our data frame, performing subsequent filtering activities and reporting about these activities. The following diagram shows the flow:

As you can see in the ...

Get RStudio for R Statistical Computing Cookbook now with O’Reilly online learning.

O’Reilly members experience live online training, plus books, videos, and digital content from 200+ publishers.