Chapter 6
Cleansing and Profiling Data
This chapter covers Objective 2.2 (Identify common reasons for cleansing and profiling datasets) of the CompTIA Data+ exam and includes the following topics:
Duplicate data
Redundant data
Missing values
Invalid data
Non-parametric data
Data outliers
Specification mismatch
Data type validation
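Several of the topics listed above can be illustrated with a small profiling pass over a dataset. The following is a minimal sketch only, not the chapter's own method: the sample records, the field names, the `profile` helper, and the choice of the 1.5 × IQR rule for flagging outliers are all assumptions made for illustration.

```python
from collections import Counter
from statistics import quantiles

# Hypothetical sample records; field names and values are invented for illustration.
ROWS = [
    {"id": 1, "age": "34", "city": "Austin"},
    {"id": 2, "age": "29", "city": "Boston"},
    {"id": 2, "age": "29", "city": "Boston"},    # exact duplicate of the previous row
    {"id": 3, "age": "", "city": "Denver"},      # missing value
    {"id": 4, "age": "abc", "city": "Miami"},    # fails data type validation
    {"id": 5, "age": "31", "city": "Austin"},
    {"id": 6, "age": "30", "city": "Dallas"},
    {"id": 7, "age": "33", "city": "Tampa"},
    {"id": 8, "age": "240", "city": "Boston"},   # likely outlier
]


def profile(rows, field):
    """Count duplicates, missing values, invalid values, and outliers for one field."""
    # Duplicate data: rows whose every field matches an earlier row.
    counts = Counter(tuple(sorted(row.items())) for row in rows)
    duplicates = sum(n - 1 for n in counts.values() if n > 1)

    missing = 0
    invalid = 0
    values = []
    for row in rows:
        raw = row[field]
        if raw in ("", None):
            missing += 1          # missing value
            continue
        try:
            values.append(int(raw))
        except ValueError:
            invalid += 1          # value is present but is not an integer

    # Data outliers: assumed rule of thumb, anything beyond 1.5 * IQR from the quartiles.
    q1, _, q3 = quantiles(values, n=4)
    spread = 1.5 * (q3 - q1)
    outliers = [v for v in values if v < q1 - spread or v > q3 + spread]

    return {
        "duplicates": duplicates,
        "missing": missing,
        "invalid": invalid,
        "outliers": outliers,
    }


if __name__ == "__main__":
    print(profile(ROWS, "age"))
```

A report like this is only a starting point: whether a flagged row should be dropped, corrected, or kept depends on the dataset and on what the analysis is meant to answer.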
For more ...
Get CompTIA Data+ DA0-001 Exam Cram now with the O’Reilly learning platform.