Chapter 6

Cleansing and Profiling Data

This chapter covers Objective 2.2 (Identify common reasons for cleansing and profiling datasets) of the CompTIA Data+ exam and includes the following topics:

  • Images Duplicate data

  • Images Redundant data

  • Images Missing values

  • Images Invalid data

  • Non-parametric data

  • Data outliers

  • Specification mismatch

  • Data type validation

For more ...

Get CompTIA Data+ DA0-001 Exam Cram now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.