Chapter 2

Identification, Deidentification, and Reidentification

Outline

Many errors, of a truth, consist merely in the application of the wrong names of things.

Baruch Spinoza

Background

Data identification is certainly the most underappreciated and least understood Big Data issue. Measurements, annotations, properties, and classes of information have no informational meaning unless they are attached to an identifier that distinguishes one data object from all other data ...
