3

Identification, Deidentification, and Reidentification

Abstract

This chapter describes, in some detail, the available methods for data identification and the minimal properties of identified information (uniqueness, exclusivity, completeness, authenticity, and reconciliation). The dire consequences of faulty identification will be discussed, along with real-world examples. Once data objects have been properly identified, they can be properly deidentified and reidentified. The ability to deidentify data objects confers enormous advantages when issues of confidentiality, privacy, and intellectual property emerge.

Keywords

Identification; Identifier; Data uniqueness; Deidentification; Reidentification; Anonymization; One-way hash; Random character ...

Get Principles and Practice of Big Data, 2nd Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.