3

Identification, Deidentification, and Reidentification

Abstract

This chapter describes, in some detail, the available methods for data identification and the minimal properties of identified information (uniqueness, exclusivity, completeness, authenticity, and reconciliation). The dire consequences of faulty identification will be discussed, along with real-world examples. Once data objects have been properly identified, they can be properly deidentified and reidentified. The ability to deidentify data objects confers enormous advantages when issues of confidentiality, privacy, and intellectual property emerge.

Keywords

Identification; Identifier; Data uniqueness; Deidentification; Reidentification; Anonymization; One-way hash; Random character ...

Get Principles and Practice of Big Data, 2nd Edition now with O’Reilly online learning.

O’Reilly members experience live online training, plus books, videos, and digital content from 200+ publishers.