Chapter 18

Data Privacy and Privacy-Preserving Data Publishing

Abstract

This chapter discusses “the other side of the coin” to data mining, a topic that data mining books seldom cover: data privacy and privacy-preserving data publishing. The chapter first considers how some popular Internet applications deal with data privacy, then takes a brief look at some of the legal aspects. It then turns to privacy-preserving data publishing, which is perhaps the area of most concern for data miners, covering concepts, anonymization techniques, and document sanitization.

Keywords

data privacy

privacy-preserving data publishing

anonymity

information loss

risk of disclosure

confidentiality

document sanitization

Introduction

This chapter discusses “the other side ...
