Chapter 38. Securing Your Data Against Breaches Will Help Us Improve Health Care

Fred Nugen

When you go to a new health care clinic in the United States, doctors and nurses pull up your patient record based on your name and birthdate. Except sometimes it’s not your chart they pull up. This is not only a health care problem; it’s also a data science problem.

Two things (at least) contribute to this error: a lack of consistent and uniform patient records and public mistrust in protection of data. Both of them hold health care back from data science revolutions.

When patient records are transferred from one major hospital system to another, patient data passes through health information exchanges. The current rate of correctly matching patients between systems is estimated to be around 30%.1 With considerable effort from data scientists put into data cleaning and better algorithms, we could potentially match as often as 95%. This is an important opportunity for data science to improve health care! It’s called “master data management” or “data governance,” and while we have a long way to go, we’re getting better.

The health care industry works hard to prevent misidentification. It is standard practice to use at least two patient identifiers, such as name and birthdate.2 Unfortunately, name and birthdate do not uniquely identify a patient; a third ...

Get 97 Things About Ethics Everyone in Data Science Should Know now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.