PostgreSQL 9 Administration Cookbook - Second Edition
by Simon Riggs, Gianni Ciolli, Hannu Krosing, Gabriele Bartolini
April 2015
Content level: Intermediate to advanced
504 pages
11h 41m
English
Packt Publishing

Identifying and removing duplicates

Relational databases work on the idea that items of data can be uniquely identified. However hard we try, there will always be bad data arriving from somewhere. This recipe shows you how to diagnose duplicate rows and clean up the mess.

Getting ready

Let's start by looking at our example table, cust. It has a duplicate value in customerid:

postgres=# SELECT * FROM cust;
 customerid | firstname | lastname | age
------------+-----------+----------+-----
          1 | Philip    | Marlowe  |  38
          2 | Richard   | Hannay   |  42
          3 | Holly     | Martins  |  25
          4 | Harry     | Palmer   |  36
          4 | Mark      | Hall     |  47
(5 rows)
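
One way to diagnose the duplicate shown above is to group on the column that should be unique and keep only the groups with more than one row. The following is a minimal sketch, not necessarily the method the recipe goes on to use. The CREATE TABLE column types are an assumption (the source shows only the data), and the script is meant for a scratch database where cust does not already exist.

```sql
-- Assumed setup: the column types are a guess based on the sample output.
CREATE TABLE cust (
    customerid integer,
    firstname  text,
    lastname   text,
    age        integer
);

INSERT INTO cust (customerid, firstname, lastname, age) VALUES
    (1, 'Philip',  'Marlowe', 38),
    (2, 'Richard', 'Hannay',  42),
    (3, 'Holly',   'Martins', 25),
    (4, 'Harry',   'Palmer',  36),
    (4, 'Mark',    'Hall',    47);

-- Which customerid values occur more than once, and how often?
SELECT customerid, count(*)
FROM cust
GROUP BY customerid
HAVING count(*) > 1;

-- Show the full rows behind each duplicated customerid,
-- so they can be inspected before anything is deleted.
SELECT *
FROM cust
WHERE customerid IN (
    SELECT customerid
    FROM cust
    GROUP BY customerid
    HAVING count(*) > 1
)
ORDER BY customerid, lastname;
```

With the sample data, the first query returns a single row (customerid 4, count 2), and the second returns the Mark Hall and Harry Palmer rows. Listing the complete rows first makes it easier to judge whether the data is really wrong before removing anything.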

Before you delete duplicate data, remember that sometimes, it isn't the data that is wrong; it is your understanding of it. In those cases, ...

ISBN: 9781849519069