Counting and Identifying Duplicates
Problem
You want to find out if a table contains duplicates, and to what extent they occur. Or you want to see the records that contain the duplicated values.
Solution
Use a counting summary that looks for and displays duplicated values. To see the records in which the duplicated values occur, join the summary to the original table to display the matching records.
Discussion
Suppose that your web site includes a sign-up page that allows visitors to add themselves to your mailing list to receive periodic product catalog mailings. But you forgot to include a unique index in the table when you created it, and now you suspect that some people are signed up multiple times. Perhaps they forgot they were already on the list, or perhaps people added friends to the list who were already signed up. Either way, the result of the duplicate records is that you mail out duplicate catalogs. This is an additional expense to you, and it annoys the recipients. This section discusses how to find out if duplicates are present in a table, how prevalent they are, and how to display the duplicated records. (For tables that do contain duplicates, Recipe 14.7 describes how to eliminate them.)
To determine whether or not
duplicates occur in a table, use a counting summary, a topic covered
in Chapter 7. Summary techniques can be applied to
identifying and counting duplicates by grouping records with
GROUP BY and counting the rows
in each group using COUNT( ). For the examples, ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access