Skip to Main Content
Making Sense of Data I: A Practical Guide to Exploratory Data Analysis and Data Mining, 2nd Edition
book

Making Sense of Data I: A Practical Guide to Exploratory Data Analysis and Data Mining, 2nd Edition

by Glenn J. Myatt, Wayne P. Johnson
August 2014
Beginner to intermediate content levelBeginner to intermediate
248 pages
5h 54m
English
Wiley
Content preview from Making Sense of Data I: A Practical Guide to Exploratory Data Analysis and Data Mining, 2nd Edition

CHAPTER 5 IDENTIFYING AND UNDERSTANDING GROUPS

5.1 OVERVIEW

It is often useful to decompose a data set into simpler subsets to help make sense of the entire collection of observations. These groups may reflect the types of observations found in a data set. For example, the groups might summarize the different types of customers who visit a particular shop based on collected demographic information. Finding subgroups may help to uncover relationships in the data such as groups of consumers who buy certain combinations of products. The process of grouping a data set may also help identify rules from the data, which can in turn be used to support future decisions. For example, the process of grouping historical data can be used to understand which combinations of clinical treatments lead to the best patient outcomes. These rules can then be used to select an optimal treatment plan for new patients with the same symptoms. Finally, the process of grouping also helps discover observations dissimilar from those in the major identified groups. These outliers should be more closely examined as possible errors or anomalies.

The identification of interesting groups is not only a common deliverable for a data analysis project, but can also support other data mining tasks such as the development of a model to use in forecasting future events (as described in Chapter 6). This is because the process of grouping and interpreting the groups of observations helps the analyst to thoroughly understand ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Making Sense of Data: A Practical Guide to Exploratory Data Analysis and Data Mining

Making Sense of Data: A Practical Guide to Exploratory Data Analysis and Data Mining

Glenn J. Myatt
Data Mining, 4th Edition

Data Mining, 4th Edition

Ian H. Witten, Eibe Frank, Mark A. Hall, Christopher J. Pal

Publisher Resources

ISBN: 9781118422106Purchase book