Chapter 12
Getting Familiar with Your Data
In This Chapter
Organizing data properly
Importing data
Examining your data
Knowing data-mining terminology
Before a French chef whips up a dazzling dish, she sets out all the ingredients and tools. She checks that the ingredients are fresh and good, and that the tools work properly. She does not begin to cook until she puts everything in place.
A data miner is no different. Before you whip up a dazzling predictive model, you get acquainted with the data that you will use. You put it where you need it. You make sure that you understand what data you have, how it’s arranged and stored, and whether it is complete and correct.
This chapter shows you how to analyze and evaluate your data.
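To give a taste of what checking whether data is "complete" can look like, here is a minimal sketch in Python. It is not tied to any particular data-mining software. The sample table, its column names, and the summarize helper are all invented for illustration. The sketch counts rows and columns and reports how many values are missing in each column.

```python
import csv
import io

# Hypothetical sample data, invented for illustration only.
SAMPLE_CSV = """customer_id,age,region,purchased
1,34,North,yes
2,,South,no
3,51,,yes
4,28,East,no
"""


def summarize(csv_text):
    """Return the row count, column names, and missing-value count per column."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    columns = list(rows[0].keys()) if rows else []
    # Treat empty or whitespace-only cells as missing (an assumption of this sketch).
    missing = {
        col: sum(1 for row in rows if not (row[col] or "").strip())
        for col in columns
    }
    return {"rows": len(rows), "columns": columns, "missing": missing}


summary = summarize(SAMPLE_CSV)
print(summary["rows"], "rows;", len(summary["columns"]), "columns")
for col, count in summary["missing"].items():
    print(f"{col}: {count} missing")
```

A quick count like this only tells you where values are absent. Whether the values that are present are correct is a separate question that takes a closer look at the data itself.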
Organizing Data for Mining
Data mining has very strict requirements for data organization. They are not exotic, complex, or difficult requirements to meet, but they are strict.
Let me use an example to show how data must be organized for data mining. Figure 12-1 shows a sample of data viewed as a table in data-mining software. (See Chapter 2 for more about ...