Chapter 8
Digging into Your Data
In This Chapter
Focusing on specific problems
Building on business knowledge
Appreciating the advantages of your own data
Many organizations possess a mountain of data that’s been collected in the course of routine business, and they’re adding new data each day. As a data miner, you’ll use this internal data as your primary natural resource.
This chapter focuses on framing a problem and finding relevant data within your existing resources. If you have more data on hand than you know what to do with, you’re in the very situation that data mining was created to address. On the other hand, if your data resources seem skimpy, don’t worry: the ideas in this chapter still apply to you. Make the most of whatever you have! (See more about expanding your data resources in Chapters 9, 10, and 11.)
Focusing on a Problem
A data-mining project begins when you identify a specific business issue to investigate. The narrower and better defined the question, the more effectively it can be answered, and the more clearly you can understand both the data requirements and the limitations of the answer. If you’re faced with ...