Handbook of Statistical Analysis and Data Mining Applications

by Robert Nisbet, John Elder, Gary Miner
May 2009
Content level: Beginner to intermediate
864 pages
23h 13m
English
Elsevier Science
Content preview from Handbook of Statistical Analysis and Data Mining Applications
Chapter 4

Data Understanding and Preparation

OUTLINE

Preamble

Once the data mining process is chosen, the next step is to access, extract, integrate, and prepare the appropriate data set for data mining. Input data must be provided in the amount, structure, and format suited to the modeling algorithm. In this chapter, we describe the general structure in which data must be expressed for modeling, how to explore the data before modeling, and the major data cleaning operations that must be performed. From a database standpoint, a body of data can be regarded ...

Publisher Resources

ISBN: 9780080912035