June 2011
Beginner to intermediate
744 pages
25h 11m
English
Classification is a form of data analysis that extracts models describing important data classes. Such models, called classifiers, predict categorical (discrete, unordered) class labels. Such analysis can help provide users with a better understanding of the data at large. Classification and numeric prediction are the two major types of prediction problems. Many classification methods have been proposed by researchers in machine learning, pattern recognition, and statistics. Most algorithms are memory resident, typically assuming a small data size. Recent data mining research has built on such work, developing scalable classification and prediction techniques capable of handling large amounts of disk-resident data. Classification ...