7.1 Classification Task
Perhaps the most common data mining task is that of classification. Examples of classification tasks may be found in nearly every field of endeavor:
- Banking: determining whether a mortgage application is a good or bad credit risk, or whether a particular credit card transaction is fraudulent
- Education: placing a new student into a particular track with regard to special needs
- Medicine: diagnosing whether a particular disease is present
- Law: determining whether a will was written by the actual person deceased or fraudulently by someone else
- Homeland security: identifying whether or not certain financial or personal behavior indicates a possible terrorist threat
In classification, there is a target categorical variable, (e.g., income bracket), which is partitioned into predetermined classes or categories, such as high income, middle income, and low income. The data mining model examines a large set of records, each record containing information on the target variable as well as a set of input or predictor variables. For example, consider the ...