Chapter 7k-Nearest Neighbor Algorithm

7.1 Classification Task

Perhaps the most common data mining task is that of classification. Examples of classification tasks may be found in nearly every field of endeavor:

Banking: determining whether a mortgage application is a good or bad credit risk, or whether a particular credit card transaction is fraudulent
Education: placing a new student into a particular track with regard to special needs
Medicine: diagnosing whether a particular disease is present
Law: determining whether a will was written by the actual person deceased or fraudulently by someone else
Homeland security: identifying whether or not certain financial or personal behavior indicates a possible terrorist threat

In classification, there is a target categorical variable, (e.g., income bracket), which is partitioned into predetermined classes or categories, such as high income, middle income, and low income. The data mining model examines a large set of records, each record containing information on the target variable as well as a set of input or predictor variables. For example, consider the ...

Get Discovering Knowledge in Data: An Introduction to Data Mining, 2nd Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.

Start your free trial

Discovering Knowledge in Data: An Introduction to Data Mining, 2nd Edition by Daniel T. Larose

Chapter 7k-Nearest Neighbor Algorithm

7.1 Classification Task

Don’t leave empty-handed

It’s yours, free.

Check it out now on O’Reilly