Chapter 4

k-Nearest Neighbor Classification II

M. Fareed Akhtar

Fastonish, Australia

4.1 Introduction

The use case of this chapter applies the k-NN operator on the Glass Identification dataset (overview of the k-NN algorithm has been discussed in the previous chapter). The purpose of this use case is to predict the type of the glass depending on its components. The operators explained in this chapter are: Read CSV, PCA, Split Data, and Performance (Classification).

4.2 Dataset

Glass Identification Dataset This dataset has been taken from UCI repositories. This dataset can be accessed through this link: http://archive.ics.uci.edu/ml/datasets/Glass+Identification.

Basic Information: The aim of this dataset is to classify the glass into one of 7 ...

Get RapidMiner now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.