Classification
Supervised learning problems can be further divided into two groups: classification and regression. For the classification problem, the output variable, such as y, could be a binary variable, that is, 0 or 1, or several categories. For a regression, variables or values could be discrete or continuous. In the Titanic example, we have 1 for survived and 0 for not survived. For a regression problem, the output could be a value, such as, 2.5 or 0.234. In the previous chapter, we discussed the concept of distance between group members within the same group and between groups.
The logic for classification is that the distance between (among) group members is shorter than the distance between different groups. Alternatively speaking, ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access