Chapter 10. Classification Trees

In Chapter 9, Linear Regression we discussed regression. In the previous chapter, we were interested in classification using k-NN and Naïve Bayes. In this chapter, we will continue the topic of classification and discuss it in the context of decision trees. Decision trees notably allow class predictions (group membership) of previously unseen observations (testing datasets or prediction datasets) using statistical criteria applied on the seen data (training set).

Here, we will briefly examine the statistical criteria of six algorithms:

  • ID3
  • C4.5
  • C5.0
  • Classification and regression trees (CART)
  • Random forest
  • Conditional inference trees

We will also examine how to use decision trees in R, notably, how to measure the reliability ...

Get R: Predictive Analysis now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.