In this chapter, we will continue with classification as a form of supervised learning. Our objective is to develop, train, and evaluate a decision tree classification model for predicting the species of an Iris flower based on its feature measurements. We will leverage the well-known Iris dataset, which consists of measurements of four features (sepal length, sepal width, petal length, and petal width) from three distinct species of Iris flowers (setosa, ...
8. Decision Tree Classification with Pandas, Scikit-Learn, and PySpark
Get Distributed Machine Learning with PySpark: Migrating Effortlessly from Pandas and Scikit-Learn now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.