Building Machine Learning Systems with Decision Tree and Ensemble Models

In this chapter, we will cover:

  • Getting and preparing real-world medical data for exploring Decision Trees and Ensemble models in Spark 2.0
  • Building a classification system with Decision Trees in Spark 2.0
  • Solving regression problems with Decision Trees in Spark 2.0
  • Building a classification system with Random Forest Trees in Spark 2.0
  • Solving regression problems with Random Forest Trees in Spark 2.0
  • Building a classification system with Gradient Boosted Trees (GBT) in Spark 2.0
  • Solving regression problems with Gradient Boosted Trees (GBT) in Spark 2.0

Get Apache Spark 2.x Machine Learning Cookbook now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.