Chapter 6: Advanced Model Building – Part II

In the previous chapter, Chapter 5, Advanced Model Building – Part I, we detailed the process for building an enterprise-grade supervised learning model on the H2O platform. In this chapter, we round out our advanced model-building topics by doing the following:

  • Demonstrating how to build H2O supervised learning models within an Apache Spark pipeline
  • Introducing H2O's unsupervised learning method
  • Discussing best practices for updating H2O models
  • Documenting requirements to ensure H2O model reproducibility

We begin this chapter by introducing Sparkling Water pipelines, a method for embedding H2O models natively within a Spark pipeline. In enterprise settings where Spark is heavily utilized, we ...

Get Machine Learning at Scale with H2O now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.