1 Ensemble methods: Hype or hallelujah?

This chapter covers

  • Defining and framing the ensemble learning problem
  • Motivating the need for ensembles in different applications
  • Understanding how ensembles handle fit versus complexity
  • Implementing our first ensemble with ensemble diversity and model aggregation

In October 2006, Netflix announced a $1 million prize for the team that could improve movie recommendations by 10% via Netflix’s own proprietary recommendation system, CineMatch. The Netflix Grand Prize was one of the first-ever open data science competitions and attracted tens of thousands of teams.

The training set consisted of 100 million ratings that 480,000 users had given to 17,000 movies. Within three weeks, 40 teams had already beaten ...

Get Ensemble Methods for Machine Learning now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.