O'Reilly logo

Effective Amazon Machine Learning by Alexis Perrier

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Summary

In this chapter, we focused on two important elements of a predictive analytics project: the data and the evaluation of the predictive power of the model. We first listed the most common problems encountered with raw data, their impact on the linear regression model, and ways to solve them. The reader should now be able to identify and deal with missing values, outliers, imbalanced datasets, and normalization.

We also introduced the two most frequent problems in predictive analytics: underfitting and overfitting. L1 and L2 regularization is an important element in the Amazon ML platform, which helps overcome overfitting and make models more robust and able to handle previously unseen data.

We are now ready to dive into the Amazon ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required