Apache Mahout Essentials by Jayani Withanawasam


Linear regression with Apache Spark

Linear regression is one of the most commonly used methods for describing the relationship between predictors (or covariates) and outcomes.

Linear regression suits problems where the target variable is continuous, that is, where the goal is to predict real-valued outcomes. The problem domain can include multiple variables (features).

How does linear regression work?

Linear regression focuses on finding optimal values for the parameters (coefficients) so that the resulting hyperplane fits the training data as closely as possible.
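To make the idea of a fitted hyperplane concrete, the model can be written as a weighted sum of the features. The symbols used here (h for the model, theta for the coefficients, x for the features, n for the number of features) are assumed notation for illustration and do not come from the text above.

```latex
h_\theta(x) = \theta_0 + \theta_1 x_1 + \theta_2 x_2 + \dots + \theta_n x_n
```

With a single feature (n = 1) this reduces to a straight line with intercept theta_0 and slope theta_1; with more features it describes a hyperplane.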

Linear regression is approached as a minimization problem: the coefficients are chosen to minimize the mean squared error, which is the average of the squared differences between the predicted outcome and the actual outcome over the training set.
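The minimization described above can be sketched in a few lines of plain Python. This is only an illustrative sketch, not the Spark implementation: the function names (predict, mean_squared_error, fit), the toy data, the learning rate, and the choice of batch gradient descent as the optimizer are all assumptions made for the example.

```python
def predict(theta, x):
    """Evaluate the linear model theta[0] + theta[1]*x[0] + ... for one sample."""
    return theta[0] + sum(t * xi for t, xi in zip(theta[1:], x))


def mean_squared_error(theta, X, y):
    """Average of squared differences between predicted and actual outcomes."""
    return sum((predict(theta, x) - yi) ** 2 for x, yi in zip(X, y)) / len(y)


def fit(X, y, learning_rate=0.05, iterations=5000):
    """Minimize the mean squared error with batch gradient descent.

    Gradient descent is an assumed optimizer chosen for illustration.
    """
    m, n = len(y), len(X[0])
    theta = [0.0] * (n + 1)  # intercept followed by one coefficient per feature
    for _ in range(iterations):
        errors = [predict(theta, x) - yi for x, yi in zip(X, y)]
        # Gradient of the mean squared error with respect to each coefficient.
        grad = [2.0 * sum(errors) / m]
        for j in range(n):
            grad.append(2.0 * sum(e * x[j] for e, x in zip(errors, X)) / m)
        # Step against the gradient to reduce the error.
        theta = [t - learning_rate * g for t, g in zip(theta, grad)]
    return theta


# Toy data (made up for illustration) generated from y = 1 + 2x.
X = [[0.0], [1.0], [2.0], [3.0]]
y = [1.0, 3.0, 5.0, 7.0]
theta = fit(X, y)
```

On this noise-free toy data the fitted coefficients should approach an intercept of 1 and a slope of 2, at which point the mean squared error is close to zero. On real data the error would not reach zero, and the learning rate and iteration count would need tuning.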

The squared error function is also known ...
