Polynomial Regression

The relationship between y and x often turns out not to be a straight line. However, Occam's razor requires that we fit a straight-line model unless a non-linear relationship is significantly better at describing the data. So this begs the question: how do we assess the significance of departures from linearity? One of the simplest ways is to use polynomial regression.

The idea of polynomial regression is straightforward. As before, we have just one continuous explanatory variable, x, but we can fit higher powers of x, such as x2 and x3, to the model in addition to x to explain curvature in the relationship between y and x. It is useful to experiment with the kinds of curves that can be generated with very simple models. Even if we restrict ourselves to the inclusion of a quadratic term, x2, there are many curves we can describe, depending upon the signs of the linear and quadratic terms:

In the top left-hand panel, there is a curve with positive but declining slope, with no hint of a hump (y = 4 + 2x − 0.1x2). The top right-hand graph shows a curve with a clear maximum (y = 4 + 2x − 0.2x2), and at bottom left we have a curve with a clear minimum (y = 12 − 4x + 0.35x2). The bottom right-hand curve shows a positive association between y and x with the slope increasing as x increases (y = 4 + 0.5x + 0.1x2). So you can see that a simple quadratic model with three parameters (an intercept, a slope for x, and a slope for x2) is capable of describing a wide range ...

Get The R Book now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.