Given observations of the target variables, rows of features values, and each row of dimension , OLS seeks to find the weights of dimension that minimize the residual sum of squares of differences between the target variable and the predicted variable predicted by linear approximation:
- , which is the best fit for the equation ...