13.3.3 The Problem of Heteroskedasticity

Besides the assumptions previously discussed, the probability distribution of each error term ui in Yi = a + b1 ⋅ X1i + b2 ⋅ X2i + ⋯ + bk ⋅ Xki + ui (i = 1, 2, …, n) must have the same variance; that is, the error terms must be homoskedastic. Therefore:

$$\operatorname{Var}(u_i) = E(u_i^2) = \sigma_u^2 \tag{13.39}$$
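The constant-variance condition in Expression (13.39) can be checked informally on simulated data. The following is a minimal sketch and not the book's own procedure: it generates one simple linear regression with homoskedastic errors and one whose error standard deviation grows with X, fits each by ordinary least squares, and compares the residual variance in the lower and upper halves of X. The coefficients (a = 1, b = 0.5), the error scales, the sample size, and the helper name `residual_variance_by_half` are all illustrative assumptions.

```python
import numpy as np


def residual_variance_by_half(x, y):
    """Fit Y = a + b*X by OLS and return the residual variance
    for the lower and upper halves of the sorted X values."""
    b, a = np.polyfit(x, y, 1)          # polyfit returns [slope, intercept]
    resid = y - (a + b * x)
    order = np.argsort(x)
    half = len(x) // 2
    low = resid[order[:half]]           # residuals at small X
    high = resid[order[half:]]          # residuals at large X
    return low.var(ddof=1), high.var(ddof=1)


rng = np.random.default_rng(0)          # fixed seed for reproducibility
n = 2000
x = rng.uniform(1.0, 10.0, size=n)

# Homoskedastic case: Var(u_i) is the same for every observation.
u_homo = rng.normal(0.0, 2.0, size=n)
y_homo = 1.0 + 0.5 * x + u_homo

# Heteroskedastic case (assumed form): the standard deviation of u_i
# is proportional to X_i, so the variance is not constant.
u_hetero = rng.normal(0.0, 1.0, size=n) * (0.5 * x)
y_hetero = 1.0 + 0.5 * x + u_hetero

homo_low, homo_high = residual_variance_by_half(x, y_homo)
het_low, het_high = residual_variance_by_half(x, y_hetero)

ratio_homo = homo_high / homo_low
ratio_hetero = het_high / het_low

print(f"homoskedastic   variance ratio (high X / low X): {ratio_homo:.2f}")
print(f"heteroskedastic variance ratio (high X / low X): {ratio_hetero:.2f}")
```

A ratio close to 1 is what (13.39) implies. A ratio well above 1 indicates that the residual spread widens as X increases. Splitting the sample in two is only a rough diagnostic chosen here for brevity; it is not a formal test.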

Fig. 13.39 illustrates, for a simple linear regression model, the heteroskedasticity problem, that is, the nonconstant variance of the residuals along the explanatory variable. In other words, there is a correlation between the error terms and the X variable, ...
