4.6. Overdispersion

When estimating logit models with grouped data, it often happens that the model doesn’t fit—the deviance and Pearson chi-square are large, relative to the degrees of freedom. Lack of fit is sometimes described as overdispersion. Overdispersion has two possible causes:

  • An incorrectly specified model: more interactions and/or nonlinearities are needed in the model.

  • Lack of independence of the observations: this can arise from unobserved heterogeneity that operates at the level of groups rather than individuals.

We’ve already seen examples of the first cause. Now let’s look at an example where overdispersion may arise from dependence among the observations. The sample consists of 40 U.S. biochemistry departments in the late 1950s ...

Get Logistic Regression Using SAS®: Theory and Application now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.