13.4 Testing Goodness Of Fit

Having discussed the particulars of the chi-square distribution, let us return to the use of the test statistic (Eq. 13.4). Why is this test statistic actually structured to reveal goodness of fit? It should be intuitively clear that the notion of goodness of fit should be assessed on the basis of the degree of disparity between the sample or empirical distribution and the expected or theoretical distribution given that the latter is specified by the null hypothesis. Under H₀, the expected or theoretical frequencies for the k categories or cells are simply That is, e_i is the product between the sample size and the hypothesized relative frequency or theoretical probability . So if H₀ is true, e_i = n should be the expected number of occurrences for cell i under n repeated trials of our k-fold alternative experiment. The expected cell frequencies are given in column 4 of Table 13.1. Under this discussion, we may modify Equation (13.4) to read

(13.5)

Here U₀ serves as an index of goodness of fit given that H₀ specifies the theoretical distribution that is fitted ...

Get Statistical Inference: A Short Course now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.

Start your free trial

Statistical Inference: A Short Course by Michael J. Panik

13.4 Testing Goodness Of Fit

Don’t leave empty-handed

It’s yours, free.

Check it out now on O’Reilly