ANOVA test

Now let's talk about another popular hypothesis test, called ANOVA. It is used to test whether similar data points coming from different groups or under a different set of experiments are statistically similar to each other or different—for example, the average height of different sections of a class in a school or the peptide length of a certain protein found in humans across different ethnicities.

ANOVA calculates two metrics to conduct this test:

  • Variance among different groups
  • Variance within each group

Based on these metrics, a statistic is calculated with variance among different groups as a numerator. If it is a statistically large enough number, it means that the variance among different groups is larger than the variance ...

Get Mastering pandas - Second Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.