
Practical Predictive Analytics by Ralph Winters


Output from the summary() function

The summary() function breaks out the diabetes variable into counts. Of 768 observations, there are 500 negative and 268 positive observations.
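As a minimal sketch of how summary() produces such counts, the snippet below applies it to a small factor. The vector is a hypothetical stand-in with made-up values, not the book's data; the level names neg and pos are assumed for illustration.

```r
# Hypothetical toy outcome column (illustrative values only, not the real data)
diabetes <- factor(c("neg", "pos", "neg", "neg", "pos"),
                   levels = c("neg", "pos"))

# summary() on a factor returns the number of observations per level
counts <- summary(diabetes)
print(counts)

# table() plus prop.table() turns the same counts into proportions
props <- prop.table(table(diabetes))
print(round(props, 3))
```

Applied to the counts quoted above, the same arithmetic gives roughly 65% negative (500 of 768) and 35% positive (268 of 768).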

Note that the summary output reports no missing values (NA). Look closely, however, and you will see zeros for variables that should always have a nonzero measurement (glucose, pressure, insulin, and mass). A zero for the number of times pregnant is acceptable.
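One way to make the zeros visible is to count them per column. The sketch below assumes a small hypothetical data frame whose column names follow the variables named in the text; the values are invented for illustration and are not the book's data.

```r
# Hypothetical toy data frame (made-up values; column names taken from the text)
df <- data.frame(
  pregnant = c(0, 2, 5, 1),
  glucose  = c(148, 0, 183, 89),
  pressure = c(72, 66, 0, 0),
  insulin  = c(0, 0, 0, 94),
  mass     = c(33.6, 26.6, 0, 28.1)
)

# No NA values are present, so summary() would report none
na_counts <- colSums(is.na(df))
print(na_counts)

# Count zeros only in the columns where a zero is not a plausible measurement;
# pregnant is left out because zero is a valid value there
check_cols <- c("glucose", "pressure", "insulin", "mass")
zero_counts <- colSums(df[, check_cols] == 0)
print(zero_counts)
```

Checking is.na() and the zero counts separately shows why the two can disagree: a zero is a valid number to R, so it is never reported as missing.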

We will address the zeros after we perform some plotting.
