z score standardization

This technique consists of subtracting the mean of the column from each value in the column, and then dividing the result by the standard deviation of the column. The formula to achieve this is the following:

  z = (x - μ) / σ

Here, x is the original value and z is its standardized value.

The result of standardization is that each feature is rescaled to have the mean and standard deviation of a standard normal distribution (the shape of the feature's distribution itself is not changed), as follows:

  • μ=0
  • σ=1

μ is the mean and σ is the standard deviation from the mean.
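
The procedure described above (subtract the column mean, then divide by the column standard deviation) can be sketched in a few lines of Python. This is a minimal illustration, not the book's own code: the function name z_score_standardize and the sample column are hypothetical, and the use of the population standard deviation (statistics.pstdev) rather than the sample standard deviation is an assumption.

```python
from statistics import mean, pstdev


def z_score_standardize(values):
    """Return the z score of every number in `values` (hypothetical helper)."""
    mu = mean(values)
    # Assumption: population standard deviation; use statistics.stdev
    # instead if the sample standard deviation is wanted.
    sigma = pstdev(values)
    if sigma == 0:
        raise ValueError("standard deviation is zero; z scores are undefined")
    return [(x - mu) / sigma for x in values]


# Made-up example column with mean 5 and population standard deviation 2.
column = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]
z = z_score_standardize(column)

print(z)                    # the standardized values
print(mean(z), pstdev(z))   # mean and standard deviation after rescaling
```

After rescaling, the mean of the z scores is 0 and their standard deviation is 1, up to floating-point rounding. A constant column has σ=0, so the sketch raises an error rather than dividing by zero.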

In summary, the z score (also called the standard score) represents the number of standard deviations by which the value of an observation or data point differs from the mean value of what is observed ...
