Chapter 5. The Data Factor

The sculpture is already complete within the marble block before I start my work. It is already there; I just have to chisel away the superfluous material.

—Michelangelo Buonarroti

From incorrect (or inadequate or poorly relevant) data only stems incorrect (or inadequate or poorly relevant) answers. It’s the underlying equation of machine learning, and it is not different at all from the fundamental equation that rules life and behavior of human beings.

An intelligent system learns how to achieve its declared goals from provided data. Therefore, it’s data that drives the algorithm toward the expected outcome. For this reason, low relevance of content, inaccuracy, or even shortage of facts inevitably leads to little ...

Get Introducing Machine Learning now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.