Chapter 4. Computer Vision
Some people call this artificial intelligence, but the reality is this technology will enhance us. So instead of artificial intelligence, I think we’ll augment our intelligence.
Ginni Rometty, Executive Chairman of IBM
Take a moment and look up from this book. Examine the room around you and take a quick inventory of what you see. Perhaps a desk, some chairs, bookshelves, and maybe even your laptop. Identifying these items is an effortless process for a human, even a young child.
Speaking of children, it’s quite easy to teach them the difference between multiple objects. Over time, parents show them items or pictures and then repeat the name or description. Show them a picture of an apple, and then repeat the word apple. In the kitchen, hand them an apple, and then repeat the word apple. Eventually, through much repetition, the child learns what an apple is along with its many color and shape variations—red, green, yellow. Over time, we provide information as to what is a correct example and what isn’t. But how does this translate to machines? How can we train computers to recognize patterns visually, as the human brain does?
Training computer vision models is done in much the same way as teaching children about objects. Instead of a person being shown physical items and having them identified, however, the computer vision algorithms are provided many examples of images that have been tagged with their contents. In addition to these positive examples ...