In this section, we'll outline three approaches:
- Classic sliding window: Here, we'll use a regular classification network (classifier). This approach can work with any type of classification algorithm, but it's relatively slow and error-prone:
- Build an image pyramid. This is a combination of different scales of the same image (see the following photograph). For example, each scaled image can be two times smaller than the previous one. In this way, we'll be able to detect objects regardless of their size in the original image.
- Slide the classifier across the whole image. That is, we'll use each location of the image as an input to the classifier and the result will determine what type of object is in that location. ...