March 2020
You are already aware that networks such as YOLO and SSD predict objects using predefined anchor boxes. For each object, only one of the available boxes is chosen as its match. That box is assigned the class of the object, and the network predicts offsets that adjust the box to fit the object.
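To make the idea of offsets concrete, here is a minimal sketch of one way they can be expressed relative to an anchor box. It assumes the center-size parameterization commonly used with SSD-style anchors, with boxes written as (cx, cy, w, h); the function names `encode_offsets` and `decode_offsets` and the sample values are hypothetical, and other detectors may encode offsets differently.

```python
import math

# Assumption: boxes are (cx, cy, w, h) tuples in normalized image coordinates.

def encode_offsets(anchor, gt):
    """Express a ground truth box as offsets relative to an anchor box."""
    acx, acy, aw, ah = anchor
    gcx, gcy, gw, gh = gt
    tx = (gcx - acx) / aw      # center shift, scaled by anchor width
    ty = (gcy - acy) / ah      # center shift, scaled by anchor height
    tw = math.log(gw / aw)     # log ratio of widths
    th = math.log(gh / ah)     # log ratio of heights
    return (tx, ty, tw, th)

def decode_offsets(anchor, offsets):
    """Invert encode_offsets: apply offsets to an anchor to recover a box."""
    acx, acy, aw, ah = anchor
    tx, ty, tw, th = offsets
    return (acx + tx * aw, acy + ty * ah, aw * math.exp(tw), ah * math.exp(th))

# Hypothetical anchor and ground truth box, for illustration only.
anchor = (0.5, 0.5, 0.2, 0.4)
gt = (0.55, 0.45, 0.25, 0.3)

offsets = encode_offsets(anchor, gt)
recovered = decode_offsets(anchor, offsets)
```

Under this assumption, the offsets are the regression targets for the matched anchor, and decoding the predicted offsets against the same anchor yields the final box.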
So how do we choose that single box? As you might have guessed, intersection over union (IoU) is used for that purpose. The correspondence between the ground truth boxes and the anchor boxes can be made as follows: