August 2018
Intermediate to advanced
272 pages
7h 2m
English
Localization can be achieved with a network architecture similar to the one we learned about in Chapter 3, Image Classification in TensorFlow.
In addition to predicting the class labels, the network outputs a flag indicating whether an object is present, together with the coordinates of the object's bounding box. The bounding box is usually described by four numbers: the x and y coordinates of its upper-left corner, along with its height and width.
For example, in this case, we have two classes, C1 (Car) and C2 (People), to predict. The output of our network will look something like this:
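One way to picture such an output is as a flat vector of seven numbers. The following is a minimal sketch in plain Python, not the network itself. The ordering of the values (presence flag, then x, y, height, width, then one score per class), the 0.5 threshold, and the names `Detection` and `decode_output` are assumptions made for illustration only.

```python
from dataclasses import dataclass
from typing import List, Optional

# C1 and C2 from the text; the index order is an assumption.
CLASS_NAMES = ["Car", "People"]


@dataclass
class Detection:
    label: str
    x: float       # upper-left x coordinate
    y: float       # upper-left y coordinate
    height: float
    width: float


def decode_output(output: List[float], threshold: float = 0.5) -> Optional[Detection]:
    """Interpret a vector laid out as [presence, x, y, height, width, C1, C2].

    This layout is hypothetical; a real model may order its outputs differently.
    """
    if len(output) != 5 + len(CLASS_NAMES):
        raise ValueError("unexpected output length")

    presence, x, y, height, width = output[:5]
    class_scores = output[5:]

    # If the presence flag is low, the remaining values carry no meaning.
    if presence < threshold:
        return None

    # Pick the class with the highest score.
    best = max(range(len(class_scores)), key=lambda i: class_scores[i])
    return Detection(CLASS_NAMES[best], x, y, height, width)


car = decode_output([0.9, 10.0, 20.0, 50.0, 80.0, 0.8, 0.2])
nothing = decode_output([0.1, 0.0, 0.0, 0.0, 0.0, 0.5, 0.5])
```

Here `car` describes a box labelled "Car", while `nothing` is `None` because the presence flag falls below the threshold, so the box and class values are ignored.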

The model would work as follows: ...