January 2018
Intermediate to advanced
310 pages
7h 48m
English
Johnson et al., in the paper https://www.cv-foundation.org/openaccess/content_cvpr_2016/papers/Johnson_DenseCap_Fully_Convolutional_CVPR_2016_paper.pdf, proposed a method for dense captioning. First, let's see some results, to understand the task:

As you can see, separate captions are generated for objects and actions in the image; hence the name; dense captioning. Here is the architecture proposed by Johnson et al.:

The architecture is essentially a combination ...
Read now
Unlock full access