Understanding an image caption generator

A very popular domain of computer science is that of image processing. It deals with the manipulation of images and the various methods by which we can extract information from them. Another popular domain, Natural Language Processing (NLP), deals with how we can make machines that can understand and produce meaningful natural languages. Image captioning defines a mixture of the two topics, which attempts to first extract the information of objects appearing in any image and then to generate a caption describing the objects.

The caption should be generated in such a way that it is a meaningful string of words and is expressed in the form of a natural language sentence. 

Consider the following image: ...

Get Mobile Deep Learning with TensorFlow Lite, ML Kit and Flutter now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.