Penetration into other research fields
Next we will discuss three different areas, which have blended with NLP to produce some interesting machine learning tasks. We will be discussing three specific areas:
- NLP and computer vision
- NLP and reinforcement learning
- NLP and generative adversarial networks
Combining NLP with computer vision
First we will discuss two applications where NLP is combined with various computer vision applications to process multimodal data (that is, images and text).
Visual Question Answering (VQA)
VQA is a novel research area, where the focus is to produce an answer to a textual question about an image. For example, consider these questions about Figure 11.5:
Q1: What color is the sofa?
Q2: How many black chairs are there?
Get Natural Language Processing with TensorFlow now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.