Chapter 5. Using the Snorkel-Labeled Dataset for Image Classification

The techniques described in this chapter can be used for image classification for any image datasets. This chapter will provide you with a holistic set of discussions and code that can help you get started quickly with using the dataset that has been labeled by Snorkel (from Chapter 3).

The chapter starts with a gentle introduction to different types of visual object recognition tasks and discussions on how image features are represented. Next, we discuss how transfer learning for image classification works. In the remainder of the chapter, we will use the indoor/outdoor dataset that has been labeled by Snorkel to fine-tune an image classification model using PyTorch.

Visual Object Recognition Overview

Visual object recognition is commonly used to identify objects in digital images, and consists of one or more computer vision tasks:

Image classification

Predict the type/class of an image (e.g., does the image consist of an indoor or outdoor scene?)

Object localization

Identify the objects present in an image with bounding boxes

Object detection

Identify the objects present in an image with bounding boxes and the type or class of the object corresponding to each bounding box

Image instance segmentation

Identify the objects present in an image and identify the pixels that belong to each of those objects

ImageNet is a visual database that consists of millions of images and is used by many computer vision ...

Get Practical Weak Supervision now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.