Reading the data

For the dogs versus cats competition, there are two datasets; training, with 25,000 images of dogs and cats, 50% each, and testing. For the purposes of this chapter, we only need to download the training dataset. Once you have downloaded it, unpack it somewhere.

The filenames look like the following:

dog.9993.jpg dog.9994.jpg dog.9995.jpg cat.10000.jpg cat.10001.jpg cat.10002.jpg

The label (dog or cat) is encoded into the filename.

As you know, the first thing we always do is to split the data into training and validation ...

Get Java: Data Science Made Easy now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.