3

Combining CNNs and LSTMs

Convolutional Neural Networks (CNNs) are a type of deep learning model used to solve machine learning problems related to images, video, speech, and audio, such as image classification, object detection, segmentation, speech recognition, audio classification, and more. This is because CNNs use a special type of layer called convolutional layers, which have shared learnable parameters. The weight or parameter sharing works because the patterns to be learned in an image (such as edges or contours) are assumed to be independent of the location of the pixels in the image. Just as CNNs are applied to images, Long Short-Term Memory (LSTM) networks – which are a type of Recurrent Neural Network (RNN) – prove to be extremely ...

Get Mastering PyTorch - Second Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.