Chapter 7

Long Short-Term Memory (LSTM)

Learning Objectives

By the end of this chapter, you will be able to:

  • Describe the purpose of an LSTM
  • Evaluate the architecture of an LSTM in detail
  • Develop a simple binary classification model using LSTMs
  • Implement neural language translation and develop an English-to-German translation model

This chapter briefly introduces you to the LSTM architecture and its applications in the world of natural language processing.

Introduction

In the previous chapters, we studied Recurrent Neural Networks (RNNs) and a specialized architecture called the Gated Recurrent Unit (GRU), which helps combat the vanishing gradient problem. LSTMs offer yet another way to tackle the vanishing gradient problem. In this ...

Get Deep Learning for Natural Language Processing now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.