12 Sequence-to-sequence learning: Part 2

This chapter covers

  • Implementing the attention mechanism for the seq2seq model
  • Generating visualizations from the attention layer to glean insights from the model

In the previous chapter, we built an English-to-German machine translator. The model was a sequence-to-sequence (seq2seq) model that learns to map an arbitrarily long input sequence to an arbitrarily long output sequence. It had two main components: an encoder and a decoder. To build it, we first downloaded a machine translation dataset, examined its structure, and applied some preprocessing (e.g., adding SOS and EOS tokens) to prepare it for the model. Next, we defined the machine translation model using standard Keras ...
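As a refresher, here is a minimal sketch of that kind of encoder-decoder model in Keras. The layer choice (GRU), vocabulary sizes, and dimensions below are illustrative assumptions, not the exact configuration from the previous chapter:

    from tensorflow.keras import layers, Model

    # Illustrative sizes only; the actual values may differ.
    en_vocab, de_vocab = 5000, 5000
    embed_dim, hidden_dim = 128, 256

    # Encoder: embed the English tokens and compress the
    # sequence into a single final state vector.
    enc_in = layers.Input(shape=(None,), name="encoder_input")
    enc_emb = layers.Embedding(en_vocab, embed_dim)(enc_in)
    _, enc_state = layers.GRU(hidden_dim, return_state=True)(enc_emb)

    # Decoder: consume the German sequence (beginning with the SOS
    # token), initialized with the encoder's final state, and predict
    # a distribution over the German vocabulary at every position.
    dec_in = layers.Input(shape=(None,), name="decoder_input")
    dec_emb = layers.Embedding(de_vocab, embed_dim)(dec_in)
    dec_out = layers.GRU(hidden_dim, return_sequences=True)(
        dec_emb, initial_state=enc_state
    )
    dec_pred = layers.Dense(de_vocab, activation="softmax")(dec_out)

    model = Model(inputs=[enc_in, dec_in], outputs=dec_pred)
    model.compile(optimizer="adam", loss="sparse_categorical_crossentropy")

During training, the decoder input is the target sentence prefixed with the SOS token, and the labels are the same sentence shifted by one position so that it ends with EOS (teacher forcing). Note that in this plain seq2seq setup the decoder sees only the encoder's final state; the attention mechanism introduced in this chapter lets it look back at every encoder position instead.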
