6 Transformers

The transformer-based architectures have become almost universal in Natural Language Processing (NLP) (and beyond) when it comes to solving a wide variety of tasks, such as:

Neural machine translation
Text summarization
Text generation
Named entity recognition
Question answering
Text classification
Text similarity
Offensive message/profanity detection
Query understanding
Language modeling
Next-sentence prediction
Reading comprehension
Sentiment analysis
Paraphrasing

and a lot more.

In less than four years, when the Attention Is All You Need paper was published by Google Research in 2017, transformers managed to take the NLP community by storm, breaking any record achieved over the previous thirty years.

Transformer-based models ...

Get Deep Learning with TensorFlow and Keras - Third Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.

Start your free trial

Deep Learning with TensorFlow and Keras - Third Edition by Amita Kapoor, Antonio Gulli, Sujit Pal

6

Transformers

Don’t leave empty-handed

It’s yours, free.

Check it out now on O’Reilly