© The Author(s), under exclusive license to APress Media, LLC, part of Springer Nature 2022
S. M. Jain, Introduction to Transformers for NLP
https://doi.org/10.1007/978-1-4842-8844-3_3

3. BERT

Shashank Mohan Jain1  
(1)
Bangalore, India
 

In this chapter, you will learn about one of the implementations of the transformer architecture, developed by Google, called BERT.

Recent work by researchers at Google AI Language resulted in the publication of a paper introducing BERT (Bidirectional Encoder Representations from Transformers).

The most important technical advancement that BERT made is the application of bidirectional training of the popular attention-based model, the transformer, to language modeling. According to the findings of the study on language models, ...
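The difference bidirectional training makes can be illustrated with a minimal, dependency-free sketch. The token list and mask position below are illustrative examples, not drawn from the book: a left-to-right language model can only condition on tokens before the position it predicts, while a bidirectionally trained model like BERT conditions on tokens from both sides.

```python
# Illustrative sketch: context visible to a unidirectional vs. a
# bidirectional language model when predicting a masked token.
tokens = ["the", "cat", "[MASK]", "on", "the", "mat"]  # hypothetical example sentence
mask_pos = tokens.index("[MASK]")

# A left-to-right model sees only the tokens preceding the mask.
left_to_right_context = tokens[:mask_pos]

# A bidirectional model (BERT-style) sees tokens on both sides of the mask.
bidirectional_context = tokens[:mask_pos] + tokens[mask_pos + 1:]

print(left_to_right_context)    # ['the', 'cat']
print(bidirectional_context)    # ['the', 'cat', 'on', 'the', 'mat']
```

The extra right-hand context (`'on'`, `'the'`, `'mat'`) is precisely what lets a bidirectional model disambiguate the masked word (here, plausibly "sat") in ways a purely left-to-right model cannot.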
