1

Overview of Large Language Models

In 2017, a team at Google Brain introduced an advanced artificial intelligence (AI) deep learning architecture called the Transformer. Since then, the Transformer has become the standard for tackling various natural language processing (NLP) tasks in academia and industry. It is likely that you have interacted with models built on top of the Transformer architecture in recent years without even realizing it. Google, for example, experimented with using the Bidirectional Encoder Representations from Transformer (BERT), an LLM the company created, to enhance its search engine by better understanding users’ search queries. In more recent years, Google started to use Gemini, another LLM it created, to overhaul ...

Get Quick Start Guide to Large Language Models: Strategies and Best Practices for ChatGPT, Embeddings, Fine-Tuning, and Multimodal AI, 2nd Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.