5 LLM Pretraining Methods

Anitha Velu
Raghu Ramamoorthy
S. M. Manasa
A. Prasanth

Abstract

Generative artificial intelligence (AI), an AI technique produces original text, sounds, 3D models, animation, and images. It is powered by large-scale machine learning (ML) models that leverage pretrained deep neural networks on massive datasets. Pretraining mostly aims to guess the next word in a sentence or fill in masked words inside the sequence. Through this unsupervised learning exercise, the model learns to comprehend the linguistic structures and statistical trends. Through pretraining, large language model (LLM) acquires a general understanding of syntax, grammar, and semantics. It helps in establishing a strong basis for language comprehension ...

Get Generative AI and LLMs now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.