2 Training large language models

This chapter covers

Explaining how LLMs are trained
Introducing the emergent properties of LLMs
Exploring the harms and vulnerabilities that come from training LLMs

For decades, the digital economy has run on the currency of data. The business of collecting and trading information about who we are and what we do online is worth trillions of dollars, and as more of our daily activities have moved onto the internet, the mill has ever more grist to grind. Large language models (LLMs) are inventions of the internet age, emulating human language by vacuuming up terabytes of text data found online.

The process has yielded both predictable and unpredictable results. Notably, there are significant questions ...

Introduction to Generative AI by Numa Dhamani, Maggie Engler