AI and ML for Coders in PyTorch

by Laurence Moroney
June 2025
Beginner to intermediate
444 pages
11h 32m
English
O'Reilly Media, Inc.

Chapter 8. Using ML to Create Text

With the release of ChatGPT in 2022, the term generative AI entered the common lexicon. This simple application, which let you chat with a cloud-based AI, seemed almost miraculous in how it could answer your queries with knowledge of almost everything in human experience. It worked by using transformers, a technique that is a very advanced evolution of the recurrent neural networks you saw in the last chapter.

A transformer learns the patterns that turn one piece of text into another. With a large enough transformer architecture and a large enough set of text to learn from, the GPT model (GPT stands for generative pretrained transformer) could predict the next tokens to follow a piece of text. When GPT was wrapped in an application that made it more user-friendly, a whole new industry was born.
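To make "predict the next tokens" concrete, here is a minimal sketch that swaps the transformer for a plain bigram frequency table: it counts which word follows which in a tiny corpus and returns the most frequent follower. This is not how GPT works internally; the toy corpus, the whitespace tokenization, and the function names `train_bigrams` and `predict_next` are all assumptions made up for illustration.

```python
from collections import Counter, defaultdict


def train_bigrams(text):
    """Count, for every token, which tokens follow it in the text."""
    tokens = text.lower().split()
    followers = defaultdict(Counter)
    for current, nxt in zip(tokens, tokens[1:]):
        followers[current][nxt] += 1
    return followers


def predict_next(followers, token):
    """Return the most frequent follower of `token`, or None if it has none."""
    counts = followers.get(token.lower())
    if not counts:
        return None
    return counts.most_common(1)[0][0]


# Hypothetical toy corpus, invented for illustration only.
corpus = "the cat sat on the mat and the cat slept"
model = train_bigrams(corpus)
print(predict_next(model, "the"))  # prints "cat" ("cat" follows "the" twice, "mat" once)
```

A real language model replaces the lookup table with a learned network and conditions on far more than one previous token, but the task is the same: given some text, score the candidates for what comes next.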

While creating models with transformers is beyond the scope of this book, we will look at their architecture in detail in Chapter 15.

The principles involved in training models with transformers can be replicated with smaller, simpler architectures such as RNNs or LSTMs. We'll explore that in this chapter, using a much smaller corpus of text: traditional Irish songs.
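One way to picture the training setup for such a next-token predictor is as a set of (context, next token) pairs cut from each line of text. The sketch below is an assumption about how that data could be prepared, not the chapter's own code: the sample line is invented, `make_training_pairs` is a hypothetical helper, the window size of 3 is arbitrary, and index 0 is reserved here as a padding value.

```python
def make_training_pairs(line, window=3):
    """Turn one line of text into (padded context ids, next-token id) pairs."""
    tokens = line.lower().split()
    # Assign ids starting at 1 so that 0 can serve as the padding value.
    vocab = {word: idx for idx, word in enumerate(sorted(set(tokens)), start=1)}
    ids = [vocab[t] for t in tokens]

    pairs = []
    for end in range(1, len(ids)):
        # Up to `window` tokens before position `end` form the context.
        context = ids[max(0, end - window):end]
        # Left-pad short contexts so every example has the same length.
        padded = [0] * (window - len(context)) + context
        pairs.append((padded, ids[end]))
    return vocab, pairs


# Hypothetical lyric-like line, invented for illustration only.
vocab, pairs = make_training_pairs("the wind blows over the green hills")
for context, label in pairs:
    print(context, "->", label)
```

Each pair asks the model the same question at a different point in the line: given these preceding tokens, which token comes next? Fixed-length, padded contexts like these are a common input shape for an RNN or LSTM, which would typically pass them through an embedding layer first.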

So, for example, consider this line of text from a famous TV show:

You know nothing, Jon Snow.

A next-token-predictor model, created with RNNs, came up with these song lyrics in response:

  • You know nothing, Jon Snow

  • the place where he’s stationed

  • be it Cork or in the ...

ISBN: 9781098199166