Chapter 8: Project: Text Generation with Autoregressive Models
In this chapter, we apply what we have learned about autoregressive models in a hands-on project: generating human-like text. The model learns to predict the next word in a sequence from the words that precede it. By repeatedly predicting a word and appending it to the sequence, the model can generate entirely new text that mimics the style and structure of the training data.
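To make the next-word idea concrete before any neural network is involved, here is a minimal sketch that swaps in a simple bigram count model: it predicts each word from only the single previous word. This is not the model the chapter builds; the toy corpus and the names `train_bigram` and `generate` are assumptions made for illustration.

```python
import random
from collections import defaultdict


def train_bigram(words):
    """Map each word to the list of words observed directly after it."""
    followers = defaultdict(list)
    for prev, nxt in zip(words, words[1:]):
        followers[prev].append(nxt)
    return followers


def generate(followers, start, max_words, seed=0):
    """Generate up to max_words words, one at a time, from a start word."""
    rng = random.Random(seed)  # fixed seed so runs are repeatable
    out = [start]
    for _ in range(max_words - 1):
        candidates = followers.get(out[-1])
        if not candidates:  # the last word was never followed by anything
            break
        # Sampling from the raw list picks frequent followers more often.
        out.append(rng.choice(candidates))
    return out


# Toy corpus (an assumption; a real project needs far more text).
corpus = "the cat sat on the mat and the cat ran".split()
model = train_bigram(corpus)
sample = generate(model, "the", 8)
print(" ".join(sample))
```

The loop in `generate` is the autoregressive part: each new word is chosen conditioned on what has already been produced, then fed back in as context for the next step. A neural model would replace the lookup table with learned probabilities over a longer context.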
8.1 Data Collection and Preprocessing
The first step in any machine learning project is data collection and preprocessing. For our text generation project, we will need a large corpus of text data to train our model. The choice of dataset can have a significant impact ...
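As a rough illustration of what preprocessing for next-word prediction can involve, the sketch below lowercases and tokenizes a string, builds a word-to-id vocabulary, and slices the id sequence into (context, next word) training pairs. The specific steps, the regular expression, the sample text, and the helper names `tokenize`, `build_vocab`, and `make_pairs` are all assumptions for illustration, not the pipeline this chapter prescribes.

```python
import re


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


def build_vocab(tokens):
    """Assign each distinct token an integer id in order of first appearance."""
    vocab = {}
    for tok in tokens:
        if tok not in vocab:
            vocab[tok] = len(vocab)
    return vocab


def make_pairs(ids, context_size):
    """Pair each window of context_size ids with the id that follows it."""
    return [
        (ids[i:i + context_size], ids[i + context_size])
        for i in range(len(ids) - context_size)
    ]


# Placeholder text (an assumption); a real corpus would be read from files.
raw_text = "The cat sat. The cat ran!"
tokens = tokenize(raw_text)
vocab = build_vocab(tokens)
ids = [vocab[t] for t in tokens]
pairs = make_pairs(ids, context_size=2)
print(pairs)
```

Each pair gives the model an input (the context) and a target (the word to predict), which is the supervised form of the next-word task described in the chapter introduction.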