1
Deconstructing the Training Process
We already know that training neural network models takes a long time to finish. Otherwise, we would not be here discussing ways to run this process faster. But which characteristics make the building process of these models so computationally heavy? Why does the training step take so long? To answer these questions, we need to understand the computational burden of the training phase.
In this chapter, we will first remember how the training phase works under the hood. We will understand what makes the training process so computationally heavy.
Here is what you will learn as part of this first chapter:
- Remembering the training process
- Understanding the computational burden of the training phase
- Understanding ...
Get Accelerate Model Training with PyTorch 2.X now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.