Chapter 1. Foundations of Video Generation
Learning objective: In this chapter, you’ll learn the foundational principles and emerging methods used to generate video with AI.
Video generation represents one of the most challenging frontiers in AI. While generating a single image requires understanding spatial relationships within a frame, video generation introduces challenges that exponentially multiply this complexity. Models must account not only for the content of each frame but also for how those elements change logically and coherently over time.
This chapter guides you from understanding these challenges to creating high-resolution videos with advanced generative models. By the end, you’ll realize why diffusion transformers (DiTs) have transformed the field and you’ll gain practical skills for working with cutting-edge video AI systems.
We begin with immediate, hands-on success—generating a sample video in minutes—then build a deeper understanding of the technology that makes it possible. ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access