Chapter 8. From Models to Systems
Learning Objective: In this chapter, you will learn what changes when a generative AI model becomes a product. By the end of this chapter, you will recognize the eight architectural patterns that distinguish production AI applications from research scripts, see how each is solved differently across LTX Desktop, ComfyUI, and InvokeAI, and have implemented the same custom extension on all three.
Throughout this book we’ve moved through video AI from the inside out: foundations and a first generated video (Chapter 1); the data behind generation (Chapter 2); the model itself across pipelines, training, and fine-tuning (Chapters 3 through 5); understanding video with vision-language models (Chapter 6); and synchronizing audio with cross-modal attention (Chapter 7). By the end of Chapter 7, you could load LTX-2’s 22-billion-parameter checkpoint from Hugging Face, drive it from Python, and reason about why each architectural decision was made.
But a model is not ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access