December 2025
Beginner to intermediate
360 pages
10h 48m
English
Text-to-image generation has seen remarkable progress in recent years, largely thanks to two classes of models: vision transformers (ViTs) and diffusion models. Diffusion models create images through a two-step process. First, they learn to gradually add random noise to clean images, step-by-step, until the images become pure noise. This is called the forward diffusion process. Then, the models are trained to remove noise from images in the reverse ...
Read now
Unlock full access