January 2025
Beginner to intermediate
432 pages
13h 16m
English
This chapter covers
In recent years, multimodal large language models (LLMs) have gained significant attention for their ability to handle various content formats, such as text, images, video, audio, and code. A notable example of this is text-to-image Transformers, such as OpenAI’s DALL-E 2, Google’s Imagen, and Stability AI’s Stable Diffusion. These models are capable of generating high-quality images based on textual descriptions. ...
Read now
Unlock full access