Chapter 9. Using Textual Inversion
Textual inversion (TI) is another way to add new capabilities to a pretrained model. Unlike Low-Rank Adaptation (LoRA), covered in Chapter 8, which fine-tunes the text encoder and the UNet attention weights, TI leaves the model weights untouched: it learns a new embedding vector that represents a concept from the training data and adds it to the text encoder's embedding space.
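The mechanism can be sketched in a few lines of PyTorch. This is an illustrative toy, not the diffusers implementation: the pretrained embedding table stays frozen, and a single new row is appended for a pseudo-token such as "<my-style>" (the token ids in the prompt below are placeholders, not real CLIP vocabulary lookups):

```python
import torch

# CLIP-L text-encoder sizes used by Stable Diffusion v1.x
vocab_size, dim = 49408, 768

# Pretrained token-embedding table; its rows stay frozen during TI training.
token_embeddings = torch.nn.Embedding(vocab_size, dim)
token_embeddings.weight.requires_grad_(False)

# The ONE trainable vector for the new pseudo-token "<my-style>",
# initialized from an existing token's embedding (a common choice).
new_vector = torch.nn.Parameter(token_embeddings.weight[42].clone())
extended = torch.cat([token_embeddings.weight, new_vector.unsqueeze(0)], dim=0)

new_token_id = vocab_size  # id assigned to "<my-style>"
# Placeholder ids standing in for "<bos> a photo of <my-style> <eos>"
prompt_ids = torch.tensor([49406, 320, 1125, 539, new_token_id, 49407])

# Embedding lookup: during training, only new_vector receives gradients.
prompt_embeds = extended[prompt_ids]
print(prompt_embeds.shape)  # torch.Size([6, 768])
```

Because only `new_vector` requires gradients, a trained TI "model" is just this one vector (or a handful of them), which is why TI embedding files are typically only a few kilobytes.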
In the context of Stable Diffusion, a text embedding is a representation of text as a numerical vector in a high-dimensional space, a form that machine learning algorithms can manipulate and process. In Stable Diffusion specifically, text embeddings are produced by the text encoder of the Contrastive Language-Image Pretraining (CLIP) [6] model.
To train a TI model, you only need a ...