Skip to Content
Generative AI on AWS
book

Generative AI on AWS

by Chris Fregly, Antje Barth, Shelbee Eigenbrode
November 2023
Intermediate to advanced
312 pages
8h 15m
English
O'Reilly Media, Inc.
Book available
Content preview from Generative AI on AWS

Chapter 6. Parameter-Efficient Fine-Tuning

As we discussed in previous chapters, training generative models is computationally expensive. Adapting models to your domain through full fine-tuning requires memory not just to store the model, but also various other parameters that are required during the training process. In contrast to full fine-tuning, parameter-efficient fine-tuning (PEFT) provides a set of techniques allowing you to fine-tune your models while utilizing less compute resources.

There are a variety of PEFT techniques and categories explored in a paper on scaling.1 The techniques vary in implementation, but in general, each focuses on freezing all or most of the model’s original parameters and extending or replacing model layers by training an additional, much smaller, set of parameters. The most commonly used techniques fall into the additive and reparameterization categories.

Additive techniques, such as prompt tuning, augment the model by fine-tuning and adding extra parameters or layers to the pretrained model. Reparameterization techniques, such as Low-Rank Adaptation (LoRA), allow for adaptation using low-rank representations to reduce the number of training parameters and compute resources required to fine-tune.

In this chapter, you’ll learn about a few specific PEFT techniques that can be applied to generative models, including prompt tuning, LoRA, and QLoRA. This chapter focuses on key concepts illustrated through large language model (LLM) examples; ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Kubernetes for the Absolute Beginners - Hands-On

Kubernetes for the Absolute Beginners - Hands-On

KodeKloud
Building AI Agents with LLMs: Harnessing the Power of Generative AI with Autonomous Agents

Building AI Agents with LLMs: Harnessing the Power of Generative AI with Autonomous Agents

Abi Aryan, Shawn “swyx” Wang, Div Garg, Kence Anderson, Yohei Nakajima, Jaya Gupta, Arjun Bansal

Publisher Resources

ISBN: 9781098159214Errata PageSupplemental Content