This chapter continues from Chapter 5, which covered fine-tuning techniques for language models. As you may recall, we explored methods such as LoRA, QLoRA, and prompt tuning.
Fine-tuning can introduce new information to a model or influence the format of its responses, but one important step remains before a model is published for use: aligning it with the needs or preferences of its users. The ultimate goal of alignment is to take ...