March 2026
Intermediate to advanced
402 pages
11h 1m
English
This book is about aligning AI agents with human intent. Capabilities of AI have improved exponentially over the period of the last two decades, and some milestones in disciplines such as machine learning, computer vision, natural language processing, deep learning, and Reinforcement Learning (RL) played a prominent role in this rise. Some of these capabilities and applications demonstrate how well AI scales and impacts almost everything. This makes it very important that the behavior or actions taken by AI align with human intent, and this very necessity is also the path toward the safety of using this technology. In this book, we will delve into Reinforcement Learning from Human Feedback (RLHF), which ...
Read now
Unlock full access