July 2024
Beginner to intermediate
190 pages
5h 30m
English
In the previous chapter, we established the key distinction between generative and discriminative models. Discriminative models focus on predicting outputs by learning p(output∣input), or the conditional probability of some expected output given an input or set of inputs. In contrast, generative models, such as Generative Pretrained Transformer (GPT), generate text by predicting the next token (a partial word, whole word, or punctuation) using p(next token∣previous tokens), based on the probabilities of possible continuations given the current context. Tokens are represented as vectors containing embeddings that capture latent features and rich semantic dependencies ...
Read now
Unlock full access