9
Understanding Discrete Distributions
Latent Dirichlet Allocation (LDA) is named based on its mathematical concepts. Its name contains the word latent because it finds a similarity between documents in the latent space. It contains the word allocation because it allocates a document to topics. But most of all, it contains the word Dirichlet because it is based on the Dirichlet (pronounced as “Deer-e-kh-let”) distribution.
The Dirichlet distribution belongs to the discrete distribution family, which includes the Bernoulli, binomial, multinomial, beta, and Dirichlet distributions. The binomial and multinomial distributions are already frequently used in many machine-learning models that many of you will be familiar with. The beta distribution ...
Get The Handbook of NLP with Gensim now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.