April 2019
Intermediate to advanced
544 pages
17h 29m
English
Part 1 gathered the tools for natural language processing and dove into machine learning with statistics-driven vector space models. You discovered that even more meaning could be found when you looked at the statistics of connections between words.[1] You learned about algorithms such as latent semantic analysis that can help make sense of those connections by gathering words into topics.
1 Conditional probability is one term for these connection statistics (how often a word occurs given that other words occur before or after the “target” word). Cross correlation is another one of these statistics (the likelihood of words occurring together). The singular values and singular vectors of the word--document ...
Read now
Unlock full access