6

Topic Modeling

In this chapter, we will cover topic modeling, or the classification of topics present in a corpus of text. Topic modeling is a very useful technique that can give us an idea about which topics appear in a document set. For example, topic modeling is used for trend discovery on social media. Also, in many cases, it is useful to do topic modeling as part of the preliminary data analysis of a dataset to understand which topics appear in it.

There are many different algorithms available to do this. All of them try to find similarities between different texts and put them into several clusters. These different clusters indicate different topics.

You will learn how to create and use topic models via various techniques with the BBC ...

Get Python Natural Language Processing Cookbook - Second Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.