10 Topic modeling

This chapter covers

  • Introducing topic modeling with latent Dirichlet allocation
  • Exploring gensim, an NLP toolkit for topic modeling
  • Implementing an unsupervised topic modeling approach using gensim
  • Introducing several visualization techniques for topic exploration in data

The previous chapter introduced various NLP and machine-learning techniques for topic classification and topic analysis. As a reminder, here is the scenario you worked on: you are a content manager for a large news platform. The platform hosts texts from a wide variety of authors and specializes mainly in the following well-established topics: Politics, Finance, Science, Sports, and Arts. Your task is to decide, for every incoming ...
