3
Classifying Topics of Newsgroup Posts
Large corporations and organizations sort through large volumes of unstructured text every day, so automating these tedious and time-consuming manual tasks becomes a necessity. The good news is that machine learning (ML) can also help with analyzing this type of data. This chapter shows how to tag a text document using a list of predefined topics. The aim is to assign each sample exactly one label, a task that becomes more challenging as the number of topics increases.
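To make the single-label setting concrete, here is a minimal sketch of a topic tagger that always returns exactly one topic per document. It is not the chapter's own implementation: it assumes a small multinomial naive Bayes model with add-one smoothing, written in plain Python, and the topic names, training sentences, and helper names (`tokenize`, `train`, `predict`) are hypothetical placeholders chosen only for illustration.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split on whitespace (a deliberately crude tokenizer)."""
    return text.lower().split()


def train(samples):
    """Collect per-topic word counts, per-topic document counts, and the vocabulary.

    samples is a list of (text, topic) pairs; each text carries exactly one topic.
    """
    word_counts = defaultdict(Counter)
    doc_counts = Counter()
    vocab = set()
    for text, topic in samples:
        tokens = tokenize(text)
        word_counts[topic].update(tokens)
        doc_counts[topic] += 1
        vocab.update(tokens)
    return word_counts, doc_counts, vocab


def predict(text, model):
    """Return the single topic with the highest log-probability score."""
    word_counts, doc_counts, vocab = model
    total_docs = sum(doc_counts.values())
    # Words never seen during training are ignored.
    tokens = [t for t in tokenize(text) if t in vocab]
    scores = {}
    for topic in doc_counts:
        total_words = sum(word_counts[topic].values())
        # Log prior: fraction of training documents with this topic.
        score = math.log(doc_counts[topic] / total_docs)
        for token in tokens:
            # Add-one (Laplace) smoothing avoids log(0) for unseen word/topic pairs.
            count = word_counts[topic][token] + 1
            score += math.log(count / (total_words + len(vocab)))
        scores[topic] = score
    return max(scores, key=scores.get)


# Hypothetical toy data: three made-up topics, two short posts each.
training_samples = [
    ("the shuttle launch reached orbit", "space"),
    ("nasa plans a new orbit mission", "space"),
    ("the engine and brakes need repair", "autos"),
    ("my car engine makes a noise", "autos"),
    ("the team scored a late goal", "hockey"),
    ("the goalie saved the game", "hockey"),
]

model = train(training_samples)
print(predict("the rocket reached orbit", model))
print(predict("engine noise in my car", model))
```

Because `predict` takes the maximum over all topic scores, every document receives one and only one label. Each additional topic adds another candidate to that comparison, which is one way to see why the task becomes harder as the list of topics grows.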
We will attack the problem using both supervised and unsupervised ML techniques. First, we expand on the basic exploratory data analysis presented in the previous chapter and create richer visualizations with extra meaning ...