Machine Learning in Java - Second Edition

by AshishSingh Bhatia, Bostjan Kaluza
November 2018
Intermediate to advanced
300 pages
7h 42m
English
Packt Publishing

Text Mining with Mallet - Topic Modeling and Spam Detection

In this chapter, we'll first discuss what text mining is, what kinds of analysis it offers, and why you might want to use it in your application. We'll then discuss how to work with Mallet, a Java library for natural-language processing, covering data import and text pre-processing. Afterward, we will look into two text-mining applications: topic modeling, where we will discuss how text mining can be used to identify the topics found in text documents without reading them individually, and spam detection, where we will discuss how to automatically classify text documents into categories.
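
To make the idea of text pre-processing concrete before Mallet is introduced, here is a minimal plain-Java sketch of three common steps: lowercasing, tokenizing, and removing stop words, followed by a simple term count. It deliberately does not use Mallet, which provides its own components for these steps. The class name, method names, and the tiny stop-word list are all hypothetical and chosen only for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

// Illustrative only: a plain-Java stand-in for typical text pre-processing.
// This is NOT Mallet's API; all names here are hypothetical.
final class TextPreprocessingSketch {

    // A tiny, made-up stop-word list; real lists are much longer.
    static final Set<String> STOP_WORDS =
            new HashSet<>(Arrays.asList("a", "an", "and", "the", "is", "of", "to", "in"));

    // Lowercase the text, split it on runs of non-letter characters,
    // and drop empty strings and stop words.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String raw : text.toLowerCase(Locale.ROOT).split("[^\\p{L}]+")) {
            if (!raw.isEmpty() && !STOP_WORDS.contains(raw)) {
                tokens.add(raw);
            }
        }
        return tokens;
    }

    // Count how often each token occurs, keeping first-seen order.
    static Map<String, Integer> termCounts(List<String> tokens) {
        Map<String, Integer> counts = new LinkedHashMap<>();
        for (String token : tokens) {
            counts.merge(token, 1, Integer::sum);
        }
        return counts;
    }

    public static void main(String[] args) {
        List<String> tokens = tokenize("Free money! Claim the free prize.");
        System.out.println(tokens);
        System.out.println(termCounts(tokens));
    }
}
```

Term counts like these are the kind of document representation that both applications named above can start from: a topic model looks for groups of words that tend to occur together, while a spam classifier learns which words separate one category from another.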

This chapter will cover the following topics:

  • Introducing text mining
  • Installing and working ...

ISBN: 9781788474399