Skip to Main Content
Modern Scala Projects
book

Modern Scala Projects

by Ilango gurusamy
July 2018
Intermediate to advanced content levelIntermediate to advanced
334 pages
8h 20m
English
Packt Publishing
Content preview from Modern Scala Projects

Project overview – problem formulation

In this chapter, the stated goal is to build a spam classifier, one that is capable of distinguishing spam terms in email messages that are mixed in with regular or expected email content as well. It is important to know that spam messages are email messages that are sent out to multiple recipients with the same content, as opposed to regular messages. We start with two email datasets, one that represents ham and one that represents spam. After stages of preprocessing, we fit the model on a training set, say 70% of the entire dataset.

This application is a typical spam filtering application in the sense that it works on text. We then put algorithms to work that help the ML process detect words, phrases, ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Scala Programming Projects

Scala Programming Projects

Mikael Valot, Nicolas Jorand

Publisher Resources

ISBN: 9781788624114Supplemental Content