July 2017
Intermediate to advanced
382 pages
9h 13m
English
The final task of this chapter is to apply our newly gained skills to building a real spam filter!
Naive Bayes classifiers are a very popular model for email filtering. Their naivety, that is, the assumption that features are independent of one another given the class, lends itself nicely to the analysis of text data, where each feature is a word (or a bag of words) and it would not be feasible to model the dependence of every word on every other word.
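To make the independence assumption concrete, here is a minimal sketch of a bag-of-words naive Bayes classifier in plain Python. It is not the implementation used in this chapter: the helper names (`tokenize`, `train`, `predict`), the four toy messages, and the use of add-one (Laplace) smoothing are all assumptions made for illustration. Each word contributes its own log-probability to a class score, and no word-to-word dependence is modeled.

```python
import math
from collections import Counter


def tokenize(text):
    """Split a message into lowercase words (a very crude bag of words)."""
    return text.lower().split()


def train(messages, labels):
    """Count words per class and messages per class."""
    class_counts = Counter(labels)
    word_counts = {label: Counter() for label in class_counts}
    for text, label in zip(messages, labels):
        word_counts[label].update(tokenize(text))
    vocab = set()
    for counts in word_counts.values():
        vocab.update(counts)
    return word_counts, class_counts, vocab


def predict(text, model):
    """Return the class with the highest log-posterior score."""
    word_counts, class_counts, vocab = model
    n_messages = sum(class_counts.values())
    scores = {}
    for label in class_counts:
        # Log prior: fraction of training messages with this label.
        score = math.log(class_counts[label] / n_messages)
        n_words = sum(word_counts[label].values())
        for word in tokenize(text):
            if word not in vocab:
                continue  # ignore words never seen during training
            # Naive step: add each word's log-likelihood independently,
            # with add-one smoothing so unseen pairs never give log(0).
            count = word_counts[label][word]
            score += math.log((count + 1) / (n_words + len(vocab)))
        scores[label] = score
    return max(scores, key=scores.get)


# Hypothetical toy data, made up for this sketch.
messages = [
    "win free money now",
    "free prize claim now",
    "meeting at noon tomorrow",
    "lunch tomorrow with the team",
]
labels = ["spam", "spam", "ham", "ham"]

model = train(messages, labels)
print(predict("claim your free prize", model))
print(predict("team meeting tomorrow", model))
```

Because the score is a sum over words, training only needs one count per (word, class) pair, which is what keeps the model tractable even for large vocabularies.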
There are a number of good email datasets out there, such as the following: