Skip to Content
Data Science in R
book

Data Science in R

by Deborah Nolan
April 2015
Beginner to intermediate
539 pages
15h 21m
English
Chapman and Hall/CRC
Content preview from Data Science in R

Chapter 3

Using Statistics to Identify Spam

Deborah Nolan

University of California, Berkeley

Duncan Temple Lang

University of California, Davis

3.1 Introduction

People are terrific at spotting spam in their mail reader with a quick glance at the subject line and sender, and when that approach is not conclusive, a glimpse at the contents of the message is usually enough to classify the message. But how do we design an automated procedure to classify and eliminate these unwanted messages to save us the time and irritation of having to sort through them in our inbox? Spam filters used by mail readers examine various characteristics of an email before deciding whether to place it in your inbox or spam folder. This decision is in part based on a statistical ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Functional Programming in R: Advanced Statistical Programming for Data Science, Analysis and Finance

Functional Programming in R: Advanced Statistical Programming for Data Science, Analysis and Finance

Thomas Mailund
Analyzing Baseball Data with R

Analyzing Baseball Data with R

Max Marchi, Jim Albert

Publisher Resources

ISBN: 9781482234817