November 2019
Intermediate to advanced
346 pages
9h 36m
English
In the following steps, we build a classifier for wanted and unwanted email:
import osspam_emails_path = os.path.join("spamassassin-public-corpus", "spam")ham_emails_path = os.path.join("spamassassin-public-corpus", "ham")labeled_file_directories = [(spam_emails_path, 0), (ham_emails_path, 1)]
email_corpus = []labels = []for class_files, label in labeled_file_directories: files = os.listdir(class_files) for file in files: file_path = os.path.join(class_files, file) try: with open(file_path, "r") as currentFile: email_content = currentFile.read().replace("\n", ...