Electronic Engineering and Information Science – Wang (Ed.)
© 2015 Taylor & Francis Group, London, ISBN: 978-1-138-02772-5
Random forest algorithm for spam filtering based on machine learning
W.B. Wang, F. Yin, H. Sun & P. Li
College of Computer Science and Technology, Harbin University of Science and Technology, Harbin, China
ABSTRACT: The large volume of spam greatly hampers the use of email. Machine learning techniques have been
successful in tackling spam. This paper employs a random forest algorithm for spam filtering. The algorithm is
appropriate for this task because it runs fast in real-time settings. To speed up this automated task,
a novel feature selection method is proposed to reduce the dimension of the feature vector. Ev ...