SPAM detection: Naïve bayesian classification and RPN expression-based LGP approaches compared

Citation data:

Advances in Intelligent Systems and Computing, ISSN: 2194-5357, Vol: 465, Page: 399-411

Publication Year:
2016
Usage 321
Downloads 173
Abstract Views 148
Captures 4
Readers 4
Citations 1
Citation Indexes 1
Repository URL:
http://publikace.k.utb.cz/handle/10563/1006428; http://hdl.handle.net/10563/1006428
DOI:
10.1007/978-3-319-33622-0_36
Author(s):
Meli, Clyde; Komínková Oplatková, Zuzana
Publisher(s):
Springer Nature America, Inc; Springer Verlag
Tags:
Engineering; Computer Science; Genetic programming (GP); Linear genetic programming (LGP); Naïve bayesian classifier; Reverse polish notation (RPN); Spam detection
conference paper description
An investigation is performed of a machine learning algorithm and the Bayesian classifier in the spam-filtering context. The paper shows the advantage of the use of Reverse Polish Notation (RPN) expressions with feature extraction compared to the traditional Naïve Bayesian classifier used for spam detection assuming the same features. The performance of the two is investigated using a public corpus and a recent private spam collection, concluding that the system based on RPN LGP (Linear Genetic Programming) gave better results compared to two popularly used open source Bayesian spam filters.