Study on ensemble classification methods towards spam filtering
In recent years, many researchers have combined multiple filters to improve the performance of spam filtering, and considerable effort has gone into different ensemble methods. How to select an appropriate ensemble method for spam filtering nevertheless remains an open problem. In this paper, we investigate this problem by designing a framework that compares the performance of various ensemble methods, which should help researchers fight spam email more effectively in applied systems. The experimental results indicate that online methods achieve good accuracy, while offline batch methods are clearly affected by the size of the data set. On a large data set, offline batch methods do not perform on par with online methods; among online methods, the parallel ensemble performs better when only complex filters are used.
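The abstract does not describe the framework's internals, so the following is only a minimal sketch of what an online parallel ensemble could look like: several filters classify each message independently, a majority vote decides, and every filter is updated once the true label arrives. The class names (`TokenCountFilter`, `PerceptronFilter`, `ParallelEnsemble`), the `run_online` helper, and the two toy filters are hypothetical illustrations, not the filters or the combination rule evaluated in the paper.

```python
from collections import Counter


class TokenCountFilter:
    """Hypothetical online filter: compares per-class token counts."""

    def __init__(self):
        self.counts = {True: Counter(), False: Counter()}

    def predict(self, tokens):
        spam = sum(self.counts[True][t] for t in tokens)
        ham = sum(self.counts[False][t] for t in tokens)
        return spam > ham

    def update(self, tokens, is_spam):
        self.counts[is_spam].update(tokens)


class PerceptronFilter:
    """Hypothetical online filter: mistake-driven perceptron over tokens."""

    def __init__(self):
        self.weights = Counter()

    def predict(self, tokens):
        return sum(self.weights[t] for t in tokens) > 0

    def update(self, tokens, is_spam):
        # Adjust weights only when the current prediction is wrong.
        if self.predict(tokens) != is_spam:
            step = 1 if is_spam else -1
            for t in tokens:
                self.weights[t] += step


class ParallelEnsemble:
    """All filters vote independently; a strict majority labels spam."""

    def __init__(self, filters):
        self.filters = filters

    def predict(self, tokens):
        votes = sum(f.predict(tokens) for f in self.filters)
        return 2 * votes > len(self.filters)

    def update(self, tokens, is_spam):
        for f in self.filters:
            f.update(tokens, is_spam)


def run_online(ensemble, stream):
    """Predict first, then train on each labelled message; return accuracy."""
    correct = 0
    total = 0
    for text, is_spam in stream:
        tokens = text.lower().split()
        correct += ensemble.predict(tokens) == is_spam
        ensemble.update(tokens, is_spam)
        total += 1
    return correct / total if total else 0.0
```

In this sketch the online setting shows up in `run_online`: each message is classified before its label is used for training, so accuracy reflects what the ensemble knew at that point in the stream. An offline batch method would instead train on the whole labelled set before classifying anything.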
Journal: Lecture notes in artificial intelligence
Volume: 5678
Pagination: 314 - 325
Publisher: Springer
Location: Heidelberg, Germany
ISSN: 0302-9743
eISSN: 1611-3349
Language: eng
Publication classification: C1 Refereed article in a scholarly journal
Copyright notice: 2009, Springer-Verlag Berlin Heidelberg