File(s) under permanent embargo
Iterative construction of hierarchical classifiers for phishing website detection
journal contribution
posted on 2014-08-01, 00:00 authored by Jemal AbawajyJemal Abawajy, Gleb BeliakovGleb Beliakov, Andrei Kelarev, Morshed ChowdhuryMorshed ChowdhuryThis article is devoted to a new iterative construction of hierarchical classifiers in SimpleCLI for the detection of phishing websites. Our new construction of hierarchical systems creates ensembles of ensembles in SimpleCLI by iteratively linking a top-level ensemble to another middle-level ensemble instead of a base classifier so that the top-level ensemble can generate a large multilevel system. This new construction makes it easy to set up and run such large systems in SimpleCLI. The present article concentrates on the investigation of performance of the iterative construction of such classifiers for the example of detection of phishing websites. We carried out systematic experiments evaluating several essential ensemble techniques as well as more recent approaches and studying their performance as parts of the iterative construction of hierarchical classifiers. The results presented here demonstrate that the iterative construction of hierarchical classifiers performed better than the base classifiers and standard ensembles. This example of application to the classification of phishing websites shows that the new iterative construction combining diverse ensemble techniques into the iterative construction of hierarchical classifiers can be applied to increase the performance in situations where data can be processed on a large computer. © 2014 ACADEMY PUBLISHER.
History
Journal
Journal of NetworksVolume
9Issue
8Pagination
2089 - 2098Publisher
Academy PublisherLocation
Oulu, FinlandPublisher DOI
ISSN
1796-2056Language
engPublication classification
C Journal article; C1 Refereed article in a scholarly journalCopyright notice
2014, Academy PublisherUsage metrics
Categories
No categories selectedLicence
Exports
RefWorks
BibTeX
Ref. manager
Endnote
DataCite
NLM
DC