You are not logged in.

Hybrid consensus pruning of ensemble classifiers for big data malware detection

Abawajy, Jemal, Chowdhury, Morshed and Kelarev, Andrei 2015, Hybrid consensus pruning of ensemble classifiers for big data malware detection, IEEE transactions on cloud computing, vol. 3, no. 2, pp. 1-11, doi: 10.1109/TCC.2015.2481378.

Attached Files
Name Description MIMEType Size Downloads

Title Hybrid consensus pruning of ensemble classifiers for big data malware detection
Author(s) Abawajy, Jemal
Chowdhury, MorshedORCID iD for Chowdhury, Morshed
Kelarev, Andrei
Journal name IEEE transactions on cloud computing
Volume number 3
Issue number 2
Start page 1
End page 11
Total pages 11
Publisher IEEE
Place of publication Piscataway, N.J.
Publication date 2015-10
ISSN 2168-7161
Keyword(s) ensemble pruning
ensemble classifiers
big data
Summary One of the major challenges for safeguarding the security of big data in the cloud is how to detect and prevent malicious software (malware). Despite of the fact that security and privacy are critical issues in big data, more research needs to be done in this area. As malware can affect the reliability of the data and subsequently the reputation of the system, it is critical to detect and remove malware from a system as early as possible. Recently, ensembles that combine a set of classifiers have been proposed as an efficient approach for malware detection. Unfortunately, the size, memory and processing requirements as well as the high cost of data transfer during training and operation make large ensemble classifiers unsuitable for big data in the cloud. To address this problem, we propose a new advanced ensemble pruning method, Hybrid Consensus Pruning (HCP), which is the first pruning algorithm that employs a fast consensus function to combine several classifier classes into one scheme. To test the effectiveness of the HCP method, we conducted experiments comparing its performance with Ensemble Pruning via Individual Contribution ordering (EPIC), Directed Hill Climbing Ensemble Pruning (DHCEP) and K-Means Pruning approaches for pruning very large ensemble classifiers for malware detection. The results of the experiments show that HCP achieved better results by producing better ensemble classifiers as compared to those created by EPIC, DHCEP and K-Means Pruning.
Language eng
DOI 10.1109/TCC.2015.2481378
Field of Research 089999 Information and Computing Sciences not elsewhere classified
Socio Economic Objective 970108 Expanding Knowledge in the Information and Computing Sciences
HERDC Research category C1.1 Refereed article in a scholarly journal
ERA Research output type C Journal article
Copyright notice ©2015, IEEE
Persistent URL

Connect to link resolver
Unless expressly stated otherwise, the copyright for items in DRO is owned by the author, with all rights reserved.

Version Filter Type
Citation counts: TR Web of Science Citation Count  Cited 0 times in TR Web of Science
Scopus Citation Count Cited 1 times in Scopus
Google Scholar Search Google Scholar
Access Statistics: 29 Abstract Views, 1 File Downloads  -  Detailed Statistics
Created: Thu, 10 Nov 2016, 13:10:30 EST

Every reasonable effort has been made to ensure that permission has been obtained for items included in DRO. If you believe that your rights have been infringed by this repository, please contact