Deakin University
Browse

File(s) under permanent embargo

Classification of malware based on string and function feature selection

conference contribution
posted on 2010-01-01, 00:00 authored by R Islam, Ronghua Tian, Lynn BattenLynn Batten, S Versteeg

Anti-malware software producers are continually challenged to identify and counter new malware as it is released into the wild. A dramatic increase in malware production in recent years has rendered the conventional method of manually determining a signature for each new malware sample untenable. This paper presents a scalable, automated approach for detecting and classifying malware by using pattern recognition algorithms and statistical methods at various stages of the malware analysis life cycle. Our framework combines the static features of function length and printable string information extracted from malware samples into a single test which gives classification results better than those achieved by using either feature individually. In our testing we input feature information from close to 1400 unpacked malware samples to a number of different classification algorithms. Using k-fold cross validation on the malware, which includes Trojans and viruses, along with 151 clean files, we achieve an overall classification accuracy of over 98%.

History

Event

Cybercrime and Trustworthy Computing. Workshop (2nd : 2010 : Ballarat, Victoria)

Pagination

9 - 17

Publisher

IEEE

Location

Ballarat, Victoria

Place of publication

Piscataway, N.J.

Start date

2010-07-19

End date

2010-07-20

ISBN-13

9780769541860

Language

eng

Publication classification

E1 Full written paper - refereed

Copyright notice

2010, Institute of Electrical and Electronics Engineers (IEEE)

Title of proceedings

CTC 2010 : Proceedings of the Second Cybercrime and Trustworthy Computing Workshop 2010

Usage metrics

    Research Publications

    Categories

    No categories selected

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC