A comparison of the classification of disparate malware collected in different time periods

Islam, R; Tian, Ronghua; Moonsamy, Veelasha; Batten, Lynn

tian-acomparisonofthe-2012.pdf (874.48 kB)

A comparison of the classification of disparate malware collected in different time periods

journal contribution

posted on 2012-06-01, 00:00 authored by R Islam, Ronghua Tian, Veelasha Moonsamy, Lynn BattenLynn Batten

It has been argued that an anti-virus strategy based on malware collected at a certain date, will not work at a later date because malware evolves rapidly and an anti-virus engine is then faced with a completely new type of executable not as amenable to detection as the first was.

In this paper, we test this idea by collecting two sets of malware, the first from 2002 to 2007, the second from 2009 to 2010 to determine how well the anti-virus strategy we developed based on the earlier set [18] will do on the later set. This anti-virus strategy integrates dynamic and static features extracted from the executables to classify malware by distinguishing between families. We also perform another test, to investigate the same idea whereby we accumulate all the malware executables in the old and new dataset, separately, and apply a malware versus cleanware classification.

The resulting classification accuracies are very close for both datasets, with a difference of approximately 5.4% for both experiments, the older malware being more accurately classified than the newer malware. This leads us to conjecture that current anti-virus strategies can indeed be modified to deal effectively with new malware.

History

Journal

Journal of networks

Volume

7

Issue

6

Pagination

946 - 955

Publisher

Academy Publisher

Location

Oulu, Finland

Publisher DOI

https://doi.org/10.4304/jnw.7.6.946-955

ISSN

1796-2056

Language

eng

Publication classification

C1 Refereed article in a scholarly journal

Copyright notice

2012, The Authors

Usage metrics

Keywords

classification dynamic malware static

Licence

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

A comparison of the classification of disparate malware collected in different time periods

History

Journal

Volume

Issue

Pagination

Publisher

Location

Publisher DOI

ISSN

Language

Publication classification

Copyright notice

Usage metrics

Categories

Keywords

Licence

Exports