Deakin University
Browse

Semi-supervised and compound classification of network traffic

Version 2 2024-06-06, 00:27
Version 1 2015-03-10, 10:20
journal contribution
posted on 2024-06-06, 00:27 authored by J Zhang, C Chen, Y Xiang, WANLEI Zhou
This paper presents a new semi-supervised method to effectively improve traffic classification performance when very few supervised training data are available. Existing semisupervised methods label a large proportion of testing flows as unknown flows due to limited supervised information, which severely affects the classification performance. To address this problem, we propose to incorporate flow correlation into both training and testing stages. At the training stage, we make use of flow correlation to extend the supervised data set by automatically labelling unlabelled flows according to their correlation to the pre-labelled flows. Consequently, a traffic classifier achieves excellent performance because of the enhanced training data set. At the testing stage, the correlated flows are identified and classified jointly by combining their individual predictions, so as to further boost the classification accuracy. The empirical study on the real-world network traffic shows that the proposed method significantly outperforms the state-of-the-art flow statistical feature based classification methods. Copyright © 2012 Inderscience Enterprises Ltd.

History

Journal

International journal of security and networks

Volume

7

Article number

4

Pagination

252-261

Location

Geneva, Switzerland

ISSN

1747-8405

eISSN

1747-8413

Language

eng

Publication classification

C Journal article, C1.1 Refereed article in a scholarly journal

Copyright notice

2012, Inderscience

Issue

4

Publisher

Inderscience