Skip to main navigation Skip to search Skip to main content

A machine learning approach for feature selection traffic classification using security analysis

  • Muhammad Shafiq*
  • , Xiangzhan Yu
  • , Ali Kashif Bashir
  • , Hassan Nazeer Chaudhry
  • , Dawei Wang
  • *Corresponding author for this work
  • School of Computer Science and Technology, Harbin Institute of Technology
  • University of the Faroe Islands
  • Polytechnic University of Milan
  • National Computer Network Emergency Response Technical Team/Coordination Center of China

Research output: Contribution to journalArticlepeer-review

Abstract

Class imbalance has become a big problem that leads to inaccurate traffic classification. Accurate traffic classification of traffic flows helps us in security monitoring, IP management, intrusion detection, etc. To address the traffic classification problem, in literature, machine learning (ML) approaches are widely used. Therefore, in this paper, we also proposed an ML-based hybrid feature selection algorithm named WMI_AUC that make use of two metrics: weighted mutual information (WMI) metric and area under ROC curve (AUC). These metrics select effective features from a traffic flow. However, in order to select robust features from the selected features, we proposed robust features selection algorithm. The proposed approach increases the accuracy of ML classifiers and helps in detecting malicious traffic. We evaluate our work using 11 well-known ML classifiers on the different network environment traces datasets. Experimental results showed that our algorithms achieve more than 95% flow accuracy results.

Original languageEnglish
Pages (from-to)4867-4892
Number of pages26
JournalJournal of Supercomputing
Volume74
Issue number10
DOIs
StatePublished - 1 Oct 2018
Externally publishedYes

Keywords

  • Class imbalance
  • Feature selection
  • Machine learning
  • Network traffic classification
  • Security

Fingerprint

Dive into the research topics of 'A machine learning approach for feature selection traffic classification using security analysis'. Together they form a unique fingerprint.

Cite this