
Classification model for imbalanced traffic data based on secondary feature extraction
Author(s) -
Shen Jian,
Xia Jingbo,
Shan Yong,
Wei Zekun
Publication year - 2017
Publication title -
iet communications
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.355
H-Index - 62
eISSN - 1751-8636
pISSN - 1751-8628
DOI - 10.1049/iet-com.2016.0332
Subject(s) - feature extraction , computer science , dimensionality reduction , data mining , dimension (graph theory) , feature (linguistics) , artificial intelligence , pattern recognition (psychology) , mathematics , linguistics , philosophy , pure mathematics
The non‐equilibrium of network traffic data brings about the non‐equilibrium of classification. Feature extraction is an effective method to reduce data dimensions, while it can intensify the influence of non‐equilibrium further. A secondary feature extraction algorithm of multidimensional assessment is proposed in this study. The features of network traffic are evaluated in different dimensions to provide the basis for feature extraction. Furthermore, a model dealing with imbalanced data is proposed based on secondary feature extraction and sampling. The model combines the benefits of dimension reduction and redistribution. The experiment results show that the proposed model can not only increase classification accuracy and decrease non‐equilibrium, but also enhance the performance of different classification algorithms.