TY - GEN
T1 - The Impact of Feature Quality on SDN Traffic Learning and Classification
AU - Alzu'bi, Ahmad
AU - Khrais, Jumana
AU - Ennab, Noor
AU - Abuarqoub, Abdelrahman
N1 - Publisher Copyright:
© IET Conference Proceedings 2022.
PY - 2022/11/15
Y1 - 2022/11/15
N2 - Network traffic classification is crucial for facilitating sophisticated network management. Software-defined networks (SDN) provide a featured architecture where the control and data planes are decoupled to enable dynamic configuration. The control plane functionality is implemented in a programmable centralized controller that enables effective use of data-driven based on network traffic classification using efficient machine learning (ML) algorithms. However, there is a lack of standardized benchmarking criteria agreed by the SDN community to evaluate the techniques proposed to collect and classify the network traffic features. This paper presents an empirical analysis to investigate the impact of SDN traffic features on the performance of ML-based classifiers. This study has been conducted using a publicly available SDN traffic dataset. The procedure of feature engineering has also been comprehensively evaluated through three approaches of feature selection, which are low variance filter, high correlation filter, and backward feature elimination. The experimental results show that the performance of ML classifiers is highly driven by the quality traffic features fed to them; the highest quality the traffic features are the best accuracy is achieved.
AB - Network traffic classification is crucial for facilitating sophisticated network management. Software-defined networks (SDN) provide a featured architecture where the control and data planes are decoupled to enable dynamic configuration. The control plane functionality is implemented in a programmable centralized controller that enables effective use of data-driven based on network traffic classification using efficient machine learning (ML) algorithms. However, there is a lack of standardized benchmarking criteria agreed by the SDN community to evaluate the techniques proposed to collect and classify the network traffic features. This paper presents an empirical analysis to investigate the impact of SDN traffic features on the performance of ML-based classifiers. This study has been conducted using a publicly available SDN traffic dataset. The procedure of feature engineering has also been comprehensively evaluated through three approaches of feature selection, which are low variance filter, high correlation filter, and backward feature elimination. The experimental results show that the performance of ML classifiers is highly driven by the quality traffic features fed to them; the highest quality the traffic features are the best accuracy is achieved.
KW - Feature Engineering
KW - Machine Learning
KW - Network Traffic Classification
KW - Software-Defined Networks
UR - http://www.scopus.com/inward/record.url?scp=85174645173&partnerID=8YFLogxK
U2 - 10.1049/icp.2022.2446
DO - 10.1049/icp.2022.2446
M3 - Conference contribution
AN - SCOPUS:85174645173
VL - 2022
T3 - IET Conference Proceedings
SP - 192
EP - 197
BT - IET Conference Proceedings
PB - Institution of Engineering and Technology
T2 - 3rd International Conference on Distributed Sensing and Intelligent Systems, ICDSIS 2022
Y2 - 19 October 2022 through 21 October 2022
ER -