Deep Learning-Based Traffic Prediction for Elephant Flows in Software-Defined Networks

In an article published in the journal Scientific Reports, researchers proposed a traffic prediction model that uses deep learning techniques to identify large flows (elephant flows) and help prevent network congestion in software-defined networks (SDN).

Study: Deep Learning-Based Traffic Prediction for Elephant Flows in Software-Defined Networks. Image credit: Generated using DALL.E.3

This paper mainly focuses on the different approaches to enhancing the quality of service (QoS) in real-time applications like video streaming and voice over internet protocol (VoIP) calls. These applications generate data flows and require high bandwidth and low latency.

Background

SDN is a network architecture that separates the control plane from the data plane, enabling centralized and programmable control of network resources and behavior. This network architecture can potentially improve the QoS for various real-time applications.
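The split between the two planes can be pictured with a small toy model. The sketch below is not the Ryu or OpenFlow API; the class names `Switch` and `Controller` and their methods are hypothetical, chosen only to show that forwarding decisions live in the switch's table while the rules themselves come from a central controller.

```python
class Switch:
    """Data plane: forwards packets using only its local flow table."""

    def __init__(self, name):
        self.name = name
        self.flow_table = {}  # destination address -> output port

    def forward(self, dst):
        # Unknown destinations return None; a real switch would ask the controller.
        return self.flow_table.get(dst)


class Controller:
    """Control plane: holds the network-wide view and programs every switch."""

    def __init__(self):
        self.switches = {}

    def register(self, switch):
        self.switches[switch.name] = switch

    def install_rule(self, switch_name, dst, out_port):
        # The controller, not the switch, decides where traffic should go.
        self.switches[switch_name].flow_table[dst] = out_port


controller = Controller()
s1 = Switch("s1")
controller.register(s1)
controller.install_rule("s1", "10.0.0.2", out_port=2)

known_port = s1.forward("10.0.0.2")    # rule installed by the controller
unknown_port = s1.forward("10.0.0.9")  # no rule yet
print(known_port, unknown_port)
```

Because every rule passes through one place, a prediction model running alongside the controller could, in principle, influence routing for the whole network.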

However, one of the challenges in SDN is handling large, long-lived elephant flows. If elephant flows are not managed appropriately, they can cause congestion that degrades network performance and affects user experience. Therefore, it is essential to detect and predict elephant flows in advance and allocate optimal routes to mitigate potential issues and ensure a consistently high level of network performance.
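As a rough illustration of what "large and long-lived" means in practice, the sketch below labels flows with two fixed cutoffs. The thresholds, the `Flow` fields, and the sample addresses are assumptions made for this example; the article does not state which cutoffs, if any, were used for manual labelling.

```python
from dataclasses import dataclass

# Hypothetical cutoffs chosen only for illustration; the article gives none.
ELEPHANT_MIN_BYTES = 10_000_000  # 10 MB
ELEPHANT_MIN_SECONDS = 10.0


@dataclass
class Flow:
    src: str
    dst: str
    byte_count: int
    duration_s: float


def is_elephant(flow: Flow) -> bool:
    """A flow counts as an elephant only if it is both large and long-lived."""
    return (flow.byte_count >= ELEPHANT_MIN_BYTES
            and flow.duration_s >= ELEPHANT_MIN_SECONDS)


flows = [
    Flow("10.0.0.1", "10.0.0.2", 48_000_000, 35.0),  # bulk transfer
    Flow("10.0.0.3", "10.0.0.2", 1_200, 0.4),        # short request
    Flow("10.0.0.4", "10.0.0.5", 25_000_000, 2.0),   # large but brief burst
]
labels = ["elephant" if is_elephant(f) else "mouse" for f in flows]
print(labels)
```

Fixed cutoffs like these are easy to apply but only classify a flow after it has already grown large, which is one reason a predictive model is attractive.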

About the Research

Researchers designed a traffic prediction model that can identify elephant flows and prevent network congestion in advance in SDN. The model uses deep learning algorithms to learn from historical traffic data and predict the probability of elephant flows in real time. They used the SHapley Additive exPlanations (SHAP) technique, an explainable artificial intelligence (XAI) method, to explain the model's predictions and show which features most influence them. They used an SDN dataset that contains 104,345 rows and 23 columns (attributes) related to traffic flows, such as packet size, byte count, protocol, and duration. The dataset was generated using the Mininet emulator and a Ryu controller.

Methodology

Researchers proposed a three-step methodology to develop and evaluate the traffic prediction model. The first step is to cluster the traffic data into elephant and mice flows using an unsupervised algorithm from the H2O machine learning framework, which automatically labels traffic flows based on their characteristics. The next step is to remove anomalies from the data using a deep autoencoder neural network that can reconstruct the input data with minimal bias. In the last step, they applied the distributed random forest (DRF), gradient boosting machine (GBM), and eXtreme gradient boosting (XGBoost) algorithms to train and test the model. All three are supervised learning algorithms.
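The first step, unsupervised labelling, can be sketched with a tiny two-cluster k-means on a single feature. This is an illustrative stand-in written in plain Python, not the H2O implementation, and the article does not say which clustering algorithm the researchers used. The function name `two_means` and the sample byte counts (in MB) are hypothetical.

```python
def two_means(values, iterations=20):
    """Split 1-D values into two clusters; returns (labels, centroids)."""
    lo, hi = min(values), max(values)  # initialise centroids at the extremes
    labels = [0] * len(values)
    for _ in range(iterations):
        # Assignment step: each value joins the nearer centroid.
        labels = [0 if abs(v - lo) <= abs(v - hi) else 1 for v in values]
        small = [v for v, lab in zip(values, labels) if lab == 0]
        large = [v for v, lab in zip(values, labels) if lab == 1]
        # Update step: move each centroid to the mean of its members.
        new_lo = sum(small) / len(small)
        new_hi = sum(large) / len(large) if large else hi
        if (new_lo, new_hi) == (lo, hi):
            break  # converged
        lo, hi = new_lo, new_hi
    return labels, (lo, hi)


# Hypothetical per-flow byte counts in MB: four small flows, three large ones.
byte_counts_mb = [0.002, 0.01, 0.05, 0.3, 80.0, 95.0, 120.0]
labels, centroids = two_means(byte_counts_mb)
names = ["elephant" if lab == 1 else "mouse" for lab in labels]
print(names)
```

The cluster labels produced this way can then serve as targets for the supervised models in the third step.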

Metrics such as accuracy, precision, and loss were used to evaluate the model's performance. The model is then explained using SHAP, which quantifies how much each feature contributes to a prediction and visualizes feature importance and effects.
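For readers unfamiliar with these metrics, here is a minimal plain-Python sketch of how they are computed for a binary elephant (1) versus mouse (0) classifier. The article does not specify which loss function was reported; binary log loss is assumed here, and the labels and probabilities are made-up sample values.

```python
import math


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def precision(y_true, y_pred, positive=1):
    """Of the flows predicted positive, the fraction that really are positive."""
    predicted_pos = [t for t, p in zip(y_true, y_pred) if p == positive]
    if not predicted_pos:
        return 0.0
    return sum(t == positive for t in predicted_pos) / len(predicted_pos)


def log_loss(y_true, y_prob, eps=1e-15):
    """Mean binary cross-entropy; probabilities are clipped to avoid log(0)."""
    total = 0.0
    for t, p in zip(y_true, y_prob):
        p = min(max(p, eps), 1 - eps)
        total -= t * math.log(p) + (1 - t) * math.log(1 - p)
    return total / len(y_true)


# Hypothetical ground truth and predicted elephant probabilities.
y_true = [1, 0, 1, 1, 0, 0]
y_prob = [0.9, 0.2, 0.6, 0.4, 0.7, 0.1]
y_pred = [1 if p >= 0.5 else 0 for p in y_prob]

acc = accuracy(y_true, y_pred)
prec = precision(y_true, y_pred)
loss = log_loss(y_true, y_prob)
print(acc, prec, loss)
```

Accuracy and precision look only at the final labels, while the loss also penalizes confident wrong probabilities, so the two kinds of metric can disagree.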

Research Findings

The findings show that the clustering model can label the traffic data into elephant and mice flows with 39.11% accuracy, which is close to the manual label assignment. Additionally, the autoencoder model can separate normal traffic flows from abnormal ones (that is, remove anomalies) using a threshold value of 0.091.
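A common way to use an autoencoder for anomaly removal is to compare each input with its reconstruction and discard rows whose error exceeds a cutoff. The sketch below assumes the reported 0.091 threshold applies to a mean squared reconstruction error, which the article does not confirm. The feature vectors and "reconstructions" are invented values standing in for a trained network's output.

```python
THRESHOLD = 0.091  # value reported in the article; applying it to MSE is an assumption


def reconstruction_error(original, reconstructed):
    """Mean squared error between a feature vector and its reconstruction."""
    return sum((o - r) ** 2 for o, r in zip(original, reconstructed)) / len(original)


# Hypothetical (input, autoencoder output) pairs with features scaled to [0, 1].
pairs = [
    ([0.10, 0.20, 0.30], [0.12, 0.18, 0.31]),  # reconstructed well
    ([0.90, 0.05, 0.40], [0.30, 0.50, 0.45]),  # reconstructed poorly
    ([0.50, 0.50, 0.50], [0.45, 0.55, 0.52]),  # reconstructed well
]

errors = [reconstruction_error(x, x_hat) for x, x_hat in pairs]
kept = [x for (x, _), err in zip(pairs, errors) if err <= THRESHOLD]
removed = [x for (x, _), err in zip(pairs, errors) if err > THRESHOLD]
print(len(kept), len(removed))
```

Rows the network reconstructs poorly are treated as anomalies because the autoencoder has learned the structure of typical traffic and fails on inputs that do not fit it.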

The performance of all the implemented models was reported in terms of accuracy and loss on both the training and validation datasets. Among these, the DRF model stands out as the top performer, achieving 100% accuracy with a loss of 0.00000408. SHAP helped explain how the models make predictions and which features most affect the outcomes. Features such as protocol, source address, destination address, packet count, and byte count have the highest impact on predicting elephant flows, and the researchers visualized the positive and negative contributions of each feature using force plots and summary plots.
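SHAP is built on Shapley values, which split a single prediction among the input features. The sketch below computes exact Shapley values for a three-feature toy scoring function by enumerating every feature subset. It does not use the SHAP library, and the function `toy_model`, its coefficients, and the baseline are hypothetical; the researchers' models and data are not reproduced here.

```python
from itertools import combinations
from math import factorial


def shapley_values(model, x, baseline):
    """Exact Shapley values for one prediction; absent features take baseline values."""
    n = len(x)

    def value(subset):
        mixed = [x[i] if i in subset else baseline[i] for i in range(n)]
        return model(mixed)

    phi = []
    for i in range(n):
        others = [j for j in range(n) if j != i]
        total = 0.0
        for size in range(n):
            # Weight of a coalition of this size in the Shapley formula.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                s = set(subset)
                total += weight * (value(s | {i}) - value(s))
        phi.append(total)
    return phi


def toy_model(features):
    """Hypothetical elephant score over [packet_count, byte_count, duration]."""
    packet_count, byte_count, duration = features
    return 0.2 * packet_count + 0.5 * byte_count + 0.3 * byte_count * duration


x = [0.8, 0.9, 0.7]         # the flow being explained (scaled features)
baseline = [0.1, 0.1, 0.1]  # reference flow representing "typical" traffic
phi = shapley_values(toy_model, x, baseline)
print(phi)
```

The contributions always sum to the difference between the prediction for the flow and the prediction for the baseline, which is the property force plots rely on. Exact enumeration grows exponentially with the number of features, so practical tools approximate it.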

Applications

This research has potential applications in various domains, including electricity, communications, and informatics. Specifically, the proposed model can be used to predict and explain traffic patterns in network settings such as sensor networks, cellular networks, vehicular networks, and multimedia networks. It can also be applied in areas such as VoIP, video streaming, online gaming, and the internet of things (IoT). It can also help ensure that QoS requirements for latency, bandwidth, and packet loss are met.

Conclusion

In conclusion, the paper shows that a traffic prediction model using deep learning and XAI techniques can achieve high accuracy and low loss in identifying elephant flows and preventing network congestion in SDN. It also shows how the SHAP technique can provide detailed explanations of the prediction results and feature importance.

According to the findings, the prediction and explanation model can be integrated into the SDN controller or switch for real-time, adaptive traffic management. Overall, the proposed model performs well, but its accuracy could be improved further with other advanced deep learning algorithms or more data.

Journal reference:

Written by

Muhammad Osama

Muhammad Osama is a full-time data analytics consultant and freelance technical writer based in Delhi, India. He specializes in transforming complex technical concepts into accessible content. He has a Bachelor of Technology in Mechanical Engineering with specialization in AI & Robotics from Galgotias University, India, and he has extensive experience in technical content writing, data science and analytics, and artificial intelligence.

Citations

Please use one of the following formats to cite this article in your essay, paper or report:

  • APA

    Osama, Muhammad. (2023, December 14). Deep Learning-Based Traffic Prediction for Elephant Flows in Software-Defined Networks. AZoAi. Retrieved on September 19, 2024 from https://www.azoai.com/news/20231128/Deep-Learning-Based-Traffic-Prediction-for-Elephant-Flows-in-Software-Defined-Networks.aspx.

  • MLA

    Osama, Muhammad. "Deep Learning-Based Traffic Prediction for Elephant Flows in Software-Defined Networks". AZoAi. 19 September 2024. <https://www.azoai.com/news/20231128/Deep-Learning-Based-Traffic-Prediction-for-Elephant-Flows-in-Software-Defined-Networks.aspx>.

  • Chicago

    Osama, Muhammad. "Deep Learning-Based Traffic Prediction for Elephant Flows in Software-Defined Networks". AZoAi. https://www.azoai.com/news/20231128/Deep-Learning-Based-Traffic-Prediction-for-Elephant-Flows-in-Software-Defined-Networks.aspx. (accessed September 19, 2024).

  • Harvard

    Osama, Muhammad. 2023. Deep Learning-Based Traffic Prediction for Elephant Flows in Software-Defined Networks. AZoAi, viewed 19 September 2024, https://www.azoai.com/news/20231128/Deep-Learning-Based-Traffic-Prediction-for-Elephant-Flows-in-Software-Defined-Networks.aspx.
