DDQP: A Double Deep Q-Learning Approach to Online Fault-Tolerant SFC Placement

doi:10.1109/TNSM.2021.3049298

Journal ArticleDOI

DDQP: A Double Deep Q-Learning Approach to Online Fault-Tolerant SFC Placement

Lei Wang, +3 more

- 05 Jan 2021 -

IEEE Transactions on Network and Service...

- Vol. 18, Iss: 1, pp 118-132

TLDR

In this article, a double deep Q-networks based online SFC placement scheme DDQP is proposed to deal with large continuous network state space, which offers constant generated state updates from active instances to standby instances to guarantee seamless redirection after failures.

Abstract:

Since Network Function Virtualization (NFV) decouples network functions (NFs) from the underlying dedicated hardware and realizes them in the form of software called Virtual Network Functions (VNFs), they are enabled to run in any resource-sufficient virtual machines. A service function chain (SFC) is composed of a sequential set of VNFs. As VNFs are vulnerable to various faults such as software failures, we consider how to deploy both active and standby SFC instances. Given the complexity and unpredictability of the network state, we propose a double deep Q-networks based online SFC placement scheme DDQP. Specifically, DDQP uses deep neural networks to deal with large continuous network state space. In the case of stateful VNFs, we offer constant generated state updates from active instances to standby instances to guarantee seamless redirection after failures. With the goal of balancing the waste of resources and ensuring service reliability, we introduce five progressive schemes of resource reservations to meet different customer needs. Our experimental results demonstrate that DDQP responds rapidly to arriving requests and reaches near-optimal performance. Specifically, DDQP outweighs the state-of-the-art method by 16.30% and 38.51% higher acceptance ratio under different schemes with 82x speedup on average. In order to enhance the integrity of the SFC state transition, we further proposed DDQP+, which extends DDQP by adding the delayed placement mechanism. Compared with DDQP, the design of the DDQP+ algorithm is more reasonable and comprehensive. The experiment results also show that DDQP+ achieved further improvement in multiple performance indicators.

DDQP: A Double Deep Q-Learning Approach to Online Fault-Tolerant SFC Placement

Citations

Deep Reinforcement Learning for Resource Management on Network Slicing: A Survey

Network Function Virtualization and Service Function Chaining Frameworks: A Comprehensive Review of Requirements, Objectives, Implementations, and Open Research Challenges

SARM: Service function chain active reconfiguration mechanism based on load and demand prediction

A reinforcement learning-based approach for availability-aware service function chain placement in large-scale networks

A reinforcement learning-based approach for availability-aware service function chain placement in large-scale networks

References

ImageNet Classification with Deep Convolutional Neural Networks

Human-level control through deep reinforcement learning

Rectified Linear Units Improve Restricted Boltzmann Machines

Mastering the game of Go with deep neural networks and tree search

PyTorch: An Imperative Style, High-Performance Deep Learning Library

Related Papers (5)

Online Fault-tolerant VNF Chain Placement: A Deep Reinforcement Learning Approach

Algorithms for Fault-Tolerant Placement of Stateful Virtualized Network Functions

An Integrated Virtualized Strategy for Fault Tolerance in Cloud Computing Environment

Performance comparison of state synchronization techniques in a distributed LTE EPC

A problem-specific fault-tolerance mechanism for asynchronous, distributed systems