A-DDPG: Attention Mechanism-based Deep Reinforcement Learning for NFV

Nan He; S. Yang; Fan Li; S. Trajanovski; F.A. Kuipers; Xiaoming Fu

doi:10.1109/IWQOS52092.2021.9521285

Embedded Systems

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

2 Citations (Scopus)

750 Downloads (Pure)

Abstract

The efficacy of Network Function Virtualization (NFV) depends critically on (1) where the virtual network functions (VNFs) are placed and (2) how the traffic is routed. Unfortunately, these aspects are not easily optimized, especially under time-varying network states with different quality of service (QoS) requirements. Given the importance of NFV, many approaches have been proposed to solve the VNF placement and traffic routing problem. However, those prior approaches mainly assume that the state of the network is static and known, disregarding real-time network variations. To bridge that gap, in this paper, we formulate the VNF placement and traffic routing problem as a Markov Decision Process model to capture the dynamic network state transitions. In order to jointly minimize the delay and cost of NFV providers and maximize the revenue, we devise a customized Deep Reinforcement Learning (DRL) algorithm, called A-DDPG, for VNF placement and traffic routing in a real-time network. A-DDPG uses the attention mechanism to ascertain smooth network behavior within the general framework of network utility maximization (NUM). The simulation results show that A-DDPG outperforms the state-of-the-art in terms of network utility, delay, and cost.

Original language	English
Title of host publication	IWQoS 2021 - IEEE/ACM International Symposium on Quality of Service
Publisher	IEEE
Number of pages	10
ISBN (Print)	978-1-6654-3054-8
DOIs	https://doi.org/10.1109/IWQOS52092.2021.9521285
Publication status	Published - 2021

Bibliographical note

Accepted Author Manuscript

Keywords

Network function virtualization
deep reinforcement learning
placement
routing

Access to Document

10.1109/IWQOS52092.2021.9521285

IWQoS2021 (Accepted author manuscript, 1.23 MB)

Cite this

@inproceedings{e30ef7e11029474d93f033b42c675e9a,

title = "A-DDPG: Attention Mechanism-based Deep Reinforcement Learning for NFV",

abstract = "The efficacy of Network Function Virtualization (NFV) depends critically on (1) where the virtual network functions (VNFs) are placed and (2) how the traffic is routed. Unfortunately, these aspects are not easily optimized, especially under time-varying network states with different quality of service (QoS) requirements. Given the importance of NFV, many approaches have been proposed to solve the VNF placement and traffic routing problem. However, those prior approaches mainly assume that the state of the network is static and known, disregarding real-time network variations. To bridge that gap, in this paper, we formulate the VNF placement and traffic routing problem as a Markov Decision Process model to capture the dynamic network state transitions. In order to jointly minimize the delay and cost of NFV providers and maximize the revenue, we devise a customized Deep Reinforcement Learning (DRL) algorithm, called A-DDPG, for VNF placement and traffic routing in a real-time network. A-DDPG uses the attention mechanism to ascertain smooth network behavior within the general framework of network utility maximization (NUM). The simulation results show that A-DDPG outperforms the state-of-the-art in terms of network utility, delay, and cost.",

keywords = "Network function virtualization, deep reinforcement learning, placement, routing",

author = "Nan He and S. Yang and Fan Li and S. Trajanovski and F.A. Kuipers and Xiaoming Fu",

note = "Accepted Author Manuscript",

year = "2021",

doi = "10.1109/IWQOS52092.2021.9521285",

language = "English",

isbn = "978-1-6654-3054-8",

booktitle = "IWQoS 2021 - IEEE/ACM International Symposium on Quality of Service",

publisher = "IEEE",

address = "United States",

}

TY - GEN

T1 - A-DDPG: Attention Mechanism-based Deep Reinforcement Learning for NFV

AU - He, Nan

AU - Yang, S.

AU - Li, Fan

AU - Trajanovski, S.

AU - Kuipers, F.A.

AU - Fu, Xiaoming

N1 - Accepted Author Manuscript

PY - 2021

Y1 - 2021

N2 - The efficacy of Network Function Virtualization (NFV) depends critically on (1) where the virtual network functions (VNFs) are placed and (2) how the traffic is routed. Unfortunately, these aspects are not easily optimized, especially under time-varying network states with different quality of service (QoS) requirements. Given the importance of NFV, many approaches have been proposed to solve the VNF placement and traffic routing problem. However, those prior approaches mainly assume that the state of the network is static and known, disregarding real-time network variations. To bridge that gap, in this paper, we formulate the VNF placement and traffic routing problem as a Markov Decision Process model to capture the dynamic network state transitions. In order to jointly minimize the delay and cost of NFV providers and maximize the revenue, we devise a customized Deep Reinforcement Learning (DRL) algorithm, called A-DDPG, for VNF placement and traffic routing in a real-time network. A-DDPG uses the attention mechanism to ascertain smooth network behavior within the general framework of network utility maximization (NUM). The simulation results show that A-DDPG outperforms the state-of-the-art in terms of network utility, delay, and cost.

AB - The efficacy of Network Function Virtualization (NFV) depends critically on (1) where the virtual network functions (VNFs) are placed and (2) how the traffic is routed. Unfortunately, these aspects are not easily optimized, especially under time-varying network states with different quality of service (QoS) requirements. Given the importance of NFV, many approaches have been proposed to solve the VNF placement and traffic routing problem. However, those prior approaches mainly assume that the state of the network is static and known, disregarding real-time network variations. To bridge that gap, in this paper, we formulate the VNF placement and traffic routing problem as a Markov Decision Process model to capture the dynamic network state transitions. In order to jointly minimize the delay and cost of NFV providers and maximize the revenue, we devise a customized Deep Reinforcement Learning (DRL) algorithm, called A-DDPG, for VNF placement and traffic routing in a real-time network. A-DDPG uses the attention mechanism to ascertain smooth network behavior within the general framework of network utility maximization (NUM). The simulation results show that A-DDPG outperforms the state-of-the-art in terms of network utility, delay, and cost.

KW - Network function virtualization

KW - deep reinforcement learning

KW - placement

KW - routing

UR - http://www.scopus.com/inward/record.url?scp=85115406035&partnerID=8YFLogxK

U2 - 10.1109/IWQOS52092.2021.9521285

DO - 10.1109/IWQOS52092.2021.9521285

M3 - Conference contribution

SN - 978-1-6654-3054-8

BT - IWQoS 2021 - IEEE/ACM International Symposium on Quality of Service

PB - IEEE

ER -
