Deep Multi-User Reinforcement Learning for Distributed Dynamic Spectrum Access

doi:10.1109/TWC.2018.2879433

Open AccessJournal ArticleDOI

Deep Multi-User Reinforcement Learning for Distributed Dynamic Spectrum Access

Oshri Naparstek, +1 more

- 01 Jan 2019 -

IEEE Transactions on Wireless Communicat...

- Vol. 18, Iss: 1, pp 310-323

Chats0

TLDR

A novel distributed dynamic spectrum access algorithm based on deep multi-user reinforcement leaning is developed for accessing the spectrum that maximizes a certain network utility in a distributed manner without online coordination or message exchanges between users.

Abstract:

We consider the problem of dynamic spectrum access for network utility maximization in multichannel wireless networks. The shared bandwidth is divided into $K$ orthogonal channels. In the beginning of each time slot, each user selects a channel and transmits a packet with a certain transmission probability. After each time slot, each user that has transmitted a packet receives a local observation indicating whether its packet was successfully delivered or not (i.e., ACK signal). The objective is a multi-user strategy for accessing the spectrum that maximizes a certain network utility in a distributed manner without online coordination or message exchanges between users. Obtaining an optimal solution for the spectrum access problem is computationally expensive, in general, due to the large-state space and partial observability of the states. To tackle this problem, we develop a novel distributed dynamic spectrum access algorithm based on deep multi-user reinforcement leaning. Specifically, at each time slot, each user maps its current state to the spectrum access actions based on a trained deep-Q network used to maximize the objective function. Game theoretic analysis of the system dynamics is developed for establishing design principles for the implementation of the algorithm. The experimental results demonstrate the strong performance of the algorithm.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Deep Learning for Intelligent Wireless Networks: A Comprehensive Survey

Qian Mao, +2 more

- 12 Jun 2018 -

IEEE Communications Surveys and Tutorial...

TL;DR: A comprehensive survey of the applications of DL algorithms for different network layers, including physical layer modulation/coding, data link layer access control/resource allocation, and routing layer path search, and traffic balancing is performed.

...read moreread less

Journal ArticleDOI

Deep Reinforcement Learning for User Association and Resource Allocation in Heterogeneous Cellular Networks

Nan Zhao, +5 more

- 13 Aug 2019 -

IEEE Transactions on Wireless Communicat...

TL;DR: A reinforcement learning approach is proposed to achieve the maximum long-term overall network utility while guaranteeing the quality of service requirements of user equipments (UEs) in the downlink of heterogeneous cellular networks.

...read moreread less

Journal ArticleDOI

Multi-Agent Deep Reinforcement Learning for Dynamic Power Allocation in Wireless Networks

Yasar Sinan Nasir, +1 more

- 01 Aug 2018 -

arXiv: Signal Processing

TL;DR: The proposed algorithm is shown to achieve near-optimal power allocation in real time based on delayed CSI measurements available to the agents and is especially suitable for practical scenarios where the system model is inaccurate and CSI delay is non-negligible.

...read moreread less

Journal ArticleDOI

Multi-Agent Deep Reinforcement Learning for Dynamic Power Allocation in Wireless Networks

Yasar Sinan Nasir, +1 more

- 08 Aug 2019 -

IEEE Journal on Selected Areas in Commun...

TL;DR: In this paper, a distributively executed dynamic power allocation scheme is developed based on model-free deep RL for transmit power control in wireless networks, where each transmitter collects CSI and quality of service (QoS) information from several neighbors and adapts its own transmit power accordingly.

...read moreread less

Journal ArticleDOI

Deep-Learning-Based Wireless Resource Allocation With Application to Vehicular Networks

Le Liang, +3 more

TL;DR: The key motivations and roadblocks of using deep learning for wireless resource allocation with application to vehicular networks are discussed and the deep reinforcement learning approach to address resource allocation problems that are difficult to handle in the traditional optimization framework is highlighted.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Posted Content

Playing Atari with Deep Reinforcement Learning

Volodymyr Mnih, +6 more

- 19 Dec 2013 -

arXiv: Learning

TL;DR: This work presents the first deep learning model to successfully learn control policies directly from high-dimensional sensory input using reinforcement learning, which outperforms all previous approaches on six of the games and surpasses a human expert on three of them.

...read moreread less

Journal ArticleDOI

A Survey of Dynamic Spectrum Access

Qing Zhao, +1 more

- 21 May 2007 -

IEEE Signal Processing Magazine

TL;DR: An overview of challenges and recent developments in both technological and regulatory aspects of opportunistic spectrum access (OSA) is presented, and the three basic components of OSA are discussed.

...read moreread less

Proceedings Article

Deep reinforcement learning with double Q-Learning

Hado van Hasselt, +2 more

TL;DR: In this paper, the authors show that the DQN algorithm suffers from substantial overestimation in some games in the Atari 2600 domain, and they propose a specific adaptation to the algorithm and show that this algorithm not only reduces the observed overestimations, but also leads to much better performance on several games.

...read moreread less

Journal ArticleDOI

Decentralized cognitive MAC for opportunistic spectrum access in ad hoc networks: A POMDP framework

Qing Zhao, +3 more

- 01 Apr 2007 -

IEEE Journal on Selected Areas in Commun...

TL;DR: An analytical framework for opportunistic spectrum access based on the theory of partially observable Markov decision process (POMDP) is developed and cognitive MAC protocols that optimize the performance of secondary users while limiting the interference perceived by primary users are proposed.

...read moreread less