Journal ArticleDOI

A Q-Learning Scheme for Fair Coexistence Between LTE and Wi-Fi in Unlicensed Spectrum

15 May 2018-IEEE Access (Institute of Electrical and Electronics Engineers (IEEE))-Vol. 6, pp 27278-27293
TL;DR: The system model of the mLTE-U scheme in coexistence with Wi-Fi is studied, and mLTE-U is enhanced with a Q-learning technique used for autonomous selection of the appropriate combinations of TXOP and muting period that can provide fair coexistence between co-located mLTE-U and Wi-Fi networks.
Abstract: In recent years, the growth of wireless traffic has pushed the wireless community to search for solutions that can assist in more efficient management of the spectrum. Toward this direction, the operation of long term evolution (LTE) in unlicensed spectrum (LTE-U) has been proposed. Targeting a global solution that respects the regional regulations worldwide, 3GPP has published the LTE licensed assisted access (LAA) standard. According to LTE LAA, a listen before talk (LBT) procedure must precede any LTE transmission burst in the unlicensed spectrum. However, the proposed standard may cause coexistence issues between LTE and Wi-Fi, especially when the latter does not use frame aggregation. Toward balanced channel access, we have proposed mLTE-U, an adaptive LTE LBT scheme. According to mLTE-U, LTE uses a variable transmission opportunity (TXOP), followed by a variable muting period. This muting period can be exploited by co-located Wi-Fi networks to gain access to the medium. In this paper, the system model of the mLTE-U scheme in coexistence with Wi-Fi is studied. In addition, mLTE-U is enhanced with a Q-learning technique that is used for autonomous selection of the appropriate combinations of TXOP and muting period that can provide fair coexistence between co-located mLTE-U and Wi-Fi networks. Simulation results showcase the performance of the proposed model and reveal the benefit of using Q-learning for self-adaptation of mLTE-U to the changes of the dynamic wireless environment, toward fair coexistence with Wi-Fi. Finally, the Q-learning mechanism is compared with conventional selection schemes, showing the superior performance of the proposed model over less complex mechanisms.
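As a rough illustration of the approach described in the abstract, the sketch below shows a tabular Q-learning agent that picks (TXOP, muting period) combinations under an epsilon-greedy policy. The concrete action values, the state encoding, and the Jain's-fairness reward are illustrative assumptions for the sake of the example, not the paper's exact formulation.

```python
import random
from collections import defaultdict

# Hypothetical discrete action space: (TXOP, muting period) pairs in ms.
# These values and the fairness-based reward are assumptions, not the paper's design.
ACTIONS = [(txop, mute) for txop in (4, 6, 8, 10) for mute in (2, 4, 6, 8)]
ALPHA, GAMMA, EPSILON = 0.1, 0.9, 0.1   # learning rate, discount, exploration rate
q_table = defaultdict(float)            # Q[(state, action)] -> estimated value


def fairness_reward(lte_tput, wifi_tput):
    """Illustrative reward: Jain's fairness index over the two throughputs."""
    total = lte_tput + wifi_tput
    if total == 0:
        return 0.0
    return total ** 2 / (2 * (lte_tput ** 2 + wifi_tput ** 2))


def select_action(state):
    """Epsilon-greedy choice of a (TXOP, muting period) combination."""
    if random.random() < EPSILON:
        return random.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: q_table[(state, a)])


def update(state, action, reward, next_state):
    """One tabular Q-learning update after observing the reward."""
    best_next = max(q_table[(next_state, a)] for a in ACTIONS)
    q_table[(state, action)] += ALPHA * (reward + GAMMA * best_next
                                         - q_table[(state, action)])
```

In each decision epoch the agent would call select_action, apply the chosen TXOP and muting configuration, measure the resulting LTE and Wi-Fi throughputs, compute fairness_reward, and call update before moving to the next epoch.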
Citations
Journal ArticleDOI
TL;DR: The fundamental concepts of supervised, unsupervised, and reinforcement learning are established, taking a look at what has been done so far in the adoption of ML in the context of mobile and wireless communication, and the promising approaches for how ML can contribute to supporting each target 5G network requirement are discussed.
Abstract: Driven by the demand to accommodate today’s growing mobile traffic, 5G is designed to be a key enabler and a leading infrastructure provider in the information and communication technology industry by supporting a variety of forthcoming services with diverse requirements. Considering the ever-increasing complexity of the network, and the emergence of novel use cases such as autonomous cars, industrial automation, virtual reality, e-health, and several intelligent applications, machine learning (ML) is expected to be essential to assist in making the 5G vision conceivable. This paper focuses on the potential solutions for 5G from an ML perspective. First, we establish the fundamental concepts of supervised, unsupervised, and reinforcement learning, taking a look at what has been done so far in the adoption of ML in the context of mobile and wireless communication, organizing the literature in terms of the types of learning. We then discuss the promising approaches for how ML can contribute to supporting each target 5G network requirement, emphasizing its specific use cases and evaluating the impact and limitations they have on the operation of the network. Lastly, this paper investigates the potential features of Beyond 5G (B5G), providing future research directions for how ML can contribute to realizing B5G. This article is intended to stimulate discussion on the role that ML can play to overcome the limitations for a wide deployment of autonomous 5G/B5G mobile and wireless communications.

249 citations


Cites background or methods from "A Q-Learning Scheme for Fair Coexis..."

  • ...In [80], Q-learning is applied for the fair coexistence between LTE and Wi-Fi in the unlicensed spectrum....

    [...]

  • ...The learning approach that accounts for the coexistence of LTE and LTE-U to model the resource allocation problem in LTE-U small base stations (SBSs) has been studied in [80], [95]....

    [...]

Proceedings ArticleDOI
25 Jun 2019
TL;DR: An end-to-end automatic CDB tuning system, CDBTune, using deep reinforcement learning (RL), which enables end-to-end learning, accelerates the convergence speed of the model, and improves the efficiency of online tuning.
Abstract: Configuration tuning is vital to optimize the performance of a database management system (DBMS). It becomes more tedious and urgent for cloud databases (CDB) due to the diverse database instances and query workloads, which make the database administrator (DBA) incompetent. Although there are some studies on automatic DBMS configuration tuning, they have several limitations. Firstly, they adopt a pipelined learning model but cannot optimize the overall performance in an end-to-end manner. Secondly, they rely on large-scale high-quality training samples which are hard to obtain. Thirdly, there are a large number of knobs that are in continuous space and have unseen dependencies, and they cannot recommend reasonable configurations in such a high-dimensional continuous space. Lastly, in a cloud environment, they can hardly cope with changes in hardware configurations and workloads, and have poor adaptability. To address these challenges, we design an end-to-end automatic CDB tuning system, CDBTune, using deep reinforcement learning (RL). CDBTune utilizes the deep deterministic policy gradient method to find the optimal configurations in high-dimensional continuous space. CDBTune adopts a trial-and-error strategy to learn knob settings with a limited number of samples to accomplish the initial training, which alleviates the difficulty of collecting massive high-quality samples. CDBTune adopts the reward-feedback mechanism in RL instead of traditional regression, which enables end-to-end learning, accelerates the convergence speed of the model, and improves the efficiency of online tuning. We conducted extensive experiments under 6 different workloads on real cloud databases to demonstrate the superiority of CDBTune. Experimental results showed that CDBTune had good adaptability and significantly outperformed the state-of-the-art tuning tools and DBA experts.

197 citations


Cites background or methods from "A Q-Learning Scheme for Fair Coexis..."

  • ...As a result, applying Q-Learning to database configuration tuning is impractical....

    [...]

  • ...According to the Q-Learning algorithm, V_{t+1} is multiplied by the discount factor γ and added to the reward at time t, which gives an estimate of the value V′_t of the current state s_t....

    [...]

  • ...Q-Learning is effective in a relatively small state space....

    [...]

  • ...Q-Learning....

    [...]

  • ...Nevertheless, DQN still adopts Q-Learning to update the Q-value, so we can describe the relationship between them as follows: Q(s,a,ω) → Q(s,a), where ω in Q(s,a,ω) represents the weights of the neural network in DQN....

    [...]
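The excerpts above describe how DQN keeps the Q-learning update but replaces the table Q(s,a) with a parameterized function Q(s,a,ω). A minimal PyTorch illustration of that relationship follows; the state dimension, network size, and hyperparameters are assumptions made for the example, not details from either paper.

```python
import torch
import torch.nn as nn

# Q(s, ·; ω): a small network mapping a state to one value per action,
# standing in for the tabular Q(s, a). Dimensions are illustrative.
STATE_DIM, N_ACTIONS, GAMMA = 4, 8, 0.9

q_net = nn.Sequential(
    nn.Linear(STATE_DIM, 64),
    nn.ReLU(),
    nn.Linear(64, N_ACTIONS),
)
optimizer = torch.optim.Adam(q_net.parameters(), lr=1e-3)


def dqn_step(state, action, reward, next_state, done):
    """One gradient step toward the Q-learning target r + γ max_a' Q(s', a'; ω).

    `state` and `next_state` are 1-D float tensors of length STATE_DIM.
    """
    with torch.no_grad():
        target = reward + (1.0 - done) * GAMMA * q_net(next_state).max()
    prediction = q_net(state)[action]       # Q(s, a; ω)
    loss = (prediction - target) ** 2       # squared temporal-difference error
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()


# Usage example with random tensors standing in for observed transitions.
s, s2 = torch.randn(STATE_DIM), torch.randn(STATE_DIM)
dqn_step(s, action=3, reward=1.0, next_state=s2, done=0.0)
```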

Journal ArticleDOI
01 Aug 2019
TL;DR: A query-aware database tuning system QTune with a deep reinforcement learning (DRL) model, which can efficiently and effectively tune the database configurations based on both the query vector and database states, and which outperforms the state-of-the-art tuning methods.
Abstract: Database knob tuning is important to achieve high performance (e.g., high throughput and low latency). However, knob tuning is an NP-hard problem and existing methods have several limitations. First, DBAs cannot tune a lot of database instances in different environments (e.g., different database vendors). Second, traditional machine-learning methods either cannot find good configurations or rely on a lot of high-quality training examples which are rather hard to obtain. Third, they only support coarse-grained tuning (e.g., workload-level tuning) but cannot provide fine-grained tuning (e.g., query-level tuning). To address these problems, we propose a query-aware database tuning system QTune with a deep reinforcement learning (DRL) model, which can efficiently and effectively tune the database configurations. QTune first featurizes the SQL queries by considering rich features of the SQL queries. Then QTune feeds the query features into the DRL model to choose suitable configurations. We propose a Double-State Deep Deterministic Policy Gradient (DS-DDPG) model to enable query-aware database configuration tuning, which utilizes the actor-critic networks to tune the database configurations based on both the query vector and database states. QTune provides three database tuning granularities: query-level, workload-level, and cluster-level tuning. We deployed our techniques onto three real database systems, and experimental results show that QTune achieves high performance and outperforms the state-of-the-art tuning methods.

150 citations


Cites methods from "A Q-Learning Scheme for Fair Coexis..."

  • ...Note that existing DRL models [16, 19, 12] cannot utilize the query features as they ignore the query's effects on the environment state, and we propose a Double-State Deep Deterministic Policy Gradient (DS-DDPG) model to enable query-aware tuning....

    [...]

Journal ArticleDOI
TL;DR: An intelligent algorithm for network slices is proposed based on Monte Carlo tree search with a new cross-entropy metric, which is able to allocate resources to match the traffic load in the time-space domain.
Abstract: Modern transportation systems are facing a sharp transformation since the Internet of Vehicles (IoV) has activated intense information exchange among vehicles, infrastructure, and pedestrians. Existing approaches fail to efficiently handle the heterogeneous network traffic because of the complicated network environment and dynamic vehicle density. Recently, the fog-radio access network with network slicing has emerged as a promising solution to fulfill the demands of the maldistributed network traffic. However, available fog resources as well as network traffic are all dynamic and unpredictable due to the high mobility of vehicles, which results in poor resource utilization. To address this problem, we propose a smart slice scheduling scheme for vehicular fog radio access networks. This scheduling scheme is formulated as a Markov decision process. Accordingly, an intelligent algorithm for network slices is proposed based on Monte Carlo tree search with a new cross-entropy metric, which is able to allocate resources to match the traffic load in the time-space domain. This slice scheduling algorithm does not require any prior knowledge of the network traffic. Furthermore, this paper first reveals the relationship between road traffic and the IoV resource based on the perception-reaction time metric. A collaborative scheduling scheme is proposed to tune the road traffic speed to further release available IoV resources under heavy traffic load. Simulation results indicate that the proposed algorithm outperforms several baselines in terms of throughput and delay with low complexity.

53 citations


Cites background from "A Q-Learning Scheme for Fair Coexis..."

  • ...ς < 1 represents a discount factor [28], and an action a_i at time i is an element of the set {C_j+ = 1, C_j− = 1, F_j+ = 1, F_j− = 1}....

    [...]

Journal ArticleDOI
TL;DR: A convolutional neural network (CNN) is proposed that is trained to perform identification of LTE and Wi-Fi transmissions and can identify the hidden terminal effect caused by multiple LTE transmissions, multiple Wi-Fi transmissions, or concurrent LTE and Wi-Fi broadcasts.
Abstract: Over the last years, the ever-growing wireless traffic has pushed the mobile community to investigate solutions that can assist in more efficient management of the wireless spectrum. Towards this direction, the long-term evolution (LTE) operation in the unlicensed spectrum has been proposed. Targeting a global solution that respects the regional requirements, 3GPP announced the standard of LTE licensed assisted access (LAA). However, LTE LAA may result in unfair coexistence with Wi-Fi, especially when Wi-Fi does not use frame aggregation. Targeting a technique that enables fair channel access, the mLTE-U scheme has been proposed. According to mLTE-U, LTE uses a variable transmission opportunity, followed by a variable muting period that can be exploited by other networks to transmit. For the selection of the appropriate mLTE-U configuration, information about the dynamically changing wireless environment is required. To this end, this paper proposes a convolutional neural network (CNN) that is trained to perform identification of LTE and Wi-Fi transmissions. In addition, it can identify the hidden terminal effect caused by multiple LTE transmissions, multiple Wi-Fi transmissions, or concurrent LTE and Wi-Fi transmissions. The designed CNN has been trained and validated using commercial off-the-shelf LTE and Wi-Fi hardware equipment and for two wireless signal representations, namely, in-phase and quadrature samples and a frequency-domain representation through the fast Fourier transform. The classification accuracy of the two resulting CNNs is tested for different signal-to-noise ratio values. The experimentation results show that the data representation affects the accuracy of the CNN. The information obtained from the CNN can be exploited by the mLTE-U scheme in order to provide fair coexistence between the two wireless technologies.
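The abstract above describes a CNN classifier trained on either raw in-phase/quadrature (IQ) samples or their FFT representation. The following PyTorch sketch shows what such a 1-D convolutional classifier over IQ windows might look like; the window length, layer sizes, and the four example classes are assumptions for illustration, not the architecture from the paper.

```python
import torch
import torch.nn as nn

# Illustrative 1-D CNN for technology identification from IQ samples
# (2 input channels: in-phase and quadrature). The 128-sample window and
# the 4 hypothetical classes (LTE only, Wi-Fi only, concurrent, idle)
# are assumptions, not the paper's design.
N_CLASSES, WINDOW = 4, 128

classifier = nn.Sequential(
    nn.Conv1d(2, 32, kernel_size=7, padding=3),   # 2 channels: I and Q
    nn.ReLU(),
    nn.MaxPool1d(2),
    nn.Conv1d(32, 64, kernel_size=5, padding=2),
    nn.ReLU(),
    nn.AdaptiveAvgPool1d(1),                       # collapse the time axis
    nn.Flatten(),
    nn.Linear(64, N_CLASSES),                      # logits per class
)

# Usage: a batch of 8 IQ windows -> class logits.
iq_batch = torch.randn(8, 2, WINDOW)
logits = classifier(iq_batch)
print(logits.shape)  # torch.Size([8, 4])
```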

41 citations


Cites background or methods from "A Q-Learning Scheme for Fair Coexis..."

  • ...In [10] and [12], we assumed that the information of the...

    [...]

  • ...In [12], we further extended our previous work by introducing a Q-learning procedure that is able to provide automatic and autonomous selection of the appropriate TXOP and muting period combinations that can enable fair coexistence between the co-located networks....

    [...]

References
Book
01 Jan 1998
TL;DR: This book provides a clear and simple account of the key ideas and algorithms of reinforcement learning, which ranges from the history of the field's intellectual foundations to the most recent developments and applications.
Abstract: Reinforcement learning, one of the most active research areas in artificial intelligence, is a computational approach to learning whereby an agent tries to maximize the total amount of reward it receives when interacting with a complex, uncertain environment. In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the key ideas and algorithms of reinforcement learning. Their discussion ranges from the history of the field's intellectual foundations to the most recent developments and applications. The only necessary mathematical background is familiarity with elementary concepts of probability. The book is divided into three parts. Part I defines the reinforcement learning problem in terms of Markov decision processes. Part II provides basic solution methods: dynamic programming, Monte Carlo methods, and temporal-difference learning. Part III presents a unified view of the solution methods and incorporates artificial neural networks, eligibility traces, and planning; the two final chapters present case studies and consider the future of reinforcement learning.

37,989 citations

Journal ArticleDOI
TL;DR: In this article, it is shown that Q-learning converges to the optimum action-values with probability 1 so long as all actions are repeatedly sampled in all states and the action values are represented discretely.
Abstract: Q-learning (Watkins, 1989) is a simple way for agents to learn how to act optimally in controlled Markovian domains. It amounts to an incremental method for dynamic programming which imposes limited computational demands. It works by successively improving its evaluations of the quality of particular actions at particular states. This paper presents and proves in detail a convergence theorem for Q-learning based on that outlined in Watkins (1989). We show that Q-learning converges to the optimum action-values with probability 1 so long as all actions are repeatedly sampled in all states and the action-values are represented discretely. We also sketch extensions to the cases of non-discounted, but absorbing, Markov environments, and where many Q values can be changed each iteration, rather than just one.
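For reference, the update rule whose convergence is established here is the standard one-step Q-learning rule; the notation below (learning rate α_t, discount factor γ, reward r_t) is the common textbook form rather than the paper's exact symbols.

```latex
Q_{t+1}(s_t, a_t) = Q_t(s_t, a_t)
    + \alpha_t \Bigl[ r_t + \gamma \max_{a'} Q_t(s_{t+1}, a') - Q_t(s_t, a_t) \Bigr]
```

Under the conditions stated in the abstract (every action repeatedly sampled in every state, discretely represented action-values) and suitably decaying learning rates α_t, the iterates Q_t converge to the optimal action-values with probability 1.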

3,294 citations

01 Jan 1993

2,697 citations

Proceedings ArticleDOI
09 Jun 2013
TL;DR: This paper considers two of the most prominent wireless technologies available today, namely Long Term Evolution (LTE) and WiFi, addresses some problems that arise from their coexistence in the same band, and proposes a simple coexistence scheme that reuses the concept of almost blank subframes in LTE.
Abstract: The recent development of regulatory policies that permit the use of TV bands spectrum on a secondary basis has motivated discussion about coexistence of primary (e.g. TV broadcasts) and secondary users (e.g. WiFi users in TV spectrum). However, much less attention has been given to coexistence of different secondary wireless technologies in the TV white spaces. Lack of coordination between secondary networks may create severe interference situations, resulting in less efficient usage of the spectrum. In this paper, we consider two of the most prominent wireless technologies available today, namely Long Term Evolution (LTE) and WiFi, and address some problems that arise from their coexistence in the same band. We perform exhaustive system simulations and observe that WiFi is hampered much more significantly than LTE in coexistence scenarios. A simple coexistence scheme that reuses the concept of almost blank subframes in LTE is proposed, and it is observed that it can improve the WiFi throughput per user by up to 50 times in the studied scenarios.

324 citations


"A Q-Learning Scheme for Fair Coexis..." refers background in this paper

  • ...[16] propose a coexistence scheme that exploits periodically blank LTE subframes during an LTE frame in order to give transmission opportunities to Wi-Fi....

    [...]

Proceedings ArticleDOI
25 Oct 2012
TL;DR: This paper investigates deploying LTE on a license-exempt band as part of the pico-cell underlay and shows that LTE can deliver significant capacity even while sharing the spectrum with WiFi systems.
Abstract: Mobile broadband data usage in Long Term Evolution (LTE) networks is growing exponentially and capacity constraints are becoming an issue. Heterogeneous networks, WiFi offload, and the acquisition of additional radio spectrum can be used to address this capacity constraint. Licensed spectrum, however, is limited and can be costly to obtain. This paper investigates deploying LTE on a license-exempt band as part of the pico-cell underlay. Coexistence mechanisms and other modifications to LTE are discussed. Performance analysis shows that LTE can deliver significant capacity even while sharing the spectrum with WiFi systems.

211 citations


"A Q-Learning Scheme for Fair Coexis..." refers background in this paper

  • ...Several other studies [13] [14] [15] evaluate the impact of LTE on Wi-Fi through experiments, mathematical models and simulations, all coming to the same conclusion, namely that coexistence mechanisms are required to render LTE fair towards other co-located technologies, like Wi-Fi....

    [...]
