scispace - formally typeset
Search or ask a question

Showing papers by "Celimuge Wu published in 2021"


Journal ArticleDOI
TL;DR: A multibeam satellite IIoT in Ka-band is proposed to realize wide-area coverage and long-distance transmissions, which uses nonorthogonal multiple access (NOMA) for each beam to improve transmission rate.
Abstract: The traditional ground industrial Internet of Things (IIoT) cannot supply wireless interconnections anywhere due to its small-scale communication coverage. In this article, a multibeam satellite IIoT in Ka-band is proposed to realize wide-area coverage and long-distance transmissions, which uses nonorthogonal multiple access (NOMA) for each beam to improve transmission rate. To guarantee Quality of Service (QoS) for the satellite IIoT, the beam power is optimized to match the theoretical transmission rate with the service rate. The NOMA transmission rate for each beam is maximized by optimizing the power allocation proportion of each node subject to the constraints of the total power for the beam and the minimal transmission rate for each node within the beam. Satellite-ground integrated IIoT is proposed to use the ground cellular network to supplement the satellite coverage in the blocked areas. The power allocation and network selection for the integrated IIoT are proposed to decrease the transmission cost. Simulation results are provided to validate the superiority of employing NOMA in the satellite IIoT and show higher transmission performance for the QoS-guarantee resource allocation.

209 citations


Journal ArticleDOI
TL;DR: A reinforcement learning-based multislot double-threshold spectrum sensing with Bayesian fusion is proposed to sense industrial big spectrum data, which can find required idle channels faster while guaranteeing spectrum sensing performance.
Abstract: With the rapid increase of industrial systems, industrial spectrum is stepping into the era of big data, and at the same time spectrum resources are facing serious shortage. Cognitive industrial system (CIS) based on cognitive radio can improve spectrum utilization by accessing the idle spectrum licensed to primary user. However, the CIS must find enough idle channels by performing spectrum sensing. In this article, a reinforcement learning-based multislot double-threshold spectrum sensing with Bayesian fusion is proposed to sense industrial big spectrum data, which can find required idle channels faster while guaranteeing spectrum sensing performance. Double thresholds are set to guarantee both high detection probability and spectrum access probability, and weighed energy detection is proposed to maximize detection probability when the energy statistic falls into the confusion area between the double thresholds. Bayesian fusion is proposed to get a final decision on the channel availability by combining the local sensing decisions of all the time slots. A prediction and selection algorithm for idle channels is proposed to predict the idle probability of each channel and find required idle channels from the sorted channel set. From simulation results, the proposed spectrum sensing scheme outperforms cooperative spectrum sensing and energy detection, which can predict idle channels accurately and get needed idle channels with fewer sensing operations.

72 citations


Journal ArticleDOI
TL;DR: In this paper, a distributed learning framework is proposed to address the technical challenges arising from the uncertainties and the sharing of limited resource in an MEC system, and the computation offloading problem is formulated as a multi-agent Markov decision process.
Abstract: Facing the trend of merging wireless communications and multi-access edge computing (MEC), this article studies computation offloading in beyond fifth generation networks. To address the technical challenges originating from the uncertainties and the sharing of limited resource in an MEC system, we formulate the computation offloading problem as a multi-agent Markov decision process, for which a distributed learning framework is proposed. We present a case study on resource orchestration in computation offloading to showcase the potential of an online distributed reinforcement learning algorithm developed under the proposed framework. Experimental results demonstrate that our learning algorithm outperforms the benchmark resource orchestration algorithms. Furthermore, we outline the research directions worth in-depth investigation to minimize the time cost, which is one of the main practical issues that prevent the implementation of the proposed distributed learning framework.

54 citations


Journal ArticleDOI
TL;DR: The challenges of and proposed solutions to wireless transmission systems of point cloud video, which is the most popular and favored way to represent volumetric media and significantly differs from the other types of videos are responded to.
Abstract: Volumetric video (or hologram video), the medium for representing natural content in VR/AR/MR, is presumably the next generation of video technology and a typical use case for 5G and beyond wireless communications. To realize volumetric video applications, efficient volumetric video streaming is in critical demand. This article responds to the challenges of and proposes solutions to wireless transmission systems of point cloud video, which is the most popular and favored way to represent volumetric media and significantly differs from the other types of videos. In particular, we first introduce point cloud video technology and its applications, and then discuss the challenges of and solutions to point cloud video streaming, including encoding, tiling, viewing angle prediction, decoding, quality assessment and transmission optimization. Furthermore, we explain a prototype of a MPEG DASH-based point cloud video streaming system as a preliminary study, along with more simulation results to verify its performance. Finally, we identify future research directions for providing high-quality point cloud video streaming.

53 citations


Posted ContentDOI
TL;DR: An aerial video streaming enabled cooperative computing solution namely, UAVideo, which streams videos from a UAV to ground servers, which minimizes the video streaming time under the constraints on UAV trajectory, video features, and communications resources is proposed.
Abstract: Unmanned aerial vehicle (UAV) systems are of increasing interest to academia and industry due to their mobility, flexibility, and maneuverability, and are an effective alternative to various uses such as surveillance and mobile edge computing. However, due to their limited computational and communications resources, it is difficult to serve all computation tasks simultaneously. This article tackles this problem by first proposing a scalable aerial computing solution, which is applicable for computation tasks of multiple quality levels, corresponding to different computation workloads and computation results of distinct performance. It opens up the possibility to maximally improve the overall computing performance with limited computational and communications resources. To meet the demands for timely video analysis that exceed the computing power of a UAV, we propose an aerial video streaming enabled cooperative computing solution, namely, UAVideo, which streams videos from a UAV to ground servers. As a complement to scalable aerial computing, UAVideo minimizes the video streaming time under the constraints on UAV trajectory, video features, and communications resources. Simulation results reveal the substantial advantages of the proposed solutions. Furthermore, we highlight relevant directions for future research.

35 citations


Journal ArticleDOI
TL;DR: In this paper, the authors proposed an edge computing-based joint client selection and networking scheme for vehicular IoT, which assigns some vehicles as edge vehicles by employing a distributed approach, and uses the edge vehicles as FL clients to conduct the training of local models, which learns optimal behaviors based on the interaction with environments.
Abstract: In order to support advanced vehicular Internet-of-Things (IoT) applications, information exchanges among different vehicles are required to find efficient solutions for catering to different application requirements in complex and dynamic vehicular environments. Federated learning (FL), which is a type of distributed learning technology, has been attracting great interest in recent years as it performs knowledge exchange among different network entities without a violation of user privacy. However, client selection and networking scheme for enabling FL in dynamic vehicular environments, which determines the communication delay between FL clients and the central server that aggregates the models received from the clients, is still under-explored. In this paper, we propose an edge computing-based joint client selection and networking scheme for vehicular IoT. The proposed scheme assigns some vehicles as edge vehicles by employing a distributed approach, and uses the edge vehicles as FL clients to conduct the training of local models, which learns optimal behaviors based on the interaction with environments. The clients also work as forwarder nodes in information sharing among network entities. The client selection takes into account the vehicle velocity, vehicle distribution, and the wireless link connectivity between vehicles using a fuzzy logic algorithm, resulting in an efficient learning and networking architecture. We use computer simulations to evaluate the proposed scheme in terms of the communication overhead and the information covered in learning.

33 citations


Journal ArticleDOI
TL;DR: In this paper, the authors proposed a virtual edge formation algorithm that considers both the stability of virtual edge and the computational resources available at the vehicles constituting the virtual edge to facilitate collaborative vehicular edge computing.
Abstract: Vehicular edge computing (VEC) has been a new paradigm to support computation-intensive and latency-sensitive services. However, the scarcity of computational resources is still a challenge. Making efficient use of sporadic idle computational resources on smart vehicles in the vicinity to extend the resource capability of each vehicle is an important research issue. In this paper, we propose Virtual Edge, which is an efficient scheme to utilize free computational resources of multiple vehicles as a virtual server to facilitate collaborative vehicular edge computing. We design a virtual edge formation algorithm that considers both the stability of virtual edge and the computational resources available at the vehicles constituting the virtual edge. The prediction of the link duration between vehicles reduces the number of computation offloading failures caused by unexpected link disconnections. Extensive simulations with realistic vehicle movements are conducted to show the advantage of the proposed scheme over existing baselines in terms of the completion ratio of computation offloading tasks and average task execution time.

32 citations


Journal ArticleDOI
TL;DR: An air-ground integrated multi-access edge computing system, which is deployed by an infrastructure provider (InP), is investigated and an online deep reinforcement learning (RL) scheme that adopts two separate double deep Q-networks to approximate the Q-Factor and the post-decision Q-factor is developed.
Abstract: This paper investigates an air-ground integrated multi-access edge computing system, which is deployed by an infrastructure provider (InP). Under a business agreement with the InP, a third-party service provider provides computing services to the subscribed mobile users (MUs). MUs compete for the shared spectrum and computing resources over time to achieve their distinctive goals. From the perspective of an MU, we deliberately define the age of update to capture the staleness of information from refreshing computation outcomes. Given the system dynamics, we model the interactions among MUs as a stochastic game. In the Nash equilibrium without cooperation, each MU behaves in accordance with the local system states and conjectures. We can hence transform the stochastic game into a single-agent Markov decision process. As another major contribution, we develop an online deep reinforcement learning (RL) scheme that adopts two separate double deep Q-networks to approximate the Q-factor and the post-decision Q-factor, respectively. The deep RL scheme allows each MU to optimize the behaviours with unknown dynamic statistics. Numerical experiments show that our proposed scheme outperforms the baselines in terms of the average utility under various system conditions.

31 citations


Journal ArticleDOI
TL;DR: This work proposes a channel access method for the D2D-U pairs on unlicensed channels and proposes a decentralized joint spectrum and power allocation scheme that can guarantee the global minimization of power consumption across the D1D/U pairs.
Abstract: Unlike the conventional device-to-device (D2D) networks, the unlicensed D2D (D2D-U) pairs can not only reuse the licensed channels with the base station (BS) but also share the unlicensed channels with the WiFi stations. One challenge arises from the fact that the co-channel interference on licensed channels and the collision probability on unlicensed channels may cause extra power consumption at the terminals. Accordingly, we first propose a channel access method for the D2D-U pairs on unlicensed channels. Then, a decentralized joint spectrum and power allocation scheme is designed to minimize the power consumption at D2D-U pairs. Different from the existing distributed schemes, the proposed scheme can guarantee the global minimization of power consumption across the D2D-U pairs. Simulation results validate the theoretical analysis and verify the performance from the proposed scheme.

27 citations


Journal ArticleDOI
26 Jan 2021
TL;DR: In this article, the authors proposed a routing protocol for UAV-assisted VDTNs, which considers both the encounter probability and the persistent connection time between mobile nodes for each encounter.
Abstract: Vehicular delay tolerant networks (VDTNs) enable information sharing among mobile nodes in scenarios where cellular base stations are unavailable and the connections between the mobile nodes are intermittent. While unmanned aerial vehicles (UAVs) have shown to improve the performance of VDTNs, existing routing protocols, such as PRoPHET, consider the encounter probability between mobile nodes only, which does not fully address the characteristics of UAVs in route selection. In this paper, we propose a routing protocol, which considers both the encounter probability and the persistent connection time between mobile nodes for each encounter, in UAV-assisted VDTNs. By introducing the persistent connection time in route selection, the proposed protocol is able to evaluate the stability of a communication link more accurately in the UAV-assisted VDTN environment. The proposed protocol is evaluated using realistic simulations by comparing it with existing baselines. The simulation results verify that the proposed protocol can improve the reliability of message forwarding while reducing network overhead and end-to-end delay.

24 citations


Journal ArticleDOI
02 Apr 2021
TL;DR: In this paper, the authors proposed a multi-channel blockchain scheme that can use the best parameters in accordance with the vehicle density, where each channel is optimized for a certain vehicle density level.
Abstract: With the development of advanced information and communication technology, the traditional centralized service model alone no longer meets the increasing demand of data exchange in intelligent transportation systems (ITS). While Internet of Vehicles (IoV) technology has been introduced to achieve more advanced ITS, there are still some unsettled issues such as flexibility and fault tolerance. The conventional centralized approach for ITS is vulnerable to the single point of failure, and lack of flexibility due to its dependence on a trusted third party (TTP). The emergence of blockchain technology provides a potential direction to address these problems. However, due to varying vehicle densities, it is challenging to select the best blockchain parameters to satisfy the application requirements. In this paper, we propose a multi-channel blockchain scheme that can use the best parameters in accordance with the vehicle density. The proposed scheme first defines multiple blockchain channels where each channel is optimized for a certain vehicle density level. Then, the system selects the best channel according to the vehicle density, and the application requirements on the transaction throughput and latency. We use extensive simulations to show that the proposed blockchain scheme achieves a significantly better performance as compared with existing baselines.

Journal ArticleDOI
TL;DR: In this paper, a learning-based wireless access control approach for edge-aided disaster response network is proposed, where the authors model the access control procedure as a discrete-time single agent Markov decision process, and solve the problem by exploiting deep reinforcement learning technique.
Abstract: The communication infrastructure is most likely to be damaged after a major disaster occurred, which would lead to further chaos in the disaster stricken area. Modern rescue activities heavily rely on the wireless communications, such as safety status report, disrupted area monitoring, evacuation instruction, rescue coordination, etc. Large amount of data generated from victims, sensors and responders must be delivered and processed in a fast and reliable way, even when the normal communication infrastructure is degraded or destroyed. To this end, reconstructing the post-disaster network by deploying MDRU (Movable and Deployable Resource Unit) and relay unit at edge is a very promising solution. However, the optimal wireless access control in this heterogeneous hastily formed network is extremely challenging, due to the frequent varying environment and the lack of statistics information in advance in post-disaster scenarios. In this paper, we propose a learning based wireless access control approach for edge-aided disaster response network. More specifically, we model the wireless access control procedure as a discrete-time single agent Markov decision process, and solve the problem by exploiting deep reinforcement learning technique. By extensive simulation results, we show that the proposed mechanism significantly outperforms the baseline schemes in terms of delay and packet drop rate.

Journal ArticleDOI
15 Apr 2021-Sensors
TL;DR: In this article, a brief survey of MPTCP studies is presented and a brief overview of multipath routing in vehicular networks is discussed. And the significance technical challenges in applying MPTCPs for vehicular network and point out future research directions.
Abstract: Multipath TCP (MPTCP) is one of the most important extensions to TCP that enables the use of multiple paths in data transmissions for a TCP connection. In MPTCP, the end hosts transmit data across a number of TCP subflows simultaneously on one connection. MPTCP can sufficiently utilize the bandwidth resources to improve the transmission efficiency while providing TCP fairness to other TCP connections. Meanwhile, it also offers resilience due to multipath data transfers. MPTCP attracts tremendous attention from the academic and industry field due to the explosive data growth in recent times and limited network bandwidth for each single available communication interface. The vehicular Internet-of-Things systems, such as cooperative autonomous driving, require reliable high speed data transmission and robustness. MPTCP could be a promising approach to solve these challenges. In this paper, we first conduct a brief survey of existing MPTCP studies and give a brief overview to multipath routing. Then we discuss the significance technical challenges in applying MPTCP for vehicular networks and point out future research directions.

Journal ArticleDOI
TL;DR: In this paper, the authors present a survey of literature to understand how augmented intelligence has been applied in the literature, including the roles of HI and AI, AI approaches, features, and applications.
Abstract: Augmented intelligence (AuI) integrates human intelligence (HI) and artificial intelligence (AI) to harness their strengths and mitigate their weaknesses. The combination of HI and AI has seen to improve both human and machine capabilities, and achieve a better performance compared to separate HI and AI approaches. In this paper, we present a survey of literature to understand how AuI has been applied in the literature, including the roles of HI and AI, AI approaches, features, and applications. Due to the limited literature related to this topic, we also present a survey of expert opinion to answer four main questions to understand the experts’ implications of AuI, including: a) the definition of AuI and the significance of HI in AuI; b) the roles of HI in AuI; c) the current and future applications of AuI in research, industry, and public, as well as the advantages and shortcomings of AuI; and d) end users’ view of the application of AuI. We also present recommendations to improve AuI, and provide a comparison between the findings from the surveys of both literature and expert opinion. The discussion of this paper shows the promising potential of AuI compared to separate HI and AI approaches.

Journal ArticleDOI
TL;DR: To improve the spectrum efficiency on the unlicensed bands and fit its distributed structure while ensuring the fairness among D2D-U links and the harmonious coexistence with WiFi networks, a distributed joint power and spectrum scheme is proposed.
Abstract: In this paper, a Device-to-Device communication on unlicensed bands (D2D-U) enabled network is studied. To improve the spectrum efficiency (SE) on the unlicensed bands and fit its distributed structure while ensuring the fairness among D2D-U links and the harmonious coexistence with WiFi networks, a distributed joint power and spectrum scheme is proposed. In particular, a parameter, named as price, is defined, which is updated at each D2D-U pair by a online trained Neural network (NN) according to the channel state and traffic load. In addition, the parameters used in the NN are updated by two ways, unsupervised self-iteration and federated learning, to guarantee the fairness and harmonious coexistence. Then, a non-convex optimization problem with respect to the spectrum and power is formulated and solved on each D2D-U link to maximize its own data rate. Numerical simulation results are demonstrated to verify the effectiveness of the proposed scheme.

Journal ArticleDOI
21 Jan 2021-Sensors
TL;DR: In this paper, the authors quantitatively investigate the impacts of node selfishness caused by energy depletion in MANETs in terms of packet loss rate, round-trip delay, and throughput.
Abstract: Cooperative communication and resource limitation are two main characteristics of mobile ad hoc networks (MANETs). On one hand, communication among the nodes in MANETs highly depends on the cooperation among nodes because of the limited transmission range of the nodes, and multi-hop communications are needed in most cases. On the other hand, every node in MANETs has stringent resource constraints on computations, communications, memory, and energy. These two characteristics lead to the existence of selfish nodes in MANETs, which affects the network performance in various aspects. In this paper, we quantitatively investigate the impacts of node selfishness caused by energy depletion in MANETs in terms of packet loss rate, round-trip delay, and throughput. We conducted extensive measurements on a proper simulation platform incorporating an OMNeT++ and INET Framework. Our experimental results quantitatively indicate the impact of node selfishness on the network performance in MANETs. The results also imply that it is important to evaluate the impact of node selfishness by jointly considering selfish nodes’ mobility models, densities, proportions, and combinations.

Journal ArticleDOI
TL;DR: In this paper, two types of selfish nodes, namely static selfish nodes and dynamic selfish nodes are defined and quantitatively investigated from various aspects including mobilities, proportions, densities, and combinations.
Abstract: Vehicular ad hoc network is a kind of mobile ad hoc networks which provides wireless communication between vehicles. In most cases, multi-hop communication is needed, because of the limited range of wireless transmission. The multi-hop communication among nodes strictly relies on the forwarding functionality of intermediate nodes. Due to resource limitation, the intermediate nodes may exhibit selfishness and refuse to bear forwarding tasks for others. In this article, we defined two types of selfish nodes, namely static selfish nodes and dynamic selfish nodes. The impact of the two types of selfish nodes are quantitatively investigated from various aspects including mobilities, proportions, densities, and combinations. We conducted exhaustive simulations on an integrated simulation platform which consists of OMNeT++, SUMO, INET, and Veins. The experimental results indicate that the static selfish nodes have more harmful impacts on the performance of vehicular ad hoc networks in terms of average packet delivery ratios and end-to-end delays. Moreover, the results also imply that the impact of node selfishness should be evaluated by a comprehensive consideration of mobilities, proportions, densities, and combinations of selfish nodes.

Journal ArticleDOI
01 Sep 2021
TL;DR: In this paper, a cyber-physical design for an indoor temperature monitoring system using a wireless sensor network is discussed, where all sensors dynamically adjust the sleep/wake duty cycles and select the optimal anycast routing according to the sensed temperature.
Abstract: Ambient temperature is closely related to human health. Maintaining a comfortable and stable temperature is crucial for the treatment, rehabilitation and daily e-health care of patients, where the Internet of Things has been widely applied for this purpose. However, the existing research always separates the network design from temperature monitoring and control. In order to provide better e-health services, we design a solution to minimize the cost of energy consumption and delay. This work discusses a cyber-physical design for an indoor temperature monitoring system using a wireless sensor network. All sensors dynamically adjust the sleep/wake duty cycles and select the optimal anycast routing according to the sensed temperature. In addition, IoT-enabled heating, ventilation, and air conditioner (HVAC) systems for indoor temperature control have attracted unprecedented attention. HVAC can be connected to the Internet for weather and time-varying electricity price data. This work discusses the problem of minimizing the total energy cost of the HVAC system while keeping the indoor temperature within a prefixed range. The key idea is to leverage the time-varying electricity prices and do precooling/preheating using HVAC. The simulation results show that our temperature monitoring and control algorithm outperforms other heuristic schemes.

Journal ArticleDOI
TL;DR: Simulation results demonstrate that compared with the conventional Kalman Filter based detection mechanism, the proposed approach has lower complexity and can achieve a more stable and accurate estimation of the number of WiFi users.
Abstract: The unlicensed spectrum has been utilized to make up the shortage on frequency spectrum in new radio (NR) systems To fully exploit the advantages brought by the unlicensed bands, one of the key issues is to guarantee the fair coexistence with WiFi systems To reach this goal, timely and accurate estimation on the WiFi traffic loads is an important prerequisite In this paper, a machine learning (ML) based method is proposed to detect the number of WiFi users on the unlicensed bands An unsupervised Neural Network (NN) structure is applied to filter the detected transmission collision probability on the unlicensed spectrum, which enables the NR users to precisely rectify the measurement error and estimate the number of active WiFi users Moreover, NN is trained online and the related parameters and learning rate of NN are jointly optimized to estimate the number of WiFi users adaptively with high accuracy Simulation results demonstrate that compared with the conventional Kalman Filter based detection mechanism, the proposed approach has lower complexity and can achieve a more stable and accurate estimation

Posted Content
TL;DR: In this paper, the authors studied the scenario where multiple robots cooperate to accomplish the time-critical tasks, where an intelligent master robot (MR) acts as an edge server to provide services to multiple slave robots (SRs) and the SRs are responsible for the environment sensing and data collection.
Abstract: Mobile edge computing (MEC) deployment in a multi-robot cooperation (MRC) system is an effective way to accomplish the tasks in terms of energy consumption and implementation latency. However, the computation and communication resources need to be considered jointly to fully exploit the advantages brought by the MEC technology. In this paper, the scenario where multi robots cooperate to accomplish the time-critical tasks is studied, where an intelligent master robot (MR) acts as an edge server to provide services to multiple slave robots (SRs) and the SRs are responsible for the environment sensing and data collection. To save energy and prolong the function time of the system, two schemes are proposed to optimize the computation and communication resources, respectively. In the first scheme, the energy consumption of SRs is minimized and balanced while guaranteeing that the tasks are accomplished under a time constraint. In the second scheme, not only the energy consumption, but also the remaining energies of the SRs are considered to enhance the robustness of the system. Through the analysis and numerical simulations, we demonstrate that even though the first policy may guarantee the minimization on the total SRs' energy consumption, the function time of MRC system by the second scheme is longer than that by the first one.

Journal ArticleDOI
TL;DR: This paper forms the network performance of D2D-U and Wi-Fi under two different coexistence schemes, namely, listen before talk (LBT) and duty cycle mechanism (DCM), and uses computer simulations to investigate a mode selection scheme that switches between these two schemes.
Abstract: By enabling direct communications between nearby user equipment (UE), device-to-device (D2D) communication has become one of the key technologies in 5th generation (5G) mobile networks. D2D communication brings new communication opportunities for mobile devices, especially in a highly dense network. In this paper, D2D communication in the unlicensed spectrum, namely, D2D-Unlicensed (D2D-U), is discussed. The use of unlicensed frequency bands can ease the shortage of spectrum resources and improve network performance. However, the D2D-U in 5G has significant effects on the network performance of existing unlicensed networks sharing the same frequency bands, such as Wi-Fi and Bluetooth. Therefore, it is necessary to design a fair coexistence scheme for D2D-U. To understand the coexistence problem, in this paper, we first formulate the network performance of D2D-U and Wi-Fi under two different coexistence schemes, namely, listen before talk (LBT) and duty cycle mechanism (DCM). Then, we use computer simulations to investigate a mode selection scheme that switches between these two schemes and point out the best possible solution for the coexistence between D2D-U and Wi-Fi.

Journal ArticleDOI
TL;DR: In this paper, a survey of deep reinforcement learning (DRL) models for various kinds of multi-agent domains is presented, including objectives, characteristics, challenges, applications, and performance measures.
Abstract: Recent advancements in deep reinforcement learning (DRL) have led to its application in multi-agent scenarios to solve complex real-world problems, such as network resource allocation and sharing, network routing, and traffic signal controls. Multi-agent DRL (MADRL) enables multiple agents to interact with each other and with their operating environment, and learn without the need for external critics (or teachers), thereby solving complex problems. Significant performance enhancements brought about by the use of MADRL have been reported in multi-agent domains; for instance, it has been shown to provide higher quality of service (QoS) in network resource allocation and sharing. This paper presents a survey of MADRL models that have been proposed for various kinds of multi-agent domains, in a taxonomic approach that highlights various aspects of MADRL models and applications, including objectives, characteristics, challenges, applications, and performance measures. Furthermore, we present open issues and future directions of MADRL.

Journal ArticleDOI
TL;DR: In this paper, a machine learning (ML) based method is proposed to detect the number of WiFi users on the unlicensed bands, where an unsupervised neural network (NN) structure is applied to filter the detected transmission collision probability.
Abstract: The unlicensed spectrum has been utilized to make up the shortage on frequency spectrum in new radio (NR) systems To fully exploit the advantages brought by the unlicensed bands, one of the key issues is to guarantee the fair coexistence with WiFi systems To reach this goal, timely and accurate estimation on the WiFi traffic loads is an important prerequisite In this paper, a machine learning (ML) based method is proposed to detect the number of WiFi users on the unlicensed bands An unsupervised Neural Network (NN) structure is applied to filter the detected transmission collision probability on the unlicensed spectrum, which enables the NR users to precisely rectify the measurement error and estimate the number of active WiFi users Moreover, NN is trained online and the related parameters and learning rate of NN are jointly optimized to estimate the number of WiFi users adaptively with high accuracy Simulation results demonstrate that compared with the conventional Kalman Filter based detection mechanism, the proposed approach has lower complexity and can achieve a more stable and accurate estimation

Journal ArticleDOI
TL;DR: The Internet of Vehicles enables various types of vehicular applications, such as autonomous driving, precise fleet management, and real-time video analytics, which contribute significantly to bring us traffic efficiency, driving safety, and ride comfort, but these powerful applications always require intensive computation and very large size caching services under ultra-low latency constraints, and thus pose significant challenges on resource-constrained vehicles.
Abstract: Empowered with advanced computation units, autonomous sensing platforms and various wireless access capabilities, connected and autonomous vehicles evolve over time and become tightly coupled and closely cooperative. Being one of the most active research fields in both academic and industry, the Internet of Vehicles (IoV) enables various types of vehicular applications, such as autonomous driving, precise fleet management, and real-time video analytics, which contribute significantly to bring us traffic efficiency, driving safety, and ride comfort. However, these powerful applications always require intensive computation and very large size caching services under ultra-low latency constraints, and thus pose significant challenges on resource-constrained vehicles.

Journal ArticleDOI
TL;DR: The results show an increase of end-to-end delay and a decrease of packet delivery ratio due to the transmission of control messages and data packets in the wireless medium in the presence of the dynamic PUs’ activities.
Abstract: This paper demonstrates a route selection mechanism on a testbed with heterogeneous device-to-device (D2D) wireless communication for a 5G network scenario. The source node receives information about the primary users’ (PUs’) (or licensed users’) activities and available routes from the macrocell base station (or a central controller) and makes a decision to select a multihop route to the destination node. The source node from small cells can either choose: (a) a route with direct communication with the macrocell base station to improve the route performance; or (b) a route with D2D communication among nodes in the small cells to offload traffic from the macrocell to improve spectrum efficiency. The selected D2D route has the least PUs’ activities. The route selection mechanism is investigated on our testbed that helps to improve the accuracy of network performance measurement. In traditional testbeds, each node (e.g., Universal Software Radio Peripheral (USRP) that serves as the front-end communication block) is connected to a single processing unit (e.g., a personal computer) via a switch using cables. In our testbed, each USRP node is connected to a separate processing unit, i.e., raspberry Pi3 B+ (or RP3), which offers three main advantages: (a) control messages and data packets are exchanged via the wireless medium; (b) separate processing units make decisions in a distributed and heterogeneous manner; and (c) the nodes are placed further apart from one another. Therefore, in the investigation of our route selection scheme, the response delay of control message exchange and the packet loss caused by the operating environment (e.g., ambient noise) are implied in our end-to-end delay and packet delivery ratio measurement. Our results show an increase of end-to-end delay and a decrease of packet delivery ratio due to the transmission of control messages and data packets in the wireless medium in the presence of the dynamic PUs’ activities. Furthermore, D2D communication can offload 25% to 75% traffic from macrocell base station to small cells.

Journal ArticleDOI
TL;DR: In this article, the authors proposed a low-complexity iterative channel estimation and tracking algorithm by fully exploiting the sparsity of mmWave massive MIMO channels, where the signal eigenvectors are estimated and tracked based on the received signals at the base station.
Abstract: Although the millimeter wave (mmWave) massive multiple-input and multiple-output (MIMO) system can potentially boost the network capacity for future communications, the pilot overhead of the system in practice will greatly increase, which causes a significant decrease in system performance. In this paper, we propose a novel grouping-based channel estimation and tracking approach to reduce the pilot overhead and computational complexity while improving the estimation accuracy. Specifically, we design a low-complexity iterative channel estimation and tracking algorithm by fully exploiting the sparsity of mmWave massive MIMO channels, where the signal eigenvectors are estimated and tracked based on the received signals at the base station (BS). With the recovered signal eigenvectors, the celebrated multiple-signal classification (MUSIC) algorithm can be employed to estimate the direction of arrival (DoA) angles and the path amplitude for the user terminals (UTs). To improve the estimation accuracy and accelerate the tracking speed, we develop a closed-form solution for updating the step-size in the proposed iterative algorithm. Furthermore, a grouping method is proposed to reduce the number of sharing pilots in the scenario of multiple UTs to shorten the pilot overhead. The computational complexity of the proposed algorithm is analyzed. Simulation results are provided to verify the effectiveness of the proposed schemes in terms of the estimation accuracy, tracking speed, and overhead reduction.

Posted ContentDOI
TL;DR: An adaptive communication energy optimization scheme based on road curvature radius to save the energy of AES for the electric vehicle platoon on curved roads and reduces the power consumption of AES as long as the platoon driving on curved Roads.
Abstract: the cruising range of an electric vehicle is limited by its battery. reducing the energy consumption of mes (main energy systems) or aes (auxiliary energy systems) of the vehicle battery is an effective means to increase the electric vehicle cruising range. platoon driving can greatly reduce the wind resistance of the vehicle and then reduce the energy consumption of mes for electric vehicles. this paper proposes an adaptive communication energy optimization scheme based on road curvature radius to save the energy of aes for the electric vehicle platoon on curved roads. in this paper, the inter-vehicle distance error based on the car-like model in a two-dimensional space is established. then, the inter-vehicle distance error is used to design a control law k to accomplish successful platooning. next, three platooning control schemes based on different information flow topologies are discussed. finally, the consensus of three platooning control schemes and the energy consumption of electric vehicle communication systems are analyzed by matlab’s simulink. simulation results show that the communication energy optimization scheme reduces the power consumption of aes as long as the platoon driving on curved roads.

Journal ArticleDOI
01 Jan 2021
TL;DR: In this paper, an efficient information and communication technology (ICT) framework is proposed to collect, process, and utilize data for the purpose of satisfying increasing user demand in communications and computing while providing a functionality of responding quickly to pandemic.
Abstract: In order to satisfy new normal life styles in the post-COVID-19 era, an efficient information and communication technology (ICT) framework is required to collect, process, and utilize data for the purpose of satisfying increasing user demand in communications and computing while providing a functionality of responding quickly to pandemic. We propose an ICT framework that is capable to support diverse application requirements based on ambient communication, resilient computing, and agile control technologies. Computer simulations are conducted to evaluate the fundamental functionalities of the proposed framework by using a case study where data are exchanged and processed among different network entities in unmanned aerial vehicle (UAV) empowered vehicular environments. The simulation results show that the proposed approach can outperform existing baselines in various conditions.

Posted Content
TL;DR: In this article, a distributed joint power and spectrum scheme is proposed to improve the spectrum efficiency on the unlicensed bands and fit its distributed structure while ensuring the fairness among D2D-U links and the harmonious coexistence with WiFi networks.
Abstract: In this paper, a Device-to-Device communication on unlicensed bands (D2D-U) enabled network is studied. To improve the spectrum efficiency (SE) on the unlicensed bands and fit its distributed structure while ensuring the fairness among D2D-U links and the harmonious coexistence with WiFi networks, a distributed joint power and spectrum scheme is proposed. In particular, a parameter, named as price, is defined, which is updated at each D2D-U pair by a online trained Neural network (NN) according to the channel state and traffic load. In addition, the parameters used in the NN are updated by two ways, unsupervised self-iteration and federated learning, to guarantee the fairness and harmonious coexistence. Then, a non-convex optimization problem with respect to the spectrum and power is formulated and solved on each D2D-U link to maximize its own data rate. Numerical simulation results are demonstrated to verify the effectiveness of the proposed scheme.

Posted Content
Abstract: Federated learning has attracted much research attention due to its privacy protection in distributed machine learning. However, existing work of federated learning mainly focuses on Convolutional Neural Network (CNN), which cannot efficiently handle graph data that are popular in many applications. Graph Convolutional Network (GCN) has been proposed as one of the most promising techniques for graph learning, but its federated setting has been seldom explored. In this paper, we propose FedGraph for federated graph learning among multiple computing clients, each of which holds a subgraph. FedGraph provides strong graph learning capability across clients by addressing two unique challenges. First, traditional GCN training needs feature data sharing among clients, leading to risk of privacy leakage. FedGraph solves this issue using a novel cross-client convolution operation. The second challenge is high GCN training overhead incurred by large graph size. We propose an intelligent graph sampling algorithm based on deep reinforcement learning, which can automatically converge to the optimal sampling policies that balance training speed and accuracy. We implement FedGraph based on PyTorch and deploy it on a testbed for performance evaluation. The experimental results of four popular datasets demonstrate that FedGraph significantly outperforms existing work by enabling faster convergence to higher accuracy.