scispace - formally typeset
Search or ask a question

Showing papers in "arXiv: Signal Processing in 2021"


Journal ArticleDOI
TL;DR: In this article, the authors provide a comprehensive survey to draw a picture of the 6G system in terms of drivers, use cases, usage scenarios, requirements, key performance indicators (KPIs), architecture, and enabling technologies.
Abstract: As of today, the fifth generation (5G) mobile communication system has been rolled out in many countries and the number of 5G subscribers already reaches a very large scale. It is time for academia and industry to shift their attention towards the next generation. At this crossroad, an overview of the current state of the art and a vision of future communications are definitely of interest. This article thus aims to provide a comprehensive survey to draw a picture of the sixth generation (6G) system in terms of drivers, use cases, usage scenarios, requirements, key performance indicators (KPIs), architecture, and enabling technologies. First, we attempt to answer the question of "Is there any need for 6G?" by shedding light on its key driving factors, in which we predict the explosive growth of mobile traffic until 2030, and envision potential use cases and usage scenarios. Second, the technical requirements of 6G are discussed and compared with those of 5G with respect to a set of KPIs in a quantitative manner. Third, the state-of-the-art 6G research efforts and activities from representative institutions and countries are summarized, and a tentative roadmap of definition, specification, standardization, and regulation is projected. Then, we identify a dozen of potential technologies and introduce their principles, advantages, challenges, and open research issues. Finally, the conclusions are drawn to paint a picture of "What 6G may look like?". This survey is intended to serve as an enlightening guideline to spur interests and further investigations for subsequent research and development of 6G communications systems.

475 citations


Journal ArticleDOI
TL;DR: In this paper, the authors explore the emerging opportunities brought by 6G technologies in IoT networks and applications, by conducting a holistic survey on the convergence of 6G and IoT, and highlight interesting research challenges and point out potential directions to spur further research in this promising area.
Abstract: The sixth generation (6G) wireless communication networks are envisioned to revolutionize customer services and applications via the Internet of Things (IoT) towards a future of fully intelligent and autonomous systems. In this article, we explore the emerging opportunities brought by 6G technologies in IoT networks and applications, by conducting a holistic survey on the convergence of 6G and IoT. We first shed light on some of the most fundamental 6G technologies that are expected to empower future IoT networks, including edge intelligence, reconfigurable intelligent surfaces, space-air-ground-underwater communications, Terahertz communications, massive ultra-reliable and low-latency communications, and blockchain. Particularly, compared to the other related survey papers, we provide an in-depth discussion of the roles of 6G in a wide range of prospective IoT applications via five key domains, namely Healthcare Internet of Things, Vehicular Internet of Things and Autonomous Driving, Unmanned Aerial Vehicles, Satellite Internet of Things, and Industrial Internet of Things. Finally, we highlight interesting research challenges and point out potential directions to spur further research in this promising area.

305 citations


Journal ArticleDOI
TL;DR: In this paper, a comprehensive survey of the emerging applications of federated learning in IoT networks is provided, which explores and analyzes the potential of FL for enabling a wide range of IoT services, including IoT data sharing, data offloading and caching, attack detection, localization, mobile crowdsensing and IoT privacy and security.
Abstract: The Internet of Things (IoT) is penetrating many facets of our daily life with the proliferation of intelligent services and applications empowered by artificial intelligence (AI). Traditionally, AI techniques require centralized data collection and processing that may not be feasible in realistic application scenarios due to the high scalability of modern IoT networks and growing data privacy concerns. Federated Learning (FL) has emerged as a distributed collaborative AI approach that can enable many intelligent IoT applications, by allowing for AI training at distributed IoT devices without the need for data sharing. In this article, we provide a comprehensive survey of the emerging applications of FL in IoT networks, beginning from an introduction to the recent advances in FL and IoT to a discussion of their integration. Particularly, we explore and analyze the potential of FL for enabling a wide range of IoT services, including IoT data sharing, data offloading and caching, attack detection, localization, mobile crowdsensing, and IoT privacy and security. We then provide an extensive survey of the use of FL in various key IoT applications such as smart healthcare, smart transportation, Unmanned Aerial Vehicles (UAVs), smart cities, and smart industry. The important lessons learned from this review of the FL-IoT services and applications are also highlighted. We complete this survey by highlighting the current challenges and possible directions for future research in this booming area.

205 citations


Journal ArticleDOI
TL;DR: In this article, the authors present an in-depth tutorial of the 3GPP Release 16 5G NR V2X standard, with a particular focus on the sidelink.
Abstract: The Third Generation Partnership Project (3GPP) has recently published its Release 16 that includes the first Vehicle to-Everything (V2X) standard based on the 5G New Radio (NR) air interface. 5G NR V2X introduces advanced functionalities on top of the 5G NR air interface to support connected and automated driving use cases with stringent requirements. This paper presents an in-depth tutorial of the 3GPP Release 16 5G NR V2X standard for V2X communications, with a particular focus on the sidelink, since it is the most significant part of 5G NR V2X. The main part of the paper is an in-depth treatment of the key aspects of 5G NR V2X: the physical layer, the resource allocation, the quality of service management, the enhancements introduced to the Uu interface and the mobility management for V2N (Vehicle to Network) communications, as well as the co-existence mechanisms between 5G NR V2X and LTE V2X. We also review the use cases, the system architecture, and describe the evaluation methodology and simulation assumptions for 5G NR V2X. Finally, we provide an outlook on possible 5G NR V2X enhancements, including those identified within Release 17.

193 citations


Posted Content
TL;DR: In this paper, the authors provide a comprehensive overview on the background, range of key applications and state-of-the-art approaches of Integrated Sensing and Communications (ISAC).
Abstract: As the standardization of 5G is being solidified, researchers are speculating what 6G will be. Integrating sensing functionality is emerging as a key feature of the 6G Radio Access Network (RAN), allowing to exploit the dense cell infrastructure of 5G for constructing a perceptive network. In this paper, we provide a comprehensive overview on the background, range of key applications and state-of-the-art approaches of Integrated Sensing and Communications (ISAC). We commence by discussing the interplay between sensing and communications (S&C) from a historical point of view, and then consider multiple facets of ISAC and its performance gains. By introducing both ongoing and potential use cases, we shed light on industrial progress and standardization activities related to ISAC. We analyze a number of performance tradeoffs between S&C, spanning from information theoretical limits, tradeoffs in physical layer performance, to the tradeoff in cross-layer designs. Next, we discuss signal processing aspects of ISAC, namely ISAC waveform design and receive signal processing. As a step further, we provide our vision on the deeper integration between S&C within the framework of perceptive networks, where the two functionalities are expected to mutually assist each other, i.e., communication-assisted sensing and sensing-assisted communications. Finally, we summarize the paper by identifying the potential integration between ISAC and other emerging communication technologies, and their positive impact on the future of wireless networks.

181 citations


Posted Content
TL;DR: In this paper, the authors provide a tutorial on the fundamental properties of the RIS technology from a signal processing perspective, to complement the recent surveys of electromagnetic and hardware aspects, and exemplify how they can be utilized for improved communication, localization and sensing.
Abstract: A reconfigurable intelligent surface (RIS) is a two-dimensional surface of engineered material whose properties are reconfigurable rather than static [4]. For example, the scattering, absorption, reflection, and diffraction properties can be changed with time and controlled by software. In principle, the surface can be used to synthesize an arbitrarily-shaped object of the same size, when it comes to how electromagnetic waves interact with it [5]. The long-term vision of the RIS technology is to create smart radio environments [9], where the wireless propagation conditions are co-engineered with the physical-layer signaling, and investigate how to utilize this new capability. The common protocol stack consists of seven layers and wireless technology is chiefly focused on the first three layers (physical, link, and network) [10]. An RIS operates at what can be referred to as Layer 0, where the traditional design issue is the antennas of the transmitter/receivers; one can think of RIS as extending the antenna design towards the environment, commonly seen as uncontrollable and decided by "nature". This approach can profoundly change the wireless design beyond 5G. This article provides a tutorial on the fundamental properties of the RIS technology from a signal processing perspective, to complement the recent surveys of electromagnetic and hardware aspects [4], [7], communication theory [11], and localization [8]. We will provide the formulas and derivations that are required to understand and analyze RIS-aided systems, and exemplify how they can be utilized for improved communication, localization, and sensing. We will also elaborate on the fundamentally new possibilities enabled by Layer 0 engineering and electromagnetic phenomena that remain to be modeled and utilized for improved signal processing.

98 citations


Posted Content
TL;DR: In this paper, a novel diagnostic procedure using fuzzy theory and deep learning techniques is introduced, which is evaluated on the Bonn University dataset with six classification combinations and also on the Freiburg dataset.
Abstract: Epilepsy is one of the most crucial neurological disorders, and its early diagnosis will help the clinicians to provide accurate treatment for the patients. The electroencephalogram (EEG) signals are widely used for epileptic seizures detection, which provides specialists with substantial information about the functioning of the brain. In this paper, a novel diagnostic procedure using fuzzy theory and deep learning techniques are introduced. The proposed method is evaluated on the Bonn University dataset with six classification combinations and also on the Freiburg dataset. The tunable-Q wavelet transform (TQWT) is employed to decompose the EEG signals into different sub-bands. In the feature extraction step, 13 different fuzzy entropies are calculated from different sub-bands of TQWT, and their computational complexities are calculated to help researchers choose the best feature sets. In the following, an autoencoder (AE) with six layers is employed for dimensionality reduction. Finally, the standard adaptive neuro-fuzzy inference system (ANFIS), and also its variants with grasshopper optimization algorithm (ANFIS-GOA), particle swarm optimization (ANFIS-PSO), and breeding swarm optimization (ANFIS-BS) methods are used for classification. Using our proposed method, ANFIS-BS method has obtained an accuracy of 99.74% in classifying into two classes and an accuracy of 99.46% in ternary classification on the Bonn dataset and 99.28% on the Freiburg dataset, reaching state-of-the-art performances on both of them.

52 citations


Posted Content
TL;DR: KalmanNet as discussed by the authors incorporates the structural Gaussian state space (SS) model with a dedicated recurrent neural network module in the flow of the Kalman filter to learn complex dynamics from data.
Abstract: Real-time state estimation of dynamical systems is a fundamental task in signal processing and control. For systems that are well-represented by a fully known linear Gaussian state space (SS) model, the celebrated Kalman filter (KF) is a low complexity optimal solution. However, both linearity of the underlying SS model and accurate knowledge of it are often not encountered in practice. Here, we present KalmanNet, a real-time state estimator that learns from data to carry out Kalman filtering under non-linear dynamics with partial information. By incorporating the structural SS model with a dedicated recurrent neural network module in the flow of the KF, we retain data efficiency and interpretability of the classic algorithm while implicitly learning complex dynamics from data. We numerically demonstrate that KalmanNet overcomes nonlinearities and model mismatch, outperforming classic filtering methods operating with both mismatched and accurate domain knowledge.

42 citations


Posted Content
TL;DR: Simulation results show that the proposed predictive method not only guarantees the required sensing performance, but also achieves a satisfactory sum-rate that can approach the upper bound obtained by the genie-aided scheme with the perfect instantaneous channel state information.
Abstract: This paper investigates the integrated sensing and communication (ISAC) in vehicle-to-infrastructure (V2I) networks. To realize ISAC, an effective beamforming design is essential which however, highly depends on the availability of accurate channel tracking requiring large training overhead and computational complexity. Motivated by this, we adopt a deep learning (DL) approach to implicitly learn the features of historical channels and directly predict the beamforming matrix to be adopted for the next time slot to maximize the average achievable sum-rate of an ISAC system. The proposed method can bypass the need of explicit channel tracking process and reduce the signaling overhead significantly. To this end, a general sum-rate maximization problem with Cramer-Rao lower bounds (CRLBs)-based sensing constraints is first formulated for the considered ISAC system. Then, by exploiting the penalty method, a versatile unsupervised DL-based predictive beamforming design framework is developed to address the formulated design problem. As a realization of the developed framework, a historical channels-based convolutional long short-term memory (LSTM) network (HCL-Net) is devised for predictive beamforming in the ISAC-based V2I network. Specifically, the convolution and LSTM modules are successively adopted in the proposed HCL-Net to exploit the spatial and temporal dependencies of communication channels to further improve the learning performance. Finally, simulation results show that the proposed predictive method not only guarantees the required sensing performance, but also achieves a satisfactory sum-rate that can approach the upper bound obtained by the genie-aided scheme with the perfect instantaneous channel state information.

40 citations


Posted Content
TL;DR: In this paper, a multi-skilled diffractive neural network based on a metasurface device is demonstrated for on-chip multi-channel sensing and multitasking at the speed of light in the visible.
Abstract: Replacing electrons with photons is a compelling route towards light-speed, highly parallel, and low-power artificial intelligence computing. Recently, all-optical diffractive neural deep neural networks have been demonstrated. However, the existing architectures often comprise bulky components and, most critically, they cannot mimic the human brain for multitasking. Here, we demonstrate a multi-skilled diffractive neural network based on a metasurface device, which can perform on-chip multi-channel sensing and multitasking at the speed of light in the visible. The metasurface is integrated with a complementary metal oxide semiconductor imaging sensor. Polarization multiplexing scheme of the subwavelength nanostructures are applied to construct a multi-channel classifier framework for simultaneous recognition of digital and fashionable items. The areal density of the artificial neurons can reach up to 6.25x106/mm2 multiplied by the number of channels. Our platform provides an integrated solution with all-optical on-chip sensing and computing for applications in machine vision, autonomous driving, and precision medicine.

40 citations


Posted Content
TL;DR: In this article, the impact of the radiation patterns of the antennas and unit cells of the RISs is formulated in terms of an angle-dependent loss factor, which gives more accurate estimates of the path loss of RISs comprised of unit cells with a deep sub-wavelength size.
Abstract: Reconfigurable intelligent surfaces (RISs) provide an interface between the electromagnetic world of the wireless propagation environment and the digital world of information science. Simple yet sufficiently accurate path loss models for RISs are an important basis for theoretical analysis and optimization of RIS-assisted wireless communication systems. In this paper, we refine our previously proposed free-space path loss model for RISs to make it simpler, more applicable, and easier to use. In the proposed path loss model, the impact of the radiation patterns of the antennas and unit cells of the RIS is formulated in terms of an angle-dependent loss factor. The refined model gives more accurate estimates of the path loss of RISs comprised of unit cells with a deep sub-wavelength size. The free-space path loss model of the sub-channel provided by a single unit cell is also explicitly provided. In addition, two fabricated RISs, which are designed to operate in the millimeter-wave (mmWave) band, are utilized to carry out a measurement campaign in order to characterize and validate the proposed path loss model for RIS-assisted wireless communications. The measurement results corroborate the proposed analytical model. The proposed refined path loss model for RISs reveals that the reflecting capability of a single unit cell is proportional to its physical aperture and to an angle-dependent factor. In particular, the far-field beamforming gain provided by an RIS is mainly determined by the total area of the surface and by the angles of incidence and reflection.

Posted Content
TL;DR: In this article, the authors provide a comprehensive and up-to-date survey on the communication technologies used in the smart grid, including the communication requirements, physical layer technologies, network architectures, and research challenges.
Abstract: With the ongoing trends in the energy sector such as vehicular electrification and renewable energy, smart grid is clearly playing a more and more important role in the electric power system industry. One essential feature of the smart grid is the information flow over the high-speed, reliable and secure data communication network in order to manage the complex power systems effectively and intelligently. Smart grids utilize bidirectional communication to function where traditional power grids mainly only use one-way communication. The communication requirements and suitable technique differ depending on the specific environment and scenario. In this paper, we provide a comprehensive and up-to-date survey on the communication technologies used in the smart grid, including the communication requirements, physical layer technologies, network architectures, and research challenges. This survey aims to help the readers identify the potential research problems in the continued research on the topic of smart grid communications.

Posted Content
TL;DR: A reconfigurable intelligent surface (RIS) is used to enhance the radar sensing and communication capabilities of a mmWave dual function radar communication system and a multi-stage hierarchical codebook is designed to localize the target while ensuring a strong communication link to the user.
Abstract: In this paper, we use a reconfigurable intelligent surface (RIS) to enhance the radar sensing and communication capabilities of a mmWave dual function radar communication system. To simultaneously localize the target and to serve the user, we propose to adaptively partition the RIS by reserving separate RIS elements for sensing and communication. We design a multi-stage hierarchical codebook to localize the target while ensuring a strong communication link to the user. We also present a method to choose the number of times to transmit the same beam in each stage to achieve a desired target localization probability of error. The proposed algorithm typically requires fewer transmissions than an exhaustive search scheme to achieve a desired target localization probability of error. Furthermore, the average spectral efficiency of the user with the proposed algorithm is found to be comparable to that of a RIS-assisted MIMO communication system without sensing capabilities and is much better than that of traditional MIMO systems without RIS.

Posted Content
TL;DR: In this article, a unified systematic framework for federated learning in a manner that encapsulates and highlights the main challenges that are natural to treat using signal processing tools is presented, and a set of candidate approaches for tackling its unique challenges are surveyed.
Abstract: The dramatic success of deep learning is largely due to the availability of data. Data samples are often acquired on edge devices, such as smart phones, vehicles and sensors, and in some cases cannot be shared due to privacy considerations. Federated learning is an emerging machine learning paradigm for training models across multiple edge devices holding local datasets, without explicitly exchanging the data. Learning in a federated manner differs from conventional centralized machine learning, and poses several core unique challenges and requirements, which are closely related to classical problems studied in the areas of signal processing and communications. Consequently, dedicated schemes derived from these areas are expected to play an important role in the success of federated learning and the transition of deep learning from the domain of centralized servers to mobile edge devices. In this article, we provide a unified systematic framework for federated learning in a manner that encapsulates and highlights the main challenges that are natural to treat using signal processing tools. We present a formulation for the federated learning paradigm from a signal processing perspective, and survey a set of candidate approaches for tackling its unique challenges. We further provide guidelines for the design and adaptation of signal processing and communication methods to facilitate federated learning at large scale.

Posted Content
TL;DR: In this paper, the authors considered the target detection problem in a RIS-aided multiple-input multiple-output (MIMO) radar system and proposed a general signal model, which includes the possibility of using up to two RISs and subsumes both a mono-static and a bi-static radar configuration with or without a line of sight (LOS) view of the prospective target.
Abstract: A reconfigurable intelligent surface (RIS) is a flat layer made of sub-wavelength-sized reflective elements capable of adding a tunable phase shift to the impinging electromagnetic wave. This paper considers the fundamental problem of target detection in a RIS-aided multiple-input multiple-output (MIMO) radar system. At first, a general signal model is introduced, which includes the possibility of using up to two RISs (one close to the transmitter and one close to the receiver) and subsumes both a mono-static and a bi-static radar configuration with or without a line-of-sight (LOS) view of the prospective target. Upon resorting to a generalized likelihood ratio test (GLRT), the design of the RIS phase shifts is formulated as the maximization of the probability of detection in the resolution cell under inspection for a fixed probability of false alarm, and suitable optimization algorithms are proposed and discussed. Both the theoretical and the numerical analysis clearly show the benefits, in terms of the signal-to-noise ratio (SNR) at the radar receiver, granted by the use of the RISs and shed light on the interplay among the key system parameters, such as the radar-RIS distance, the RIS size, and location of the prospective target. A major finding is that the RISs should be deployed in the near-field of the radar transmit/receive array. The paper is then concluded by discussing some open problems and foreseen applications.

Posted ContentDOI
TL;DR: In this article, the authors proposed a Graph Neural Network (GNN) based, scalable and real-time detector of FDIAs that combines model-driven and data-driven approaches by incorporating the inherent physical connections of modern AC power grids and exploiting the spatial correlations of the measurement data.
Abstract: False data injection attacks (FDIAs) represent a major class of attacks that aim to break the integrity of measurements by injecting false data into the smart metering devices in power grid. To the best of authors' knowledge, no study has attempted to design a detector that automatically models the underlying graph topology and spatially correlated measurement data of the smart grids to better detect cyber attacks. The contributions of this paper to detect and mitigate FDIAs are twofold. First, we present a generic, localized, and stealth (unobservable) attack generation methodology and a publicly accessible dataset for researchers to develop and test their algorithms. Second, we propose a Graph Neural Network (GNN) based, scalable and real-time detector of FDIAs that efficiently combines model-driven and data-driven approaches by incorporating the inherent physical connections of modern AC power grids and exploiting the spatial correlations of the measurement data. It is experimentally verified by comparing the proposed GNN based detector with the currently available FDIA detectors in literature that our algorithm outperforms the best available solutions by 6.21\%, 0.69\%, and 2.73\% in detection rate and by 3.65\%, 0.34\% and 1.38\% in F1 score for standard IEEE testbeds with 14, 118, and 300 buses, respectively.

Posted Content
TL;DR: This work introduces the concept of intelligent spectrum learning (ISL), which uses an appropriately trained convolutional neural network at the RIS controller to help the RISs infer the interfering signals directly from the incident signals, and proposes a distributed control algorithm to maximize the received SINR.
Abstract: Reconfigurable intelligent surface (RIS) has become a promising technology for enhancing the reliability of wireless communications, which is capable of reflecting the desired signals through appropriate phase shifts. However, the intended signals that impinge upon an RIS are often mixed with interfering signals, which are usually dynamic and unknown. In particular, the received signal-to-interference-plus-noise ratio (SINR) may be degraded by the signals reflected from the RISs that originate from non-intended users. To tackle this issue, we introduce the concept of intelligent spectrum learning (ISL), which uses an appropriately trained convolutional neural network (CNN) at the RIS controller to help the RISs infer the interfering signals directly from the incident signals. By capitalizing on the ISL, a distributed control algorithm is proposed to maximize the received SINR by dynamically configuring the active/inactive binary status of the RIS elements. Simulation results validate the performance improvement offered by deep learning and demonstrate the superiority of the proposed ISL-aided approach.

Proceedings ArticleDOI
TL;DR: In this article, the authors proposed a meta-learning based approach to one-shot RF-HAR, which reduces the labeling efforts for environment adaptation to the minimum level by using a dual-path base HAR network, where both time and frequency domains are dedicated to learning powerful RF features.
Abstract: Radio-Frequency (RF) based device-free Human Activity Recognition (HAR) rises as a promising solution for many applications. However, device-free (or contactless) sensing is often more sensitive to environment changes than device-based (or wearable) sensing. Also, RF datasets strictly require on-line labeling during collection, starkly different from image and text data collections where human interpretations can be leveraged to perform off-line labeling. Therefore, existing solutions to RF-HAR entail a laborious data collection process for adapting to new environments. To this end, we propose RF-Net as a meta-learning based approach to one-shot RF-HAR; it reduces the labeling efforts for environment adaptation to the minimum level. In particular, we first examine three representative RF sensing techniques and two major meta-learning approaches. The results motivate us to innovate in two designs: i) a dual-path base HAR network, where both time and frequency domains are dedicated to learning powerful RF features including spatial and attention-based temporal ones, and ii) a metric-based meta-learning framework to enhance the fast adaption capability of the base network, including an RF-specific metric module along with a residual classification module. We conduct extensive experiments based on all three RF sensing techniques in multiple real-world indoor environments; all results strongly demonstrate the efficacy of RF-Net compared with state-of-the-art baselines.

Journal ArticleDOI
TL;DR: In this paper, the authors proposed orthogonal time sequency multiplexing (OTSM), a single carrier modulation scheme that places information symbols in the delay-sequency domain.
Abstract: This paper proposes orthogonal time sequency multiplexing (OTSM), a novel single carrier modulation scheme that places information symbols in the delay-sequency domain followed by a cascade of time-division multiplexing (TDM) and Walsh-Hadamard sequence multiplexing. Thanks to the Walsh Hadamard transform (WHT), the modulation and demodulation do not require complex domain multiplications. For the proposed OTSM, we first derive the input-output relation in the delay-sequency domain and present a low complexity detection method taking advantage of zero-padding. We demonstrate via simulations that OTSM offers high performance gains over orthogonal frequency division multiplexing (OFDM) and similar performance to orthogonal time frequency space (OTFS), but at lower complexity owing to WHT. Then we propose a low complexity time-domain channel estimation method. Finally, we show how to include an outer error control code and a turbo decoder to improve error performance of the coded system.

Posted Content
TL;DR: In this paper, the authors proposed a two-phase channel estimation method, in which the cascaded channel of one typical user is estimated in Phase I based on the linear correlation among cascaded paths, while the estimated channels of other users are estimated by utilizing the partial CSI of the common base station (BS)-RIS channel obtained in Phase II.
Abstract: Channel estimation in the RIS-aided massive multiuser multiple-input single-output (MU-MISO) wireless communication systems is challenging due to the passive feature of RIS and the large number of reflecting elements that incur high channel estimation overhead. To address this issue, we propose a novel cascaded channel estimation strategy with low pilot overhead by exploiting the sparsity and the correlation of multiuser cascaded channels in millimeter-wave massive MISO systems. Based on the fact that the phsical positions of the BS, the RIS and users may not change in several or even tens of consecutive channel coherence blocks, we first estimate the full channel state information (CSI) including all the angle and gain information in the first coherence block, and then only re-estimate the channel gains in the remaining coherence blocks with much less pilot overhead. In the first coherence block, we propose a two-phase channel estimation method, in which the cascaded channel of one typical user is estimated in Phase I based on the linear correlation among cascaded paths, while the cascaded channels of other users are estimated in Phase II by utilizing the partial CSI of the common base station (BS)-RIS channel obtained in Phase I. The total theoretical minimum pilot overhead in the first coherence block is $8J-2+(K-1)\left\lceil (8J-2)/L\right\rceil $, where $K$, $L$ and $J$ denote the numbers of users, paths in the BS-RIS channel and paths in the RIS-user channel, respectively. In each of the remaining coherence blocks, the minimum pilot overhead is $JK$. Moreover, the training phase shift matrices at the RIS are optimized to improve the estimation performance.

Proceedings ArticleDOI
Chenglin Li1, Di Niu1, Bei Jiang1, Xiao Zuo2, Jianming Yang2 
TL;DR: Meta-HAR as mentioned in this paper is a federated representation learning framework, in which a signal embedding network is meta-learned and fed into a personalized classification network at each user for activity prediction.
Abstract: Human activity recognition (HAR) based on mobile sensors plays an important role in ubiquitous computing. However, the rise of data regulatory constraints precludes collecting private and labeled signal data from personal devices at scale. Federated learning has emerged as a decentralized alternative solution to model training, which iteratively aggregates locally updated models into a shared global model, therefore being able to leverage decentralized, private data without central collection. However, the effectiveness of federated learning for HAR is affected by the fact that each user has different activity types and even a different signal distribution for the same activity type. Furthermore, it is uncertain if a single global model trained can generalize well to individual users or new users with heterogeneous data. In this paper, we propose Meta-HAR, a federated representation learning framework, in which a signal embedding network is meta-learned in a federated manner, while the learned signal representations are further fed into a personalized classification network at each user for activity prediction. In order to boost the representation ability of the embedding network, we treat the HAR problem at each user as a different task and train the shared embedding network through a Model-Agnostic Meta-learning framework, such that the embedding network can generalize to any individual user. Personalization is further achieved on top of the robustly learned representations in an adaptation procedure. We conducted extensive experiments based on two publicly available HAR datasets as well as a newly created HAR dataset. Results verify that Meta-HAR is effective at maintaining high test accuracies for individual users, including new users, and significantly outperforms several baselines, including Federated Averaging, Reptile and even centralized learning in certain cases.

Posted Content
TL;DR: In this paper, the authors report a prototype of an RIS that offers the capability of controlling the phase shift of the reflected waves in a continuous manner, and characterize its characteristics by using full-wave simulations and through experimental measurements.
Abstract: With the development of the next generation of mobile networks, new research challenges have emerged, and new technologies have been proposed to face them. On the one hand, the reconfigurable intelligent surface (RIS) technology is being investigated for partially controlling the wireless channels. The RIS is a promising technology for improving the signal quality by controlling the scattering of the electromagnetic waves in a nearly passive manner. On the other hand, ambient backscatter communications (AmBC) is another promising technology that is tailored for addressing the energy efficiency requirements for the Internet of Things (IoT). This technique enables low-power communications by backscattering ambient signals and, thus, reusing existing electromagnetic waves for communications. RIS technology can be utilized in the context of AmBC for improving the system performance. In this paper, we report a prototype of an RIS that offers the capability of controlling the phase shift of the reflected waves in a continuous manner, and we characterize its characteristics by using full-wave simulations and through experimental measurements. Specifically, we introduce a phase shift model for predicting the signal reflected by the RIS prototype. We apply the proposed model for optimizing an RISassisted AmBC system and we demonstrate that the use of an RIS can significantly improve the system performance.

Journal ArticleDOI
TL;DR: In this paper, the authors discuss several key aspects of multi-modal emotion recognition (MER) and summarize existing emotion annotation strategies and corresponding computational tasks, followed by the description of main challenges in MER.
Abstract: Humans are emotional creatures. Multiple modalities are often involved when we express emotions, whether we do so explicitly (e.g., facial expression, speech) or implicitly (e.g., text, image). Enabling machines to have emotional intelligence, i.e., recognizing, interpreting, processing, and simulating emotions, is becoming increasingly important. In this tutorial, we discuss several key aspects of multi-modal emotion recognition (MER). We begin with a brief introduction on widely used emotion representation models and affective modalities. We then summarize existing emotion annotation strategies and corresponding computational tasks, followed by the description of main challenges in MER. Furthermore, we present some representative approaches on representation learning of each affective modality, feature fusion of different affective modalities, classifier optimization for MER, and domain adaptation for MER. Finally, we outline several real-world applications and discuss some future directions.

Journal ArticleDOI
TL;DR: In this paper, an SCMA codebook design approach is proposed based on uniquely decomposable constellation group (UDCG), which helps improve spectrum efficiency (SE) and enhance connectivity, has been proposed as a NOMA scheme for 5G systems.
Abstract: Sparse code multiple access (SCMA), which helps improve spectrum efficiency (SE) and enhance connectivity, has been proposed as a non-orthogonal multiple access (NOMA) scheme for 5G systems. In SCMA, codebook design determines system overload ratio and detection performance at a receiver. In this paper, an SCMA codebook design approach is proposed based on uniquely decomposable constellation group (UDCG). We show that there are $N+1 (N \geq 1)$ constellations in the proposed UDCG, each of which has $M (M \geq 2)$ constellation points. These constellations are allocated to users sharing the same resource. Combining the constellations allocated on multiple resources of each user, we can obtain UDCG-based codebook sets. Bit error ratio (BER) performance will be discussed in terms of coding gain maximization with superimposed constellations and UDCG-based codebooks. Simulation results demonstrate that the superimposed constellation of each resource has large minimum Euclidean distance (MED) and meets uniquely decodable constraint. Thus, BER performance of the proposed codebook design approach outperforms that of the existing codebook design schemes in both uncoded and coded SCMA systems, especially for large-size codebooks.

Posted ContentDOI
Qin Wang1, Cees Taal, Olga Fink1
TL;DR: Wang et al. as discussed by the authors integrated expert knowledge with domain adaptation in a synthetic-to-real framework for unsupervised fault diagnosis by augmenting real vibration samples of healthy bearings with synthetic data.
Abstract: Data-driven fault diagnosis methods often require abundant labeled examples for each fault type. On the contrary, real-world data is often unlabeled and consists of mostly healthy observations and only few samples of faulty conditions. The lack of labels and fault samples imposes a significant challenge for existing data-driven fault diagnosis methods. In this paper, we aim to overcome this limitation by integrating expert knowledge with domain adaptation in a synthetic-to-real framework for unsupervised fault diagnosis. Motivated by the fact that domain experts often have a relatively good understanding on how different fault types affect healthy signals, in the first step of the proposed framework, a synthetic fault dataset is generated by augmenting real vibration samples of healthy bearings. This synthetic dataset integrates expert knowledge and encodes class information about the faults types. However, models trained solely based on the synthetic data often do not perform well because of the distinct distribution difference between the synthetically generated and real faults. To overcome this domain gap between the synthetic and real data, in the second step of the proposed framework, an imbalance-robust domain adaptation~(DA) approach is proposed to adapt the model from synthetic faults~(source) to the unlabeled real faults~(target) which suffer from severe class imbalance. The framework is evaluated on two unsupervised fault diagnosis cases for bearings, the CWRU laboratory dataset and a real-world wind-turbine dataset. Experimental results demonstrate that the generated faults are effective for encoding fault type information and the domain adaptation is robust against the different levels of class imbalance between faults.

Posted Content
TL;DR: In this paper, a scalable and robust RFFI framework achieved by deep learning powered radio frequency fingerprint (RFF) extractor is proposed, which leverages the deep metric learning to train an RFF extractor which has excellent generalization ability and can extract RFFs from previously unseen devices.
Abstract: Radio frequency fingerprint identification (RFFI) is a promising device authentication technique based on the transmitter hardware impairments. In this paper, we propose a scalable and robust RFFI framework achieved by deep learning powered radio frequency fingerprint (RFF) extractor. Specifically, we leverage the deep metric learning to train an RFF extractor, which has excellent generalization ability and can extract RFFs from previously unseen devices. Any devices can be enrolled via the pre-trained RFF extractor and the RFF database can be maintained efficiently for allowing devices to join and leave. Wireless channel impacts the RFF extraction and is tackled by exploiting channel independent feature and data augmentation. We carried out extensive experimental evaluation involving 60 commercial off-the-shelf LoRa devices and a USRP N210 software defined radio platform. The results have successfully demonstrated that our framework can achieve excellent generalization abilities for device classification and rogue device detection as well as effective channel mitigation.

Journal ArticleDOI
TL;DR: Numerical results verify the theoretical analysis that the new PARN has high accuracy in AN clock offset estimation and simultaneous localization and synchronization for a moving UD, and demonstrate the feasibility and superiority of the PARN in real-world applications by experiments.
Abstract: In this article, we design a new time-of-arrival (TOA) system for simultaneous user device (UD) localization and synchronization with a periodic asymmetric ranging network, namely PARN. The PARN includes one primary anchor node (PAN) transmitting and receiving signals, and many secondary ANs (SAN) only receiving signals. All the UDs can transmit and receive signals. The PAN periodically transmits sync signal and the UD transmits response signal after reception of the sync signal. Using TOA measurements from the periodic sync signal at SANs, we develop a Kalman filtering method to virtually synchronize ANs with high accuracy estimation of clock parameters. Employing the virtual synchronization, and TOA measurements from the response signal and sync signal, we then develop a maximum likelihood (ML) approach, namely ML-LAS, to simultaneously localize and synchronize a moving UD. We analyze the UD localization and synchronization error, and derive the Cramer-Rao lower bound (CRLB). Different from existing asymmetric ranging network-based TOA systems, the new PARN i) uses the periodic sync signals at the SAN to exploit the temporal correlated clock information for high accuracy virtual synchronization, and ii) compensates the UD movement and clock drift using various TOA measurements to achieve consistent and simultaneous localization and synchronization performance. Numerical results verify the theoretical analysis that the new system has high accuracy in AN clock offset estimation and simultaneous localization and synchronization for a moving UD. We implement a prototype hardware system and demonstrate the feasibility and superiority of the PARN in real-world applications by experiments.

Posted Content
TL;DR: Terahertz (THz) communications are celebrated as key enablers for converged localization and sensing in future 6G wireless communication systems and beyond as discussed by the authors, and localization in 6G is indispensable for location-aware communications.
Abstract: Terahertz (THz) communications are celebrated as key enablers for converged localization and sensing in future sixth-generation (6G) wireless communication systems and beyond. Instead of being a byproduct of the communication system, localization in 6G is indispensable for location-aware communications. Towards this end, we aim to identify the prospects, challenges, and requirements of THz localization techniques. We first review the history and trends of localization methods and discuss their objectives, constraints, and applications in contemporary communication systems. We then detail the latest advances in THz communications and introduce the THz-specific channel and system models. Afterward, we formulate THz-band localization as a 3D position/orientation estimation problem, detailing geometry-based localization techniques and describing potential THz localization and sensing extensions. We further formulate the offline design and online optimization of THz localization systems, provide numerical simulation results, and conclude by providing insight into interdisciplinary future research directions. Preliminary results illustrate that under the same total transmission power and time, THz-based localization is ~5 (~20) times more accurate than mmWave-based localization without (with) prior position information.

Posted Content
TL;DR: In this article, a comprehensive assessment of self-supervised representation learning from short segments of clinical 12-lead electrocardiography (ECG) data is presented, which explores adaptations of state-of-the-art selfsupervised learning algorithms from computer vision (SimCLR, BYOL, SwAV) and speech (CPC).
Abstract: We put forward a comprehensive assessment of self-supervised representation learning from short segments of clinical 12-lead electrocardiography (ECG) data. To this end, we explore adaptations of state-of-the-art self-supervised learning algorithms from computer vision (SimCLR, BYOL, SwAV) and speech (CPC). In a first step, we learn contrastive representations and evaluate their quality based on linear evaluation performance on a downstream classification task. For the best-performing method, CPC, we find linear evaluation performances only 0.8% below supervised performance. In a second step, we analyze the impact of self-supervised pretraining on finetuned ECG classifiers as compared to purely supervised performance and find improvements in downstream performance of more than 1%, label efficiency, as well as an increased robustness against physiological noise. All experiments are carried out exclusively on publicly available datasets, the to-date largest collection used for self-supervised representation learning from ECG data, to foster reproducible research in the field of ECG representation learning.

Posted Content
TL;DR: In this paper, a joint synthesis of constant envelope transmit signal and receive filter aimed at optimizing radar performance in signal-dependent interference and spectrally contested-congested environments is proposed.
Abstract: This paper focuses on the joint synthesis of constant envelope transmit signal and receive filter aimed at optimizing radar performance in signal-dependent interference and spectrally contested-congested environments. To ensure the desired Quality of Service (QoS) at each communication system, a precise control of the interference energy injected by the radar in each licensed/shared bandwidth is imposed. Besides, along with an upper bound to the maximum transmitted energy, constant envelope (with either arbitrary or discrete phases) and similarity constraints are forced to ensure compatibility with amplifiers operating in saturation regime and bestow relevant waveform features, respectively. To handle the resulting NP-hard design problems, new iterative procedures (with ensured convergence properties) are devised to account for continuous and discrete phase constraints, capitalizing on the Coordinate Descent (CD) framework. Two heuristic procedures are also proposed to perform valuable initializations. Numerical results are provided to assess the effectiveness of the conceived algorithms in comparison with the existing methods.