Showing papers on "Noise measurement published in 2020"

PDF

Open Access

Proceedings Article•DOI•

Learning to Restore Low-Light Images via Decomposition-and-Enhancement

[...]

Ke Xu¹, Xin Yang², Baocai Yin², Rynson W. H. Lau¹•Institutions (2)

City University of Hong Kong¹, Dalian University of Technology²

14 Jun 2020

TL;DR: A frequency-based decompositionand- enhancement model that first learns to recover image objects in the low-frequency layer and then enhances high-frequency details based on the recovered image objects and outperforms state-of-the-art approaches in enhancing practical noisy low-light images.

...read moreread less

Abstract: Low-light images typically suffer from two problems. First, they have low visibility (i.e., small pixel values). Second, noise becomes significant and disrupts the image content, due to low signal-to-noise ratio. Most existing lowlight image enhancement methods, however, learn from noise-negligible datasets. They rely on users having good photographic skills in taking images with low noise. Unfortunately, this is not the case for majority of the low-light images. While concurrently enhancing a low-light image and removing its noise is ill-posed, we observe that noise exhibits different levels of contrast in different frequency layers, and it is much easier to detect noise in the lowfrequency layer than in the high one. Inspired by this observation, we propose a frequency-based decompositionand- enhancement model for low-light image enhancement. Based on this model, we present a novel network that first learns to recover image objects in the low-frequency layer and then enhances high-frequency details based on the recovered image objects. In addition, we have prepared a new low-light image dataset with real noise to facilitate learning. Finally, we have conducted extensive experiments to show that the proposed method outperforms state-of-the-art approaches in enhancing practical noisy low-light images.

...read moreread less

167 citations

Proceedings Article•DOI•

Noisier2Noise: Learning to Denoise From Unpaired Noisy Data

[...]

Nick Moran¹, Dan Schmidt¹, Yu Zhong¹, Patrick Coady¹•Institutions (1)

Analog Devices¹

14 Jun 2020

TL;DR: This work presents a method for training a neural network to perform image denoising without access to clean training examples or access to paired noisy training examples, which produces results which are competitive with other learned methods which require richer training data, and outperforms traditional non-learned Denoising methods.

...read moreread less

Abstract: We present a method for training a neural network to perform image denoising without access to clean training examples or access to paired noisy training examples. Our method requires only a single noisy realization of each training example and a statistical model of the noise distribution, and is applicable to a wide variety of noise models, including spatially structured noise. Our model produces results which are competitive with other learned methods which require richer training data, and outperforms traditional non-learned denoising methods. We present derivations of our method for arbitrary additive noise, an improvement specific to Gaussian additive noise, and an extension to multiplicative Bernoulli noise.

...read moreread less

136 citations

Proceedings Article•DOI•

Distilling Effective Supervision From Severe Label Noise

[...]

Zizhao Zhang¹, Han Zhang¹, Sercan O. Arik¹, Honglak Lee¹, Tomas Pfister¹ - Show less +1 more•Institutions (1)

Google¹

14 Jun 2020

TL;DR: This paper presents a holistic framework to train deep neural networks in a way that is highly invulnerable to label noise and achieves excellent performance on large-scale datasets with real-world label noise.

...read moreread less

Abstract: Collecting large-scale data with clean labels for supervised training of neural networks is practically challenging. Although noisy labels are usually cheap to acquire, existing methods suffer a lot from label noise. This paper targets at the challenge of robust training at high label noise regimes. The key insight to achieve this goal is to wisely leverage a small trusted set to estimate exemplar weights and pseudo labels for noisy data in order to reuse them for supervised training. We present a holistic framework to train deep neural networks in a way that is highly invulnerable to label noise. Our method sets the new state of the art on various types of label noise and achieves excellent performance on large-scale datasets with real-world label noise. For instance, on CIFAR100 with a 40% uniform noise ratio and only 10 trusted labeled data per class, our method achieves 80.2% classification accuracy, where the error rate is only 1.4% higher than a neural network trained without label noise. Moreover, increasing the noise ratio to 80%, our method still maintains a high accuracy of 75.5%, compared to the previous best accuracy 48.2%.

...read moreread less

135 citations

Proceedings Article•DOI•

A Physics-Based Noise Formation Model for Extreme Low-Light Raw Denoising

[...]

Kaixuan Wei¹, Ying Fu¹, Jiaolong Yang², Hua Huang¹•Institutions (2)

Beijing Institute of Technology¹, Microsoft²

14 Jun 2020

TL;DR: A highly accurate noise formation model based on the characteristics of CMOS photosensors is presented, thereby enabling us to synthesize realistic samples that better match the physics of image formation process.

...read moreread less

Abstract: Lacking rich and realistic data, learned single image denoising algorithms generalize poorly in real raw images that not resemble the data used for training. Although the problem can be alleviated by the heteroscedastic Gaussian noise model, the noise sources caused by digital camera electronics are still largely overlooked, despite their significant effect on raw measurement, especially under extremely low-light condition. To address this issue, we present a highly accurate noise formation model based on the characteristics of CMOS photosensors, thereby enabling us to synthesize realistic samples that better match the physics of image formation process. Given the proposed noise model, we additionally propose a method to calibrate the noise parameters for available modern digital cameras, which is simple and reproducible for any new device. We systematically study the generalizability of a neural network trained with existing schemes, by introducing a new low-light denoising dataset that covers many modern digital cameras from diverse brands. Extensive empirical results collectively show that by utilizing our proposed noise formation model, a network can reach the capability as if it had been trained with rich real data, which demonstrates the effectiveness of our noise formation model.

...read moreread less

129 citations

Journal Article•DOI•

Noisy-as-Clean: Learning Self-Supervised Denoising From Corrupted Image

[...]

Jun Xu¹, Yuan Huang², Ming-Ming Cheng¹, Li Liu, Fan Zhu, Zhou Xu³, Ling Shao - Show less +3 more•Institutions (3)

Nankai University¹, Xi'an Jiaotong University², Chongqing University³

30 Sep 2020-IEEE Transactions on Image Processing

TL;DR: A novel “Noisy-As-Clean” (NAC) strategy of training self-supervised denoising networks, where the corrupted test image is directly taken as the “clean” target, while the inputs are synthetic images consisted of this corrupted image and a second yet similar corruption.

...read moreread less

Abstract: Supervised deep networks have achieved promising performance on image denoising, by learning image priors and noise statistics on plenty pairs of noisy and clean images. Unsupervised denoising networks are trained with only noisy images. However, for an unseen corrupted image, both supervised and unsupervised networks ignore either its particular image prior, the noise statistics, or both. That is, the networks learned from external images inherently suffer from a domain gap problem: the image priors and noise statistics are very different between the training and test images. This problem becomes more clear when dealing with the signal dependent realistic noise. To circumvent this problem, in this work, we propose a novel “Noisy-As-Clean” (NAC) strategy of training self-supervised denoising networks. Specifically, the corrupted test image is directly taken as the “clean” target, while the inputs are synthetic images consisted of this corrupted image and a second yet similar corruption. A simple but useful observation on our NAC is: as long as the noise is weak, it is feasible to learn a self-supervised network only with the corrupted image, approximating the optimal parameters of a supervised network learned with pairs of noisy and clean images. Experiments on synthetic and realistic noise removal demonstrate that, the DnCNN and ResNet networks trained with our self-supervised NAC strategy achieve comparable or better performance than the original ones and previous supervised/unsupervised/self-supervised networks. The code is publicly available at https://github.com/csjunxu/Noisy-As-Clean .

...read moreread less

109 citations

Journal Article•DOI•

A Novel Trilateration Algorithm for RSSI-Based Indoor Localization

[...]

Bo Yang¹, Luyao Guo¹, Ruijie Guo¹, Miaomiao Zhao¹, Tiantian Zhao¹ - Show less +1 more•Institutions (1)

Shanxi University¹

15 Jul 2020-IEEE Sensors Journal

TL;DR: A novel trilateration algorithm for indoor localization based on received signal strength indication (RSSI) based on the extreme value theory, which constructs a nonlinear error function depending on distances and anchor nodes position is proposed.

...read moreread less

Abstract: This paper proposed a novel trilateration algorithm for indoor localization based on received signal strength indication (RSSI). Firstly, all the raw measurement data are preprocessed by a Gaussian filter to reducing the influence of measurement noise. Secondly, the transmit power and the path loss exponent are estimated by a novel least-squares curve fitting (LSCF) method in the RSSI-based localization. Thirdly, a novel trilateration algorithm is proposed based on the extreme value theory, which constructs a nonlinear error function depending on distances and anchor nodes position. To minimize the function, a Taylor series approximation can be used for reduce the computational complexity. And, an iteration condition is designed to further improve the positioning accuracy. Afterward, Bayesian filtering is used to smoothing the localization error, and decrease the influence of the process noise. Both the simulation and experimental results demonstrate the effectiveness of the proposed methodology.

...read moreread less

95 citations

Journal Article•DOI•

DeepMMSE: A Deep Learning Approach to MMSE-Based Noise Power Spectral Density Estimation

[...]

Qiquan Zhang¹, Aaron Nicolson², Mingjiang Wang¹, Kuldip K. Paliwal², Chenxu Wang¹ - Show less +1 more•Institutions (2)

Harbin Institute of Technology¹, Griffith University²

14 Apr 2020-IEEE Transactions on Audio, Speech, and Language Processing

TL;DR: The proposed noise PSD tracker, called DeepMMSE makes no assumptions about the characteristics of the noise or the speech, exhibits no tracking delay, and produces an accurate estimate that requires no bias correction, and when employed in a speech enhancement framework is able to outperform state-of-the-art noise PSd trackers, as well as multiple deep learning approaches to speech enhancement.

...read moreread less

Abstract: An accurate noise power spectral density (PSD) tracker is an indispensable component of a single-channel speech enhancement system. Bayesian-motivated minimum mean-square error (MMSE)-based noise PSD estimators have been the most prominent in recent time. However, they lack the ability to track highly non-stationary noise sources due to current methods of a priori signal-to-noise (SNR) estimation. This is caused by the underlying assumption that the noise signal changes at a slower rate than the speech signal. As a result, MMSE-based noise PSD trackers exhibit a large tracking delay and produce noise PSD estimates that require bias compensation. Motivated by this, we propose an MMSE-based noise PSD tracker that employs a temporal convolutional network (TCN) a priori SNR estimator. The proposed noise PSD tracker, called DeepMMSE makes no assumptions about the characteristics of the noise or the speech, exhibits no tracking delay, and produces an accurate estimate that requires no bias correction. Our extensive experimental investigation shows that the proposed DeepMMSE method outperforms state-of-the-art noise PSD trackers and demonstrates the ability to track abrupt changes in the noise level. Furthermore, when employed in a speech enhancement framework, the proposed DeepMMSE method is able to outperform state-of-the-art noise PSD trackers, as well as multiple deep learning approaches to speech enhancement. Availability: DeepMMSE is available at: https://github.com/anicolson/DeepXi .

...read moreread less

88 citations

Journal Article•DOI•

Fixed-Time Control With Uncertainty and Measurement Noise Suppression for Hypersonic Vehicles via Augmented Sliding Mode Observers

[...]

Jinlin Sun¹, Zhiqiang Pu¹, Jianqiang Yi¹, Zhen Liu¹•Institutions (1)

Chinese Academy of Sciences¹

01 Feb 2020-IEEE Transactions on Industrial Informatics

TL;DR: This article proposes a novel fixed-time convergent nonsmooth backstepping control scheme for FAHV via augmented sliding mode observers (ASMOs) to overcome these obstacles of uncertainty and measurement noise.

...read moreread less

Abstract: Uncertainty and measurement noise are main obstacles that limit the tracking control performances of flexible air-breathing hypersonic vehicles (FAHVs). In this article, we propose a novel fixed-time convergent nonsmooth backstepping control scheme for FAHV via augmented sliding mode observers (ASMOs) to overcome these obstacles. The ASMOs are first designed for the FAHV dynamics by employing the measured states corrupted by noises as inputs. On one hand, the ASMOs can simultaneously estimate the uncertainties and filter out the measurement noises. On the other hand, the observation error of each ASMO can be convergent within a fixed time independent of its initial observation error. Then, based on the estimation results, the altitude and velocity tracking controllers are developed by using fixed-time nonsmooth backstepping technique. Afterwards, a Lyapunov-based stability analysis is given to illustrate the fixed-time convergence of the closed-loop signals of the FAHV control system. Finally, comparative simulations are conducted to illustrate the superiority of the proposed control scheme.

...read moreread less

70 citations

Journal Article•DOI•

Adaptive LADRC-Based Disturbance Rejection Method for Electromechanical Servo System

[...]

Chunqiang Liu¹, Guangzhao Luo¹, Xiaoli Duan, Zhe Chen¹, Zeliang Zhang¹, Cai Qiu¹ - Show less +2 more•Institutions (1)

Northwestern Polytechnical University¹

01 Jan 2020-IEEE Transactions on Industry Applications

TL;DR: This article proposes an adaptive linear active disturbance rejection control (LADRC) controller to achieve strong antidisturbance performance and reduce noise sensitivity for EMAs, and proposes a novel parallel structure to improve dynamic responses.

...read moreread less

Abstract: Electromechanical actuator (EMA) exhibits advanced performance in industry, but its dynamic servo responses are constrained by parametric perturbations, load torque variations, and measurement noise. A strong disturbance rejection ability is necessary for EMAs. However, this usually makes them more sensitive to the measurement noise, reducing the steady-state precision. In this article, an adaptive linear active disturbance rejection control (LADRC) controller is proposed to achieve strong antidisturbance performance and reduce noise sensitivity for EMAs. A novel parallel structure is proposed to improve dynamic responses, which replaces the traditional cascade structure of position and speed loops. Aiming to improve the antidisturbance performance, a linear full-order-extended state observer is integrated with the parallel controller, called the LADRC controller. To reduce the difficulty of parameter tuning, the number of tuning parameters of LADRC is reduced to two by a pole placement design. And these two parameters of LADRC can be adjusted adaptively by the hyperbolic tangent function. Finally, the simulation and experimental results are provided to verify the effectiveness of the proposed strategy for EMAs.

...read moreread less

70 citations

Proceedings Article•DOI•

Global-Local GCN: Large-Scale Label Noise Cleansing for Face Recognition

[...]

Yaobin Zhang¹, Weihong Deng¹, Mei Wang¹, Jiani Hu¹, Xian Li, Dongyue Zhao, Dongchao Wen - Show less +3 more•Institutions (1)

Beijing University of Posts and Telecommunications¹

14 Jun 2020

TL;DR: An effective automatic label noise cleansing framework for face recognition datasets, FaceGraph, which performs global-to-local discrimination to select useful data in a noisy environment and surpasses state-of-the-art performance on the IJB-C benchmark.

...read moreread less

Abstract: In the field of face recognition, large-scale web-collected datasets are essential for learning discriminative representations, but they suffer from noisy identity labels, such as outliers and label flips. It is beneficial to automatically cleanse their label noise for improving recognition accuracy. Unfortunately, existing cleansing methods cannot accurately identify noise in the wild. To solve this problem, we propose an effective automatic label noise cleansing framework for face recognition datasets, FaceGraph. Using two cascaded graph convolutional networks, FaceGraph performs global-to-local discrimination to select useful data in a noisy environment. Extensive experiments show that cleansing widely used datasets, such as CASIA-WebFace, VGGFace2, MegaFace2, and MS-Celeb-1M, using the proposed method can improve the recognition performance of state-of-the-art representation learning methods like Arcface. Further, we cleanse massive self-collected celebrity data, namely MillionCelebs, to provide 18.8M images of 636K identities. Training with the new data, Arcface surpasses state-of-the-art performance by a notable margin to reach 95.62% TPR at 1e-5 FPR on the IJB-C benchmark.

...read moreread less

60 citations

Proceedings Article•DOI•

Training Noise-Robust Deep Neural Networks via Meta-Learning

[...]

Zhen Wang¹, Guosheng Hu, Qinghua Hu¹•Institutions (1)

Tianjin University¹

14 Jun 2020

TL;DR: This work proposes a new loss correction approach, named as Meta Loss Correction (MLC), to directly learn T from data via the meta-learning framework, which is model-agnostic and learns T fromData rather than heuristically approximates it using prior knowledge.

...read moreread less

Abstract: Label noise may significantly degrade the performance of Deep Neural Networks (DNNs). To train noise-robust DNNs, Loss correction (LC) approaches have been introduced. LC approaches assume the noisy labels are corrupted from clean (ground-truth) labels by an unknown noise transition matrix T. The backbone DNNs and T can be trained separately, where T is approximated with prior knowledge. For example, T is constructed by stacking the maximum or mean predic- tions of the samples from each class. In this work, we pro- pose a new loss correction approach, named as Meta Loss Correction (MLC), to directly learn T from data via the meta-learning framework. The MLC is model-agnostic and learns T from data rather than heuristically approximates it using prior knowledge. Extensive evaluations are conducted on computer vision (MNIST, CIFAR-10, CIFAR-100, Cloth- ing1M) and natural language processing (Twitter) datasets. The experimental results show that MLC achieves very com- petitive performance against state-of-the-art approaches.

...read moreread less

Journal Article•DOI•

Noise2Inverse: Self-Supervised Deep Convolutional Denoising for Tomography

[...]

Allard A. Hendriksen¹, Daniël M. Pelt¹, K. Joost Batenburg¹•Institutions (1)

Centrum Wiskunde & Informatica¹

01 Jan 2020-IEEE Transactions on Computational Imaging

TL;DR: Noise2Inverse is proposed, a deep CNN-based denoising method for linear image reconstruction algorithms that does not require any additional clean or noisy data and demonstrates an improvement in peak signal-to-noise ratio and structural similarity index compared to state-of-the-art image Denoising methods, and conventional reconstruction methods, such as Total-Variation Minimization.

...read moreread less

Abstract: Recovering a high-quality image from noisy indirect measurements is an important problem with many applications. For such inverse problems, supervised deep convolutional neural network (CNN)-based denoising methods have shown strong results, but the success of these supervised methods critically depends on the availability of a high-quality training dataset of similar measurements. For image denoising, methods are available that enable training without a separate training dataset by assuming that the noise in two different pixels is uncorrelated. However, this assumption does not hold for inverse problems, resulting in artifacts in the denoised images produced by existing methods. Here, we propose Noise2Inverse, a deep CNN-based denoising method for linear image reconstruction algorithms that does not require any additional clean or noisy data. Training a CNN-based denoiser is enabled by exploiting the noise model to compute multiple statistically independent reconstructions. We develop a theoretical framework which shows that such training indeed obtains a denoising CNN, assuming the measured noise is element-wise independent, and zero-mean. On simulated CT datasets, Noise2Inverse demonstrates an improvement in peak signal-to-noise ratio and structural similarity index compared to state-of-the-art image denoising methods, and conventional reconstruction methods, such as Total-Variation Minimization. We also demonstrate that the method is able to significantly reduce noise in challenging real-world experimental datasets.

...read moreread less

Journal Article•DOI•

Dynamic Event-Triggered Fault Detection via Zonotopic Residual Evaluation and Its Application to Vehicle Lateral Dynamics

[...]

Xudong Wang¹, Zhongyang Fei¹, Huaicheng Yan², Yinliang Xu³•Institutions (3)

Harbin Institute of Technology¹, East China University of Science and Technology², Tsinghua University³

06 Feb 2020-IEEE Transactions on Industrial Informatics

TL;DR: An optimal event-triggered fault detection observer (FDO) design criterion is proposed such that the generated residual is sensitive to system faults while robust against exogenous disturbance and measurement noise.

...read moreread less

Abstract: This article is concerned with the event-triggered fault detection problem for discrete-time systems subject to unknown-but-bounded (UBB) process disturbance and measurement noise via zonotope-based residual evaluation. To save communication resources, a novel discrete-time dynamic event-triggered mechanism is proposed. An optimal event-triggered $l_1 /H_{\infty }$ fault detection observer (FDO) design criterion is proposed such that the generated residual is sensitive to system faults while robust against exogenous disturbance and measurement noise. On the basis of the designed FDO, a zonotope-based dynamic threshold for residual evaluation is well constructed by considering the impacts of disturbance, noise, and event-triggered communication. Finally, a vehicle lateral dynamic system is adopted to illustrate the effectiveness of the proposed zonotope-based dynamic event-triggered fault detection mechanism.

...read moreread less

Journal Article•DOI•

A Model-Based Approach for Measurement Noise Estimation and Compensation in Feedback Control Systems

[...]

Yang Zhu¹, Bo Zhu², Hugh H. T. Liu¹, Kaiyu Qin²•Institutions (2)

University of Toronto¹, University of Electronic Science and Technology of China²

29 Apr 2020-IEEE Transactions on Instrumentation and Measurement

TL;DR: This article considers the problem of measurement noise rejection in a linear output-feedback control system and proposes a novel noise estimator (NE)-based robust control solution, which takes into account not only the rejection of high-frequency stochastic noises but also the compensation for low-frequency measurement errors, such as bias and drift.

...read moreread less

Abstract: This article considers the problem of measurement noise rejection in a linear output-feedback control system. Specifically, we take into account not only the rejection of high-frequency stochastic noises but also the compensation for low-frequency measurement errors, such as bias and drift, which cannot be well-handled by the classic frequency-domain filters or Kalman filters. A novel noise estimator (NE)-based robust control solution is proposed. The NE is designed in the frequency domain by exploiting the system model and control structure information and is embedded into the controller instead of being an independent functional module in the closed-loop system. The adverse effects of model uncertainties on the performance of the NE-based solution are investigated, and an improved solution is proposed by incorporating a simple low-pass filter as the prefilter of NE. This solution is applied to the angle tracking problem of a 2-DOF experimental helicopter platform equipped with a low-cost and low-accuracy microelectromechanical system (MEMS) inertial measurement unit (IMU) (MEMS IMU) for angular position/rate measurements. Both numerical simulation and experimental comparisons with other existing approaches demonstrate: 1) constant bias and time-varying drift in the IMU measurements are systematically addressed by the solution; 2) it is accessible to improve the steady-state tracking accuracy by tuning the parameter of NE to extend its bandwidth; and 3) when model uncertainties limit the feasible bandwidth of NE, the improved solution is able to largely maintain its noise rejection performance.

...read moreread less

Journal Article•DOI•

State Estimation for Stochastic Time-Varying Boolean Networks

[...]

Hongwei Chen¹, Zidong Wang², Jinling Liang³, Maozhen Li²•Institutions (3)

Donghua University¹, Brunel University London², Southeast University³

13 Feb 2020-IEEE Transactions on Automatic Control

TL;DR: A Boolean Bayesian filter is designed that can be utilized to provide the minimum MSE state estimate for the STVBNs and a recursive matrix-based algorithm is obtained to calculate the one-step prediction and estimation of the forward–backward state probability distribution vectors.

...read moreread less

Abstract: In this article, a general theoretical framework is developed for the state estimation problem of stochastic time-varying Boolean networks (STVBNs). The STVBN consists of a system model describing the evolution of the Boolean states and a model relating the noisy measurements to the Boolean states. Both the process noise and the measurement noise are characterized by sequences of mutually independent Bernoulli distributed stochastic variables taking values of 1 or 0, which imply that the state/measurement variables may be flipped with certain probabilities. First, an algebraic representation of the STVBNs is derived based on the semitensor product. Then, based on Bayes’ theorem, a recursive matrix-based algorithm is obtained to calculate the one-step prediction and estimation of the forward–backward state probability distribution vectors. Owing to the Boolean nature of the state variables, the Boolean Bayesian filter is designed that can be utilized to provide the minimum MSE state estimate for the STVBNs. The fixed-interval smoothing filter is also obtained by resorting to the forward–backward technique. Finally, a simulation experiment is carried out for the context estimation problem of the $p53$ - $MDM2$ negative-feedback gene regulatory network.

...read moreread less

Journal Article•DOI•

An Improved Variable Kernel Width for Maximum Correntropy Criterion Algorithm

[...]

Long Shi¹, Haiquan Zhao¹, Yuriy Zakharov²•Institutions (2)

Southwest Jiaotong University¹, University of York²

01 Jul 2020-IEEE Transactions on Circuits and Systems Ii-express Briefs

TL;DR: This brief proposes an improved variable kernel width MCC algorithm, which is derived by minimizing the squared deviation at each iteration, and designs a reset mechanism for the proposed algorithm to improve its tracking capability when the estimated vector encounters a sudden change.

...read moreread less

Abstract: The maximum correntropy criterion (MCC) algorithm has attracted much attention due to its capability of combating impulsive noise. However, its performance depends on choice of the kernel width, which is a hard issue. Several variable kernel width schemes based on various error functions have been proposed to address this problem. Nevertheless, these methods may not provide an optimal kernel width because they do not contain any knowledge of the background noise that actually has influence on the optimization of the kernel width. This brief proposes an improved variable kernel width MCC algorithm, which is derived by minimizing the squared deviation at each iteration. We also design a reset mechanism for the proposed algorithm to improve its tracking capability when the estimated vector encounters a sudden change. Simulations for system identification and echo cancellation scenarios show that the proposed scheme outperforms other variable kernel width algorithms.

...read moreread less

Journal Article•DOI•

A 22nm, 10.8 μ W/15.1 μ W Dual Computing Modes High Power-Performance-Area Efficiency Domained Background Noise Aware Keyword- Spotting Processor

[...]

Bo Liu¹, Hao Cai¹, Zhen Wang, Sun Yuhao¹, Shen Zeyu¹, Wentao Zhu¹, Li Yan¹, Yu Gong¹, Ge Wei¹, Jun Yang¹, Longxing Shi¹ - Show less +7 more•Institutions (1)

Southeast University¹

02 Jun 2020-IEEE Transactions on Circuits and Systems

TL;DR: This paper proposes a high power-performance-area efficient background noise aware keyword-spotting (KWS) processor based on an optimized binarized weight network (BWN) processor with adaptively configured to use dual computing modes for both high recognition accuracy under high background noise and ultra-low power consumption under low background noise.

...read moreread less

Abstract: This paper proposes a high power-performance-area efficient background noise aware keyword-spotting (KWS) processor based on an optimized binarized weight network (BWN). To reduce the power consumption while maintaining the system recognition accuracy for different background noise, the KWS processor with a SNR prediction module can be adaptively configured to use dual computing modes (standard computing mode and approximate computing mode) for both high recognition accuracy under high background noise and ultra-low power consumption under low background noise. The mel-scale frequency cepstral coefficients (MFCC) module is optimized with approximate computing technologies, which can reduce the power consumption by up to $3.1\times $ and $5.7\times $ for high/low background noise, respectively. Based on the evaluation of the architecture design space exploration, an ultra-low power BWN accelerator with low voltage, area and leakage power and using precision self-adaptive approximate computing units was proposed. Evaluated under 22nm process technology, this work can support up to 10 keywords real time recognition with power consumption of $15.1~\mu \text{W}$ for high background noise and $10.8~\mu \text{W}$ for low background noise. Compared to the state-of-the-art KWS architectures, our work can achieve ultra-low power consumption (about $1.7\times $ reduced), while maintaining high system capability and adaptability.

...read moreread less

Journal Article•DOI•

Audio-Visual Speech Separation and Dereverberation With a Two-Stage Multimodal Network

[...]

Ke Tan¹, Yong Xu², Shi-Xiong Zhang², Meng Yu², Dong Yu² - Show less +1 more•Institutions (2)

Ohio State University¹, Tencent²

16 Apr 2020-IEEE Journal of Selected Topics in Signal Processing

TL;DR: This study addresses joint speech separation and dereverberation, which aims to separate target speech from background noise, interfering speech and room reverberation, and proposes a novel multimodal network that exploits both audio and visual signals.

...read moreread less

Abstract: Background noise, interfering speech and room reverberation frequently distort target speech in real listening environments. In this study, we address joint speech separation and dereverberation, which aims to separate target speech from background noise, interfering speech and room reverberation. In order to tackle this fundamentally difficult problem, we propose a novel multimodal network that exploits both audio and visual signals. The proposed network architecture adopts a two-stage strategy, where a separation module is employed to attenuate background noise and interfering speech in the first stage and a dereverberation module to suppress room reverberation in the second stage. The two modules are first trained separately, and then integrated for joint training, which is based on a new multi-objective loss function. Our experimental results show that the proposed multimodal network yields consistently better objective intelligibility and perceptual quality than several one-stage and two-stage baselines. We find that our network achieves a 21.10% improvement in ESTOI and a 0.79 improvement in PESQ over the unprocessed mixtures. Moreover, our network architecture does not require the knowledge of the number of speakers.

...read moreread less

Journal Article•DOI•

Partial Discharge Signal Denoising Based on Singular Value Decomposition and Empirical Wavelet Transform

[...]

Jun Zhong¹, Xiaowen Bi¹, Qin Shu¹, Minwei Chen², Dianbo Zhou², Dakun Zhang - Show less +2 more•Institutions (2)

Sichuan University¹, Electric Power Research Institute²

22 May 2020-IEEE Transactions on Instrumentation and Measurement

TL;DR: This method uses traditionally singular value transform [singular value decomposition (SVD)] to reconstruct narrowband interference and remove it and the PD signal is obtained by time-domain denoising.

...read moreread less

Abstract: Online partial discharge (PD) monitoring is an important means to detect insulation deterioration. However, it is difficult to extract the PD signal due to various interferences in the field. Noisy PD signal is used to judge the status of insulation, which would affect the conclusion; therefore, denoising PD signal is a major task in online PD monitoring. Common methods for PD denoising include the empirical mode decomposition (EMD) and wavelet transform; however, the denoising results are highly dependent on the modal aliasing, the selection of mother wavelets, and decomposition levels. This article proposes a method to solve these problems. This method uses traditionally singular value transform [singular value decomposition (SVD)] to reconstruct narrowband interference and remove it. Next, the empirical wavelet transform (EWT) is carried out for the PD signal that has residual white noise. Then, the noisy signal is decomposed into several modes corresponding to each spectrum segment. The $3~\sigma $ principle is used to denoise the modes with large kurtosis, and the modes are combined into a reference signal. The start-end positions of PD signal are then obtained from the reference signal. Finally, the PD signal is obtained by time-domain denoising. The results from both simulated and actual field detection signals show the excellent performance of this method.

...read moreread less

Journal Article•DOI•

Analysis of White Noise on Power Frequency Estimation by DFT-Based Frequency Shifting and Filtering Algorithm

[...]

Junhao Zhang¹, Lu Tang¹, Alessandro Mingotti², Lorenzo Peretto², He Wen¹ - Show less +1 more•Institutions (2)

Hunan University¹, University of Bologna²

01 Jul 2020-IEEE Transactions on Instrumentation and Measurement

TL;DR: The influence of a stationary white noise on FSF-based frequency estimation of the power system is investigated and the variance expression of the frequency estimator is derived theoretically and compared to its unbiased Cramer–Rao lower bound (CRLB).

...read moreread less

Abstract: The frequency shifting and filtering (FSF) algorithm, a variant of DFT, has the merit of high efficiency for frequency analysis thanks to its simple implementation in the time domain. However, the inevitable white noise injected by various factors leads to inaccurate frequency estimation in practical measurement. This article investigates the influence of a stationary white noise on FSF-based frequency estimation of the power system. The variance expression of the frequency estimator is derived theoretically and compared to its unbiased Cramer–Rao lower bound (CRLB). The obtained results are validated by several computer simulations.

...read moreread less

Journal Article•DOI•

A Slide Window Variational Adaptive Kalman Filter

[...]

Yulong Huang¹, Fengchi Zhu¹, Guangle Jia¹, Yonggang Zhang¹•Institutions (1)

Harbin Engineering University¹

19 May 2020-IEEE Transactions on Circuits and Systems Ii-express Briefs

TL;DR: A slide window variational adaptive Kalman filter is presented in this brief based on adaptive learning of inaccurate state and measurement noise covariance matrices, which is composed of the forward Kalman filtering, the backward Kalman smoothing, and the online estimates of noise covariant matrices.

...read moreread less

Abstract: A slide window variational adaptive Kalman filter is presented in this brief based on adaptive learning of inaccurate state and measurement noise covariance matrices, which is composed of the forward Kalman filtering, the backward Kalman smoothing, and the online estimates of noise covariance matrices. By imposing an approximation on the smoothing posterior distribution of slide window state vectors, the posterior distributions of noise covariance matrices can be analytically updated as inverse Wishart distributions by exploiting the variational Bayesian method, which avoids the fixed-point iterations and achieves good computational efficiency. Simulation comparisons demonstrate that the proposed method has better filtering accuracy and consistency than the existing cutting-edge method.

...read moreread less

Journal Article•DOI•

New Suppression Technology for Low-Frequency Noise in Desert Region: The Improved Robust Principal Component Analysis Based on Prediction of Neural Network

[...]

Xintong Dong¹, Tie Zhong², Yue Li¹•Institutions (2)

Jilin University¹, Electric Power University²

28 Jan 2020-IEEE Transactions on Geoscience and Remote Sensing

TL;DR: To approach real desert seismic data, a variety of seismic wavelets are used to simulate different types of seismic events, and then these synthetic seismic events and real desert low-frequency noise to construct training set are used.

...read moreread less

Abstract: Lots of low-frequency noise including random noise and surface waves seriously reduces the quality of desert seismic data. However, the suppression for desert low-frequency noise faces three main problems: nonstationary and non-Gaussian of random noise; strong energy of low-frequency noise; a more serious frequency-band overlap between effective signals and low-frequency noise. Robust principal component analysis (RPCA) is a classical low-rank matrix (LM) recovery method which is very suitable for processing nonlinear noise. It can decompose noisy data to the optimal LM and sparse matrix (SM), which include most effective signals and noise, respectively. Therefore, the RPCA is introduced to suppress desert low-frequency noise. However, due to the low signal-to-noise ratio (SNR) and serious frequency-band overlap, much low-frequency noise still remains in the LM of desert seismic data after the decomposition of RPCA. Meanwhile, some nonnegligible effective signals are decomposed into the SM of desert seismic data. To solve this problem, the convolutional neural network (CNN) is introduced to extract effective signals from SM and LM. By constructing suitable training sets to guide the CNN’s training, the CNN denoising models after training are used to predict the effective signals from these two matrices, respectively. In this article, to approach real desert seismic data, we use a variety of seismic wavelets to simulate different types of seismic events, and then use these synthetic seismic events and real desert low-frequency noise to construct training set. In experiments, our method can raise the SNR of synthetic noisy data from −8.69 to 9.63 dB.

...read moreread less

Journal Article•DOI•

Audio-Visual Speech Enhancement Using Conditional Variational Auto-Encoders

[...]

Mostafa Sadeghi¹, Simon Leglaive¹, Xavier Alameda-Pineda¹, Laurent Girin¹, Radu Horaud¹ - Show less +1 more•Institutions (1)

French Institute for Research in Computer Science and Automation¹

30 May 2020-IEEE Transactions on Audio, Speech, and Language Processing

TL;DR: This article develops a conditional VAE (CVAE) where the audio speech generative process is conditioned on visual information of the lip region, and it improves the speech enhancement performance compared with the audio-only VAE model.

...read moreread less

Abstract: Variational auto-encoders (VAEs) are deep generative latent variable models that can be used for learning the distribution of complex data. VAEs have been successfully used to learn a probabilistic prior over speech signals, which is then used to perform speech enhancement. One advantage of this generative approach is that it does not require pairs of clean and noisy speech signals at training. In this article, we propose audio-visual variants of VAEs for single-channel and speaker-independent speech enhancement. We develop a conditional VAE (CVAE) where the audio speech generative process is conditioned on visual information of the lip region. At test time, the audio-visual speech generative model is combined with a noise model based on nonnegative matrix factorization, and speech enhancement relies on a Monte Carlo expectation-maximization algorithm. Experiments are conducted with the recently published NTCD-TIMIT dataset as well as the GRID corpus. The results confirm that the proposed audio-visual CVAE effectively fuses audio and visual information, and it improves the speech enhancement performance compared with the audio-only VAE model, especially when the speech signal is highly corrupted by noise. We also show that the proposed unsupervised audio-visual speech enhancement approach outperforms a state-of-the-art supervised deep learning method.

...read moreread less

Journal Article•DOI•

A Low-Power 1-V Supply Dynamic Comparator

[...]

Subhash Chevella¹, Daniel O'Hare¹, Ivan O'Connell¹•Institutions (1)

Tyndall National Institute¹

15 Jul 2020

TL;DR: In this article, a low-power dynamic comparator for ultralow power applications is presented, which uses cross-coupled devices to prevent the comparator internal nodes from fully discharging to ground in contrast to the conventional architecture.

...read moreread less

Abstract: This letter presents a low-power dynamic comparator for ultralow power applications. The prototype is designed in a 65-nm CMOS process with a supply voltage of 1 V and is compared against the widely used double tail latch comparator in terms of power consumption and input referred rms noise. The addition of cross-coupled devices to the input differential pair prevents the comparator internal nodes from fully discharging to ground in contrast to the conventional architecture. This reduces the power consumption while achieving similar noise levels. Measurements demonstrate that the proposed comparator achieves an input referred rms noise voltage of 220 $\mu \text{V}$ against 210 $\mu \text{V}$ for the conventional comparator with a 30% reduction in power. The proposed circuit consumes 0.19-pJ energy per comparison.

...read moreread less

Journal Article•DOI•

Combined Weighted Method for TDOA-Based Localization

[...]

Shuai Cao¹, Xiang Chen¹, Xu Zhang¹, Xun Chen¹•Institutions (1)

University of Science and Technology of China¹

01 May 2020-IEEE Transactions on Instrumentation and Measurement

TL;DR: The experimental results show that the proposed COM-W method is a closed-form and computationally efficient method that has an excellent positioning performance under both narrow and square geometric arrangements, specifically in the central cross-shaped area and in the vicinity of the nodes.

...read moreread less

Abstract: A novel and practical combined weighted (COM-W) method for target localization based on time difference of arrival (TDOA) is proposed in this paper. For 2-D situations with more than three nodes, the main idea of the proposed method is to obtain multiple preliminary localization results corresponding to the different three-node combinations first, and then use the weighted averaging technique based on the Cramer–Rao lower bound (CRLB) to obtain the final estimation. Considering target localization with a set of fixed-positioning nodes employing TDOA measurements as an example, the performance of the proposed method is evaluated by both simulations and acoustic indoor positioning experiments. The experimental results show that the COM-W method is a closed-form and computationally efficient method. It has an excellent positioning performance under both narrow and square geometric arrangements, specifically in the central cross-shaped area and in the vicinity of the nodes. Compared to the three classic weighted linear least squares (WLS)-based positioning methods, the proposed COM-W method is less sensitive to the measurement noise and the NLOS environment. The proposed method has an important application value for the realization of a TDOA-based indoor target positioning system.

...read moreread less

Journal Article•DOI•

Discrete-Time Implementation of Homogeneous Differentiators

[...]

Stefan Koch¹, Markus Reichhartinger¹, Martin Horn¹, Leonid Fridman²•Institutions (2)

Graz University of Technology¹, National Autonomous University of Mexico²

01 Feb 2020-IEEE Transactions on Automatic Control

TL;DR: The discrete-time version of Levant's arbitrary order robust exact differentiator is extended by taking into account also nonlinear higher order terms, which preserves the asymptotic accuracies with respect to sampling and noise known from the continuous-time algorithm.

...read moreread less

Abstract: The discrete-time version of Levant's arbitrary order robust exact differentiator, which is a forward Euler discretized version of the continuous-time algorithm enhanced by linear higher order terms, is extended by taking into account also nonlinear higher order terms. The resulting differentiator preserves the asymptotic accuracies with respect to sampling and noise known from the continuous-time algorithm. It is demonstrated in a simulation example and by differentiating a measured signal that the nonlinear higher order terms allow reducing the high-frequency switching amplitude whenever the $(n+1)$ th derivative of the signal to be differentiated vanishes, leading to an improvement in the precision.

...read moreread less

Journal Article•DOI•

A Parallel-Data-Free Speech Enhancement Method Using Multi-Objective Learning Cycle-Consistent Generative Adversarial Network

[...]

Yang Xiang¹, Changchun Bao¹•Institutions (1)

Beijing University of Technology¹

29 May 2020-IEEE Transactions on Audio, Speech, and Language Processing

TL;DR: A novel parallel-data-free speech enhancement method, in which the cycle-consistent generative adversarial network (CycleGAN) and multi-objective learning are employed, which is effective to improve speech quality and intelligibility when the networks are trained under the parallel data.

...read moreread less

Abstract: Recently, deep neural networks (DNNs) have become the mainstream strategy for speech enhancement task because it can achieve the higher speech quality and intelligibility than the traditional methods. However, these DNN-based methods always need a large number of parallel corpus consisting of clean speech and noise to produce noisy data for the training of the DNN in order to improve the generalization of the network. As a result, this implies that many noisy speech signals that are collected in real environment cannot be used to train the DNN because of the lack of corresponding clean speech and noise. Additionally, as we know, noise varies with the time and scenario, so we cannot obtain parallel speech and noise due to infinite noise data and some limited speech data. Thus, the network training with unparallel speech and noise data is essential for the generalization of the network. To address this problem, we propose a novel parallel-data-free speech enhancement method, in which the cycle-consistent generative adversarial network (CycleGAN) and multi-objective learning are employed. Our method is also able to make best use of the benefits of multi-objective learning. On the training stage, we utilize two different encoders to encode the features of clean speech and noisy speech, respectively. Then, two forward generators are immediately used to predict the ideal time-frequency (T-F) mask and log-power spectrum (LPS) of clean speech. Two inverse generators are applied to map the magnitude spectrum (MS) and LPS of noisy speech, respectively. In addition, four discriminators are used to distinguish the real speech features from the generated features. Two encoders, four generators and four discriminators are simultaneously trained by using adversarial, identity-mapping, latent similarity and cycle-consistent loss. On the test stage, we directly utilize the forward generators and encoders to acquire the enhanced speech. The experimental results indicate that the proposed approach is able to achieve the better speech enhancement performance than the reference methods. Moreover, the proposed method is also effective to improve speech quality and intelligibility when the networks are trained under the parallel data.

...read moreread less

Journal Article•DOI•

Min-Max Average Pooling Based Filter for Impulse Noise Removal

[...]

Piyush Satti¹, Nikhil Sharma¹, Bharat Garg¹•Institutions (1)

Thapar University¹

17 Aug 2020-IEEE Signal Processing Letters

TL;DR: Results show that the proposed filter significantly improves edges over exiting literature, and Peak Signal to Noise Ratio was improved by 1.2 dB in de-noising of medical images corrupted by medium to high noise densities.

...read moreread less

Abstract: Image corruption is a common phenomenon which occurs due to electromagnetic interference, and electric signal instabilities in a system. In this letter, a novel multi procedure Min-Max Average Pooling based Filter is proposed for removal of salt, and pepper noise that betide during transmission. The first procedure functions as a pre-processing step that activates for images with low noise corruption. In latter procedure, the noisy image is divided into two instances, and passed through multiple layers of max, and min pooling which allow restoration of intensity transitions in an image. The final procedure recombines the parallel processed images from the previous procedures, and performs average pooling to remove all residual noise. Experimental results were obtained using MATLAB software, and show that the proposed filter significantly improves edges over exiting literature. Moreover, Peak Signal to Noise Ratio was improved by 1.2 dB in de-noising of medical images corrupted by medium to high noise densities.

...read moreread less

Journal Article•DOI•

Extended-State-Observer-Based Distributed Robust Secondary Voltage and Frequency Control for an Autonomous Microgrid

[...]

Pudong Ge¹, Xiaobo Dou¹, Xiangjun Quan¹, Qinran Hu¹, Sheng Wanxing², Zaijun Wu¹, Wei Gu¹ - Show less +3 more•Institutions (2)

Southeast University¹, Electric Power Research Institute²

01 Jan 2020-IEEE Transactions on Sustainable Energy

TL;DR: An extended state observer is employed to estimate the accurate state information of each DG, which is significantly influenced by measurement noise, and the distributed controllers based on a fast terminal sliding mode surface and an adaptive super-twisting algorithm are designed to track the voltage reference and to fasten the convergence rate against disturbances and uncertainties caused by parameter perturbation.

...read moreread less

Abstract: This paper proposes an extended-state-observer-based distributed robust secondary voltage and frequency control for an autonomous microgrid (MG) with inverter-based distributed generators (DGs) considering the uncertainties from models and measurement noise. The MG is considered as a multi-agent system where each DG is defined as an agent and its controller only requires its own information and the information of its neighbors, but each DG obtains noisy measurements of the states of itself and its neighbors easily due to stochastic noise. Therefore, in this paper, an extended state observer is employed to estimate the accurate state information of each DG, which is significantly influenced by measurement noise. Furthermore, the distributed controllers based on a fast terminal sliding mode surface and an adaptive super-twisting algorithm are designed to track the voltage reference and to fasten the convergence rate against disturbances and uncertainties caused by parameter perturbation. Moreover, the distributed frequency controllers are also designed to restore the frequency and to guarantee the accurate active power sharing without power information of DGs. Finally, the effectiveness of the propose control strategy is illustrated by the simulation of an autonomous MG in MATLAB/Simulink.

...read moreread less

Journal Article•DOI•

Noisy Non-Adaptive Group Testing: A (Near-)Definite Defectives Approach

[...]

Jonathan Scarlett¹, Oliver Johnson²•Institutions (2)

National University of Singapore¹, University of Bristol²

29 Jan 2020-IEEE Transactions on Information Theory

TL;DR: In this paper, the authors study the noisy version of the group testing problem, where the outcome of each standard noiseless group test is subject to independent noise, corresponding to passing the result through a binary channel.

...read moreread less

Abstract: The group testing problem consists of determining a small set of defective items from a larger set of items based on a number of possibly-noisy tests, and is relevant in applications such as medical testing, communication protocols, pattern matching, and more. We study the noisy version of this problem, where the outcome of each standard noiseless group test is subject to independent noise, corresponding to passing the noiseless result through a binary channel. We introduce a class of algorithms that we refer to as Near-Definite Defectives (NDD), and study bounds on the required number of tests for asymptotically vanishing error probability under Bernoulli random test designs. In addition, we study algorithm-independent converse results, giving lower bounds on the required number of tests under Bernoulli test designs. Under reverse Z-channel noise, the achievable rates and converse results match in a broad range of sparsity regimes, and under Z-channel noise, the two match in a narrower range of dense/low-noise regimes. We observe that although these two channels have the same Shannon capacity when viewed as a communication channel, they can behave quite differently when it comes to group testing. Finally, we extend our analysis of these noise models to a general binary noise model (including symmetric noise), and show improvements over known existing bounds in broad scaling regimes.

...read moreread less

Collapse