scispace - formally typeset
Open AccessJournal ArticleDOI

Dominant Data Set Selection Algorithms for Electricity Consumption Time-Series Data Analysis Based on Affine Transformation

TLDR
In this paper, the authors proposed a cohort of dominant data set selection algorithms for electricity consumption TSD with a focus on discriminating the dominant data sets that is a small data set but capable of representing the kernel information carried by TSD.
Abstract
In the explosive growth of time-series data (TSD), the scale of TSD suggests that the scale and capability of many Internet of Things (IoT)-based applications has already been exceeded. Moreover, redundancy persists in TSD due to the correlation between information acquired via different sources. In this article, we propose a cohort of dominant data set selection algorithms for electricity consumption TSD with a focus on discriminating the dominant data set that is a small data set but capable of representing the kernel information carried by TSD with an arbitrarily small error rate less than $\varepsilon $ . Furthermore, we prove that the selection problem of the minimum dominant data set is an NP-complete problem. The affine transformation model is introduced to define the linear correlation relationship between TSD objects. Our proposed framework consists of the scanning selection algorithm with $O({n^{3}})$ time complexity and the greedy selection algorithm with $O({n^{4}})$ time complexity, which are, respectively, proposed to select the dominant data set based on the linear correlation distance between TSD objects. The proposed algorithms are evaluated on the real electricity consumption data of Harbin city in China. The experimental results show that the proposed algorithms not only reduce the size of the extracted kernel data set but also ensure the TSD integrity in terms of accuracy and efficiency.

read more

Citations
More filters
Journal ArticleDOI

A Secure Federated Learning Framework for 5G Networks

TL;DR: A blockchain-based secure FL framework to create smart contracts and prevent malicious or unreliable participants from being involved in FL is proposed, which can effectively deter poisoning and membership inference attacks, thereby improving the security of FL in 5G networks.
Journal ArticleDOI

Deep Anomaly Detection for Time-series Data in Industrial IoT: A Communication-Efficient On-device Federated Learning Approach

TL;DR: A new communication-efficient on-device federated learning (FL)-based deep anomaly detection framework for sensing time-series data in IIoT and an attention mechanism-based convolutional neural network-long short-term memory (AMCNN-LSTM) model to accurately detect anomalies is proposed.
Journal ArticleDOI

Federated Learning in the Sky: Aerial-Ground Air Quality Sensing Framework with UAV Swarms

TL;DR: A new federated learning (FL)-based aerial-ground air quality sensing framework for fine-grained 3-D air quality monitoring and forecasting that leverages a lightweight Dense-MobileNet model to achieve energy-efficient end-to-end learning from haze features of haze images taken by unmanned aerial vehicles (UAVs) for predicting AQI scale distribution.
Journal ArticleDOI

URLLC for 5G and Beyond: Requirements, Enabling Incumbent Technologies and Network Intelligence

TL;DR: In this article, the authors present a possibility to use the federated reinforcement learning (FRL) technique, which is one of the ML techniques, for 5G NR URLLC requirements and summarizes the corresponding achievements.
References
More filters
Book

Computers and Intractability: A Guide to the Theory of NP-Completeness

TL;DR: The second edition of a quarterly column as discussed by the authors provides a continuing update to the list of problems (NP-complete and harder) presented by M. R. Garey and myself in our book "Computers and Intractability: A Guide to the Theory of NP-Completeness,” W. H. Freeman & Co., San Francisco, 1979.
Journal ArticleDOI

Extended Bayesian information criteria for model selection with large model spaces

TL;DR: This paper re-examine the Bayesian paradigm for model selection and proposes an extended family of Bayesian information criteria, which take into account both the number of unknown parameters and the complexity of the model space.
Journal Article

A Selective Overview of Variable Selection in High Dimensional Feature Space.

TL;DR: In this paper, a brief account of the recent developments of theory, methods, and implementations for high-dimensional variable selection is presented, with emphasis on independence screening and two-scale methods.
Book ChapterDOI

StatStream: statistical monitoring of thousands of data streams in real time

TL;DR: In this article, the authors proposed an algorithm based on Discrete Fourier Transform (DFT) and a three level time interval hierarchy to find high correlations among all pairs of streams.
Related Papers (5)