Dominant Data Set Selection Algorithms for Electricity Consumption Time-Series Data Analysis Based on Affine Transformation

doi:10.1109/JIOT.2019.2946753

Open AccessJournal ArticleDOI

Dominant Data Set Selection Algorithms for Electricity Consumption Time-Series Data Analysis Based on Affine Transformation

Yi Wu, +4 more

- 01 May 2020 -

IEEE Internet of Things Journal

- Vol. 7, Iss: 5, pp 4347-4360

TLDR

In this paper, the authors proposed a cohort of dominant data set selection algorithms for electricity consumption TSD with a focus on discriminating the dominant data sets that is a small data set but capable of representing the kernel information carried by TSD.

Abstract:

In the explosive growth of time-series data (TSD), the scale of TSD suggests that the scale and capability of many Internet of Things (IoT)-based applications has already been exceeded. Moreover, redundancy persists in TSD due to the correlation between information acquired via different sources. In this article, we propose a cohort of dominant data set selection algorithms for electricity consumption TSD with a focus on discriminating the dominant data set that is a small data set but capable of representing the kernel information carried by TSD with an arbitrarily small error rate less than $\varepsilon $ . Furthermore, we prove that the selection problem of the minimum dominant data set is an NP-complete problem. The affine transformation model is introduced to define the linear correlation relationship between TSD objects. Our proposed framework consists of the scanning selection algorithm with $O({n^{3}})$ time complexity and the greedy selection algorithm with $O({n^{4}})$ time complexity, which are, respectively, proposed to select the dominant data set based on the linear correlation distance between TSD objects. The proposed algorithms are evaluated on the real electricity consumption data of Harbin city in China. The experimental results show that the proposed algorithms not only reduce the size of the extracted kernel data set but also ensure the TSD integrity in terms of accuracy and efficiency.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Deep Anomaly Detection for Time-Series Data in Industrial IoT: A Communication-Efficient On-Device Federated Learning Approach

Yi Liu, +6 more

- 15 Apr 2021 -

IEEE Internet of Things Journal

TL;DR: In this article, the authors proposed an attention mechanism-based convolutional neural network-long short-term memory (AMCNN-LSTM) model to accurately detect anomalies.

...read moreread less

Journal ArticleDOI

A Secure Federated Learning Framework for 5G Networks

Yi Liu, +5 more

- 12 May 2020 -

arXiv: Cryptography and Security

TL;DR: A blockchain-based secure FL framework to create smart contracts and prevent malicious or unreliable participants from being involved in FL is proposed, which can effectively deter poisoning and membership inference attacks, thereby improving the security of FL in 5G networks.

...read moreread less

Journal ArticleDOI

Deep Anomaly Detection for Time-series Data in Industrial IoT: A Communication-Efficient On-device Federated Learning Approach

Yi Liu, +6 more

- 19 Jul 2020 -

arXiv: Learning

TL;DR: A new communication-efficient on-device federated learning (FL)-based deep anomaly detection framework for sensing time-series data in IIoT and an attention mechanism-based convolutional neural network-long short-term memory (AMCNN-LSTM) model to accurately detect anomalies is proposed.

...read moreread less

Journal ArticleDOI

Federated Learning in the Sky: Aerial-Ground Air Quality Sensing Framework with UAV Swarms

Yi Liu, +5 more

- 23 Jul 2020 -

arXiv: Signal Processing

TL;DR: A new federated learning (FL)-based aerial-ground air quality sensing framework for fine-grained 3-D air quality monitoring and forecasting that leverages a lightweight Dense-MobileNet model to achieve energy-efficient end-to-end learning from haze features of haze images taken by unmanned aerial vehicles (UAVs) for predicting AQI scale distribution.

...read moreread less

Journal ArticleDOI

URLLC for 5G and Beyond: Requirements, Enabling Incumbent Technologies and Network Intelligence

Rashid Ali, +4 more

- 16 Apr 2021 -

IEEE Access

TL;DR: In this article, the authors present a possibility to use the federated reinforcement learning (FRL) technique, which is one of the ML techniques, for 5G NR URLLC requirements and summarizes the corresponding achievements.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Johnson: Computers and Intractability-A Guide to the Theory of NP-Completeness

Michael Randolph Garey

Book

Computers and Intractability: A Guide to the Theory of NP-Completeness

Michael Randolph Garey, +1 more

TL;DR: The second edition of a quarterly column as discussed by the authors provides a continuing update to the list of problems (NP-complete and harder) presented by M. R. Garey and myself in our book "Computers and Intractability: A Guide to the Theory of NP-Completeness,” W. H. Freeman & Co., San Francisco, 1979.

...read moreread less

Journal ArticleDOI

Extended Bayesian information criteria for model selection with large model spaces

Jiahua Chen, +1 more

- 01 Sep 2008 -

Biometrika

TL;DR: This paper re-examine the Bayesian paradigm for model selection and proposes an extended family of Bayesian information criteria, which take into account both the number of unknown parameters and the complexity of the model space.

...read moreread less

Journal Article

A Selective Overview of Variable Selection in High Dimensional Feature Space.

Jianqing Fan, +1 more

- 01 Jan 2010 -

Statistica Sinica

TL;DR: In this paper, a brief account of the recent developments of theory, methods, and implementations for high-dimensional variable selection is presented, with emphasis on independence screening and two-scale methods.

...read moreread less

Book ChapterDOI

StatStream: statistical monitoring of thousands of data streams in real time

Yunyue Zhu, +1 more

TL;DR: In this article, the authors proposed an algorithm based on Discrete Fourier Transform (DFT) and a three level time interval hierarchy to find high correlations among all pairs of streams.

...read moreread less