Semisupervised Deep Reinforcement Learning in Support of IoT and Smart City Services

doi:10.1109/JIOT.2017.2712560

Mehdi Mohammadi, +3 more

- 01 Apr 2018 -

IEEE Internet of Things Journal

- Vol. 5, Iss: 2, pp 624-635

TLDR

This paper proposes a semisupervised DRL model that fits smart city applications as it consumes both labeled and unlabeled data to improve the performance and accuracy of the learning agent and utilizes variational autoencoders as the inference engine for generalizing optimal policies.

Abstract:

Smart services are an important element of the smart cities and the Internet of Things (IoT) ecosystems where the intelligence behind the services is obtained and improved through the sensory data. Providing a large amount of training data is not always feasible; therefore, we need to consider alternative ways that incorporate unlabeled data as well. In recent years, deep reinforcement learning (DRL) has gained great success in several application domains. It is an applicable method for IoT and smart city scenarios where auto-generated data can be partially labeled by users’ feedback for training purposes. In this paper, we propose a semisupervised DRL model that fits smart city applications as it consumes both labeled and unlabeled data to improve the performance and accuracy of the learning agent. The model utilizes variational autoencoders as the inference engine for generalizing optimal policies. To the best of our knowledge, the proposed model is the first investigation that extends DRL to the semisupervised paradigm. As a case study of smart city applications, we focus on smart buildings and apply the proposed model to the problem of indoor localization based on Bluetooth low energy signal strength. Indoor localization is the main component of smart city services since people spend significant time in indoor environments. Our model learns the best action policies that lead to a close estimation of the target locations with an improvement of 23% in terms of distance to the target and at least 67% more received rewards compared to the supervised DRL model.
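The framing in the abstract, where an agent takes actions that move a location estimate toward a target and collects rewards along the way, can be illustrated with a small sketch. This is not the authors' model: the grid world, the four-move action set, the +1/-1 reward values, and the names `ACTIONS`, `step`, and `greedy_episode` are all hypothetical choices made for illustration, and the greedy rule stands in for a learned policy. The sketch only shows why labels matter: a distance-based reward is available when the target location is known, and unavailable for unlabeled samples.

```python
import math

# Hypothetical action set: move the position estimate one grid cell at a time.
ACTIONS = {"north": (0, 1), "south": (0, -1), "east": (1, 0), "west": (-1, 0)}


def distance(a, b):
    """Euclidean distance between two (x, y) grid points."""
    return math.hypot(a[0] - b[0], a[1] - b[1])


def step(position, action, target=None):
    """Apply one action and return (new_position, reward).

    `target` is the labeled ground-truth location. For unlabeled samples it is
    None, so no distance-based reward can be computed and the reward is None.
    The +1/-1 reward values are illustrative assumptions.
    """
    dx, dy = ACTIONS[action]
    new_position = (position[0] + dx, position[1] + dy)
    if target is None:
        return new_position, None
    closer = distance(new_position, target) < distance(position, target)
    return new_position, 1.0 if closer else -1.0


def greedy_episode(start, target, max_steps=20):
    """Repeatedly take the action that most reduces distance to the target.

    A stand-in for a learned policy; stops when the target is reached or
    after `max_steps` moves.
    """
    position, total_reward = start, 0.0
    for _ in range(max_steps):
        if position == target:
            break
        best = min(ACTIONS, key=lambda a: distance(step(position, a)[0], target))
        position, reward = step(position, best, target)
        total_reward += reward
    return position, total_reward
```

Calling `step((0, 0), "east")` without a target returns a new position but no reward, which is the situation an unlabeled sample puts the agent in. In a semisupervised setting, some other signal would have to stand in for that missing reward; the abstract assigns that role to the variational autoencoder, and the sketch does not attempt to reproduce it.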

Citations

Journal ArticleDOI

An uncertainty-aware deep reinforcement learning framework for residential air conditioning energy management

Clement Lork, +6 more

- 15 Oct 2020 -

Applied Energy

TL;DR: This work proposes a data-driven, uncertainty-aware approach to controlling split-type inverter ACs in residential buildings, using Bayesian convolutional neural network (BCNN) models to capture the performance and uncertainty of the ACs from aggregated data.

Posted Content

Whatever Does Not Kill Deep Reinforcement Learning, Makes It Stronger

Vahid Behzadan, +1 more

- 23 Dec 2017 -

arXiv: Artificial Intelligence

TL;DR: It is demonstrated that under noncontiguous training-time attacks, Deep Q-Network (DQN) agents can recover and adapt to the adversarial conditions by reactively adjusting the policy.

Journal ArticleDOI

An Intelligent Non-Integer PID Controller-Based Deep Reinforcement Learning: Implementation and Experimental Results

Meysam Gheisarnejad, +1 more

- 01 Apr 2021 -

IEEE Transactions on Industrial Electron...

TL;DR: A noninteger proportional integral derivative (PID)-type controller based on the deep deterministic policy gradient algorithm is developed for the tracking problem of a mobile robot exposed to measurement noise and external disturbances.

Journal ArticleDOI

Semi‐supervised learning based on convolutional neural network and uncertainty filter for façade defects classification

Jingjing Guo, +2 more

- 01 Mar 2021 -

Computer-aided Civil and Infrastructure ...

TL;DR: A semi‐supervised learning algorithm that uses only a small amount of labeled data for training, but still achieves high classification accuracy is proposed, and a novel uncertainty filter to select reliable unlabeled data for initial training epochs is developed to further improve the classification accuracy.

Journal ArticleDOI

The Deep Learning Compiler: A Comprehensive Survey

Mingzhen Li, +9 more

- 01 Mar 2021 -

IEEE Transactions on Parallel and Distri...

TL;DR: This article presents a comprehensive survey of DL compilers, with an emphasis on DL-oriented multi-level IRs and frontend/backend optimizations, and highlights several insights as potential research directions for DL compilers.
