Showing papers by "Robert Babuska published in 2017"

PDF

Open Access

Journal Article•DOI•

Residential Demand Response of Thermostatically Controlled Loads Using Batch Reinforcement Learning

[...]

Frederik Ruelens¹, Bert Claessens², Stijn Vandael¹, Bart De Schutter³, Robert Babuska³, Ronnie Belmans¹ - Show less +2 more•Institutions (3)

Katholieke Universiteit Leuven¹, Flemish Institute for Technological Research², Delft University of Technology³

01 Sep 2017-IEEE Transactions on Smart Grid

TL;DR: The experiments show that batch RL techniques provide a valuable alternative to model-based controllers and that they can be used to construct both closed-loop and open-loop policies.

...read moreread less

Abstract: Driven by recent advances in batch Reinforcement Learning (RL), this paper contributes to the application of batch RL to demand response. In contrast to conventional model-based approaches, batch RL techniques do not require a system identification step, making them more suitable for a large-scale implementation. This paper extends fitted Q-iteration, a standard batch RL technique, to the situation when a forecast of the exogenous data is provided. In general, batch RL techniques do not rely on expert knowledge about the system dynamics or the solution. However, if some expert knowledge is provided, it can be incorporated by using the proposed policy adjustment method. Finally, we tackle the challenge of finding an open-loop schedule required to participate in the day-ahead market. We propose a model-free Monte Carlo method that uses a metric based on the state-action value function or Q-function and we illustrate this method by finding the day-ahead schedule of a heat-pump thermostat. Our experiments show that batch RL techniques provide a valuable alternative to model-based controllers and that they can be used to construct both closed-loop and open-loop policies.

...read moreread less

263 citations

Journal Article•DOI•

Railway Track Circuit Fault Diagnosis Using Recurrent Neural Networks

[...]

Tim de Bruin¹, K. Verbert¹, Robert Babuska¹•Institutions (1)

Delft University of Technology¹

01 Mar 2017-IEEE Transactions on Neural Networks

TL;DR: The long-short-term memory (LSTM) recurrent neural network is proposed to accomplish fault detection and identification tasks based on the commonly available measurement signals by considering the signals from multiple track circuits in a geographic area.

...read moreread less

Abstract: Timely detection and identification of faults in railway track circuits are crucial for the safety and availability of railway networks. In this paper, the use of the long-short-term memory (LSTM) recurrent neural network is proposed to accomplish these tasks based on the commonly available measurement signals. By considering the signals from multiple track circuits in a geographic area, faults are diagnosed from their spatial and temporal dependences. A generative model is used to show that the LSTM network can learn these dependences directly from the data. The network correctly classifies 99.7% of the test input sequences, with no false positive fault detections. In addition, the t-Distributed Stochastic Neighbor Embedding (t-SNE) method is used to examine the resulting network, further showing that it has learned the relevant dependences in the data. Finally, we compare our LSTM network with a convolutional network trained on the same task. From this comparison, we conclude that the LSTM network architecture is better suited for the railway track circuit fault detection and identification tasks than the convolutional network.

...read moreread less

221 citations

Journal Article•DOI•

A big data analysis approach for rail failure risk assessment

[...]

Ali Jamshidi¹, Shahrzad Faghih-Roohi¹, Siamak Hajizadeh¹, Alfredo Núñez¹, Robert Babuska¹, Rolf Dollevoet¹, Zili Li¹, Bart De Schutter¹ - Show less +4 more•Institutions (1)

Delft University of Technology¹

31 May 2017-Risk Analysis

TL;DR: This article proposes an image processing approach for automatic detection of squats, especially severe types that are prone to rail breaks, that are detected automatically among the huge number of records from video cameras.

...read moreread less

Abstract: Railway infrastructure monitoring is a vital task to ensure rail transportation safety A rail failure could result in not only a considerable impact on train delays and maintenance costs, but also on safety of passengers In this article, the aim is to assess the risk of a rail failure by analyzing a type of rail surface defect called squats that are detected automatically among the huge number of records from video cameras We propose an image processing approach for automatic detection of squats, especially severe types that are prone to rail breaks We measure the visual length of the squats and use them to model the failure risk For the assessment of the rail failure risk, we estimate the probability of rail failure based on the growth of squats Moreover, we perform severity and crack growth analyses to consider the impact of rail traffic loads on defects in three different growth scenarios The failure risk estimations are provided for several samples of squats with different crack growth lengths on a busy rail track of the Dutch railway network The results illustrate the practicality and efficiency of the proposed approach

...read moreread less

97 citations

Journal Article•DOI•

Timely condition-based maintenance planning for multi-component systems

[...]

K. Verbert¹, B. De Schutter¹, Robert Babuska¹•Institutions (1)

Delft University of Technology¹

01 Mar 2017-Reliability Engineering & System Safety

TL;DR: This paper proposes a new strategy for timely maintenance planning in multi-component systems that accounts for economic and structural dependence with the aim to profit from spreading or combining various maintenance activities.

...read moreread less

66 citations

Proceedings Article•DOI•

Automated tuning and configuration of path planning algorithms

[...]

Ruben Burger¹, Mukunda Bharatheesha¹, Marc van Bert, Robert Babuska¹•Institutions (1)

Delft University of Technology¹

01 May 2017

TL;DR: It is shown that it is possible to improve the performance of a planning algorithm for a specific problem without the need of in-depth knowledge of the algorithm itself, and the use of Sequential Model-based Algorithm Configuration (SMAC) tools to address these concerns.

...read moreread less

Abstract: A large number of novel path planning methods for a wide range of problems have been described in literature over the past few decades. These algorithms can often be configured using a set of parameters that greatly influence their performance. In a typical use case, these parameters are only very slightly tuned or even left untouched. Systematic approaches to tune parameters of path planning algorithms have been largely unexplored. At the same time, there is a rising interest in the planning and robotics communities regarding the real world application of these theoretically developed and simulation-tested planning algorithms. In this work, we propose the use of Sequential Model-based Algorithm Configuration (SMAC) tools to address these concerns. We show that it is possible to improve the performance of a planning algorithm for a specific problem without the need of in-depth knowledge of the algorithm itself. We compare five planners that see a lot of practical usage on three typical industrial pick-and-place tasks to demonstrate the effectiveness of the method.

...read moreread less

14 citations

Journal Article•DOI•

Identification of distributed-parameter systems from sparse measurements

[...]

Z. Hidayat¹, Robert Babuska¹, Alfredo Núñez¹, B. De Schutter¹•Institutions (1)

Delft University of Technology¹

01 Nov 2017-Applied Mathematical Modelling

TL;DR: A method based on finite-difference discretization on a grid in space and time for the identification of distributed-parameter systems is proposed, suitable for the case when the partial differential equation describing the system is not known.

...read moreread less

12 citations

Journal Article•DOI•

Optimal Control via Reinforcement Learning with Symbolic Policy Approximation

[...]

Jiří Kubalík¹, Eduard Alibekov¹, Robert Babuska¹, Robert Babuska²•Institutions (2)

Czech Technical University in Prague¹, Delft University of Technology²

01 Jul 2017-IFAC-PapersOnLine

TL;DR: A novel method to construct a smooth policy represented by an analytic equation, obtained by means of symbolic regression is proposed and shows that the analytic control law performs at least equally well as the original numerically approximated policy, while it leads to much smoother control signals.

...read moreread less

11 citations

Proceedings Article•DOI•

Enhanced Symbolic Regression Through Local Variable Transformations

[...]

Jiri Kubalik¹, Erik Derner¹, Robert Babuska²•Institutions (2)

Czech Technical University in Prague¹, Delft University of Technology²

16 Nov 2017

TL;DR: A GP extension introducing a new concept of local transformed variables, based on a locally applied affine transformation of the original variables is presented, which confirms the hypothesis that the transformed variables significantly improve the performance of the standard SNGP algorithm.

...read moreread less

Abstract: Genetic programming (GP) is a technique widely used in a range of symbolic regression problems, in particular when there is no prior knowledge about the symbolic function sought. In this paper, we present a GP extension introducing a new concept of local transformed variables, based on a locally applied affine transformation of the original variables. This approach facilitates finding accurate parsimonious models. We have evaluated the proposed extension in the context of the Single Node Genetic Programming (SNGP) algorithm on synthetic as well as real-problem datasets. The results confirm our hypothesis that the transformed variables significantly improve the performance of the standard SNGP algorithm.

...read moreread less

11 citations

Proceedings Article•DOI•

Model-based real-time control of a magnetic manipulator system

[...]

Jan-Willem Damsteeg¹, Subramanya Nageshrao², Robert Babuska¹•Institutions (2)

Delft University of Technology¹, Ford Motor Company²

01 Dec 2017

TL;DR: This paper designs and experimentally evaluates two nonlinear controllers for a magnetic manipulation (Magman) system, which consists of four electromagnetic coils arranged linearly, and benchmark two non linear control methods, namely feedback linearization and a constrained state-dependent Riccati equation (SDRE) control.

...read moreread less

Abstract: Precise magnetic manipulation has numerous applications, ranging from manufacturing to the medical field. Owing to the nonlinear nature of the electromagnetic force, magnetic manipulation requires advanced nonlinear control. In this paper, we design and experimentally evaluate two nonlinear controllers for a magnetic manipulation (Magman) system, which consists of four electromagnetic coils arranged linearly. The current through the coils is controlled in order to accurately position a steel ball, rolling freely in a track above the coils. We benchmark two nonlinear control methods, namely feedback linearization and a constrained state-dependent Riccati equation (SDRE) control. These methods are chosen due to their widespread use in academia as well as industrial applications. On the actual setup, constrained SDRE has performed considerably better in terms of the settling time, overshoot, and the amount of control effort when compared to feedback linearization.

...read moreread less

7 citations

Journal Article•DOI•

Evaluation of physical damage associated with action selection strategies in reinforcement learning

[...]

Ivan Koryakovskiy¹, Heike Vallery¹, Robert Babuska¹, Wouter Caarls²•Institutions (2)

Delft University of Technology¹, Pontifical Catholic University of Rio de Janeiro²

01 Jul 2017-IFAC-PapersOnLine

TL;DR: Inspired by the OU and PADA methods, four new action-selection methods are proposed in a systematic way and one of the proposed methods with a time-correlated noise outperforms the well-known e-greedy method in all three benchmarks.

...read moreread less

7 citations

Journal Article•DOI•

Takagi–Sugeno fuzzy payload estimation and adaptive control

[...]

Selami Beyhan¹, Farnaz Eghbal Sarabi², Zsofia Lendek³, Robert Babuska²•Institutions (3)

Pamukkale University¹, Delft University of Technology², Technical University of Cluj-Napoca³

01 Jul 2017-IFAC-PapersOnLine

TL;DR: In this article, a novel adaptive Takagi-Sugeno (TS) fuzzy observer-based controller is proposed, and the closed-loop stability and boundedness of all the signals are proven by Lyapunov stability analysis.

...read moreread less