Open Access, Posted Content

Proposition of a Theoretical Model for Missing Data Imputation using Deep Learning and Evolutionary Algorithms.

TLDR
It is hypothesized that deep neural networks built using autoencoders and denoising autoencoders, used in conjunction with genetic algorithms, swarm intelligence and maximum likelihood estimator methods as novel data imputation techniques, will lead to better imputed values than existing techniques.
Abstract
In the last couple of decades, there have been major advancements in the domain of missing data imputation. Techniques in this domain include, among others, Expectation Maximization, Neural Networks combined with Evolutionary Algorithms or other optimization techniques, and K-Nearest Neighbor approaches. The presence of missing data entries in databases renders the tasks of decision-making and data analysis nontrivial. As a result, this area has attracted a lot of research interest, with the aim of yielding accurate, time-efficient and sensitive missing data imputation techniques, especially for time-sensitive applications such as power plants and winding processes. In this article, considering arbitrary and monotone missing data patterns, we hypothesize that the use of deep neural networks built using autoencoders and denoising autoencoders, in conjunction with genetic algorithms, swarm intelligence and maximum likelihood estimator methods, as novel data imputation techniques will lead to better imputed values than existing techniques. Also considered are the missing at random, missing completely at random and missing not at random missing data mechanisms. We also intend to use fuzzy logic in tandem with deep neural networks to perform the missing data imputation tasks, as well as different building blocks for the deep neural networks, such as Stacked Restricted Boltzmann Machines and Deep Belief Networks, to test our hypothesis. The motivation behind this article is the need for missing data imputation techniques that lead to better imputed values than existing methods, with higher accuracies and lower errors.
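The abstract proposes pairing an autoencoder with an evolutionary optimizer but does not spell out the mechanics. One common way to combine the two, sketched below under stated assumptions, is to let the optimizer search for the missing entries that minimize the autoencoder's reconstruction error for the completed record. Everything in the sketch is illustrative and not taken from the article: the toy data, the names `reconstruction_error` and `ga_impute`, the GA settings, and in particular the use of a linear PCA encoder/decoder as a stand-in for a trained deep autoencoder.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy data (assumption): 4 correlated features driven by 2 latent factors.
W = np.array([[1.0, 0.5, -1.0, 2.0],
              [0.0, 1.0, 1.0, -1.0]])
X_train = rng.normal(size=(500, 2)) @ W + 0.01 * rng.normal(size=(500, 4))

# Stand-in "autoencoder" (assumption): linear encoder/decoder from PCA
# with a 2-unit bottleneck, instead of a trained deep network.
mu = X_train.mean(axis=0)
_, _, Vt = np.linalg.svd(X_train - mu, full_matrices=False)
components = Vt[:2]


def reconstruction_error(records):
    """Squared error between each record and its reconstruction."""
    codes = (records - mu) @ components.T   # encode
    recon = codes @ components + mu         # decode
    return np.sum((records - recon) ** 2, axis=1)


def ga_impute(record, missing, pop_size=60, generations=150):
    """Simple genetic algorithm over candidate values for the missing entries."""
    lo = X_train[:, missing].min(axis=0)
    hi = X_train[:, missing].max(axis=0)
    pop = rng.uniform(lo, hi, size=(pop_size, len(missing)))
    n_elite = pop_size // 2
    n_child = pop_size - n_elite
    sigma = 0.5  # mutation scale, decayed every generation

    def fitness(candidates):
        filled = np.tile(record, (len(candidates), 1))
        filled[:, missing] = candidates     # overwrite the NaN slots
        return reconstruction_error(filled)

    for _ in range(generations):
        # Truncation selection: keep the better half unchanged (elitism).
        elite = pop[np.argsort(fitness(pop))[:n_elite]]
        # Blend crossover between randomly paired elite parents.
        pa = elite[rng.integers(n_elite, size=n_child)]
        pb = elite[rng.integers(n_elite, size=n_child)]
        alpha = rng.uniform(size=pa.shape)
        children = alpha * pa + (1 - alpha) * pb
        # Gaussian mutation.
        children += rng.normal(scale=sigma, size=children.shape)
        pop = np.vstack([elite, children])
        sigma *= 0.97
    return pop[np.argmin(fitness(pop))]


# Usage: hide two entries of a known record and try to recover them.
truth = np.array([2.0, -0.5, -3.5, 5.5])    # lies on the toy latent subspace
missing = [1, 3]
record = truth.copy()
record[missing] = np.nan
imputed = ga_impute(record, missing)
print("imputed:", imputed, "true:", truth[missing])
```

In this toy setting the reconstruction error is a smooth function of the missing values, so a gradient-based solver would also work; the GA is used only because it mirrors the kind of optimizer named in the abstract. A swarm intelligence method could be substituted for `ga_impute` without changing the objective.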


Citations
Journal Article

Incomplete data management: a survey

TL;DR: It is hoped that this survey could provide insights to the database community on how incomplete data is managed, and inspire database researchers to develop more advanced processing techniques and tools to cope with the issues resulting from incomplete data in the real world.
Journal Article

A robust deep learning model for missing value imputation in big NCDC dataset

TL;DR: An efficient deep learning imputation model is proposed for imputing the missing values in weather data of an individual weather station on a temporal basis and the SGD optimizer is found to be more accurate in predicting the missing numbers.
Journal Article

Missing Value Estimation using Clustering and Deep Learning within Multiple Imputation Framework

TL;DR: Zhang et al. propose methods to improve both the imputation accuracy of MICE and the classification accuracy of imputed data by replacing MICE's linear regressors with ensemble learning and deep neural networks.
Journal Article

Providing an imputation algorithm for missing values of longitudinal data using Cuckoo search algorithm: A case study on cervical dystonia

TL;DR: Concomitant use of similar parameters and correlation coefficients led to a significant increase in accuracy of missing data imputation in this study.
Journal Article

Pay attention and you won't lose it: a deep learning approach to sequence imputation.

TL;DR: This work proposes using attention mechanisms to entirely replace the recurrent components of these LSTM networks as a solution for data restoration, and demonstrates that this approach leads to reduced model sizes, a 2-fold to 4-fold reduction in training times, and 95% accuracy for automotive data restoration.
References
Book

Statistical Analysis with Missing Data

TL;DR: Topics covered include maximum likelihood for general patterns of missing data (introduction and theory with ignorable nonresponse) and large-sample inference based on maximum likelihood estimates.
Journal Article

Reducing the Dimensionality of Data with Neural Networks

TL;DR: In this article, an effective way of initializing the weights that allows deep autoencoder networks to learn low-dimensional codes that work much better than principal components analysis as a tool to reduce the dimensionality of data is described.
Journal Article

A fast learning algorithm for deep belief nets

TL;DR: A fast, greedy algorithm is derived that can learn deep, directed belief networks one layer at a time, provided the top two layers form an undirected associative memory.
Journal Article

Deep learning in neural networks

TL;DR: This historical survey compactly summarizes relevant work, much of it from the previous millennium, and reviews deep supervised learning, unsupervised learning, reinforcement learning and evolutionary computation, and indirect search for short programs encoding deep and large networks.
Journal Article

Representation Learning: A Review and New Perspectives

TL;DR: Recent work in the area of unsupervised feature learning and deep learning is reviewed, covering advances in probabilistic models, autoencoders, manifold learning, and deep networks.