Journal ArticleDOI
Toward semantic data imputation for a dengue dataset
TLDR
An improvement in the efficiency of predicting missing data utilizing Particle Swarm Optimization (PSO), which is applied to the numerical data cleansing problem, with the performance of PSO being enhanced using K-means to help determine the fitness value.Abstract:
Missing data are a major problem that affects data analysis techniques for forecasting. Traditional methods suffer from poor performance in predicting missing values using simple techniques, e.g., mean and mode. In this paper, we present and discuss a novel method of imputing missing values semantically with the use of an ontology model. We make three new contributions to the field: first, an improvement in the efficiency of predicting missing data utilizing Particle Swarm Optimization (PSO), which is applied to the numerical data cleansing problem, with the performance of PSO being enhanced using K-means to help determine the fitness value. Second, the incorporation of an ontology with PSO for the purpose of narrowing the search space, to make PSO provide greater accuracy in predicting numerical missing values while quickly converging on the answer. Third, the facilitation of a framework to substitute nominal data that are lost from the dataset using the relationships of concepts and a reasoning mechanism concerning the knowledge-based model. The experimental results indicated that the proposed method could estimate missing data more efficiently and with less chance of error than conventional methods, as measured by the root mean square error.read more
Citations
More filters
Journal ArticleDOI
A Critical Review of Real-Time Modelling of Flood Forecasting in Urban Drainage Systems
TL;DR: In this article , the authors present a comprehensive review of the current state-of-the-art and future trends of real-time modelling of flood forecasting in urban drainage systems.
Journal ArticleDOI
Semantic data mining in the information age: A systematic review
TL;DR: A comprehensive overview of the literature on domain ontologies as used in the various semantic data‐mining tasks, such as preprocessing, modeling, and postprocessing is provided.
Journal ArticleDOI
Virtual sensor-based imputed graph attention network for anomaly detection of equipment with incomplete data
TL;DR: Wang et al. as discussed by the authors proposed a virtual sensor-based imputed graph attention network, which generates signals to impute the time of sensor record failure by generative adversarial network (GAN) and extracts the features of complete signals mixed with real signals and generated signals by GAT.
Posted Content
Nearest Neighbor Imputation for Categorical Data by Weighting of Attributes
Shahla Faisal,Gerhard Tutz +1 more
TL;DR: The weighted nearest neighbors approach is extended to impute missing values in categorical variables and shows that the weighting of attributes yields smaller imputation errors than existing approaches.
Journal ArticleDOI
Intelligent approach to automated star-schema construction using a knowledge base
TL;DR: A new strategy that incorporates knowledge-based models into a framework, named the Semantic-based Star-schema Designer, that assists the automation of star schema construction and their relationship information without human intervention using homegrown algorithms.
References
More filters
Journal ArticleDOI
A GS-MPSO-WKNN method for missing data imputation in wireless sensor networks monitoring manufacturing conditions:
TL;DR: A missing data estimation algorithm, GS-MPSO-WKNN (Gaussian mutation and simulated annealing-based memetic particle swarm optimization for weighted K nearest neighbours), based on a weighted K closest neighbour (WKnn) and memetic computing is proposed.
Complexity analysis of problem-dimension using PSO
TL;DR: This work analyzes the internal behavior of particle swarm optimization (PSO) algorithm when the complexity of the problem increased and illustrates that all parameters in any dimension behave in similar pattern and can expect similar behavior for additional complexity in the problem.
Proceedings ArticleDOI
Ontology-based functional classification of genes: Evaluation with reference sets and overlap analysis
Sidahmed Benabderrahmane,Marie-Dominique Devignes,Malika Smail Tabbone,Amedeo Napoli,Olivier Poch +4 more
TL;DR: This paper evaluates functional classification of genes using the previously described IntelliGO semantic similarity measure with the help of reference sets, and proposes a set-difference method for discovering missing information.
Proceedings ArticleDOI
Attempt to reduce the computational complexity in multi-objective differential evolution algorithms
TL;DR: This work presents the methods to decrease the cost of nondominated sorting and diversity estimation procedures for multiobjective differential evolution (DE) algorithms by using a combination of well known data structures to efficiently update these attributes.
Journal ArticleDOI
Enrichment of Association Rules through Exploitation of Ontology Properties - Healthcare Case Study
TL;DR: A hypothesis suggests that especially tackling property relations, chain property being part of the current version of the W3C Web Ontology Language (OWL), will yield better rules, and suggests that the latter produces novel rules with strong confidence and support.
Related Papers (5)
Enhanced Fuzzy K-NN Approach for Handling Missing Values in Medical Data Mining
R. Naveen Kumar,M. Anand Kumar +1 more