scispace - formally typeset
Proceedings ArticleDOI

Ordinal association rules for error identification in data sets

TLDR
A method that finds these rules and identifies potential errors in data is proposed, and one use for ordinal rules is to identify possible errors inData.
Abstract
A new extension of the Boolean association rules, ordinal association rules, that incorporates ordinal relationships among data items, is introduced. One use for ordinal rules is to identify possible errors in data. A method that finds these rules and identifies potential errors in data is proposed.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Software defect prediction using relational association rule mining

TL;DR: This paper proposes a novel classification model based on relational association rules mining that overperforms, for most of the considered evaluation measures, the existing machine learning based techniques for defect prediction.
Patent

Semantic Discovery and Mapping Between Data Sources

TL;DR: In this paper, an apparatus and method are described for the discovery of semantics, relationships and mappings between data in different software applications, databases, files, reports, messages, or systems.
Journal ArticleDOI

The pairwise attribute noise detection algorithm

TL;DR: This work presents a novel approach for detecting instances with attribute noise and demonstrates its usefulness with case studies using two different real-world software measurement data sets, showing that PANDA provides better noise detection performance than the DM algorithm.
Patent

Discovering transformations applied to a source table to generate a target table

TL;DR: In this paper, a method, system, and article of manufacture for discovering transformations applied to a source table to generate a target table is presented, where a first preprocessing method is applied with respect to columns in the source and target tables to produce first category pre-processing output.
Book ChapterDOI

Data Cleansing: A Prelude to Knowledge Discovery

TL;DR: This chapter analyzes the problem of data cleansing and the identification of potential errors in data sets and presents a set of general methods that can be used to address the problem.
References
More filters
Proceedings ArticleDOI

Mining association rules between sets of items in large databases

TL;DR: An efficient algorithm is presented that generates all significant association rules between items in the database of customer transactions and incorporates buffer management and novel estimation and pruning techniques.
Book

Applied Multivariate Statistical Analysis

TL;DR: In this article, the authors present an overview of the basic concepts of multivariate analysis, including matrix algebra and random vectors, as well as a strategy for analyzing multivariate models.
Journal ArticleDOI

Applied Multivariate Statistical Analysis.

TL;DR: In this article, the authors present an overview of the basic concepts of multivariate analysis, including matrix algebra and random vectors, as well as a strategy for analyzing multivariate models.
Journal ArticleDOI

Data mining and knowledge discovery: making sense out of data

TL;DR: Without a concerted effort to develop knowledge discovery techniques, organizations stand to forfeit much of the value from the data they currently collect and store.
Journal ArticleDOI

Beyond accuracy: what data quality means to data consumers

TL;DR: Using this framework, IS managers were able to better understand and meet their data consumers' data quality needs and this research provides a basis for future studies that measure data quality along the dimensions of this framework.
Related Papers (5)