Privacy-Preserving Data Mining in Spatiotemporal Databases Based on Mining Negative Association Rules

doi:10.1007/978-981-15-0135-7_32

Home
/
Papers
/
Privacy-Preserving Data Mining in Spatiotemporal Databases Based on Mining Negative Association Rules

Book Chapter•DOI•

Privacy-Preserving Data Mining in Spatiotemporal Databases Based on Mining Negative Association Rules

K. S. Ranjith¹, A. Geetha Mary¹•Institutions (1)

VIT University¹

01 Jan 2020-pp 329-339

TL;DR: The mathematical calculation was done and proved that this approach is best for mining association rules for spatiotemporal databases based on the mining negative association rules and cryptography with low storage and communication cost.

read less

Abstract: In the real world, most of the entities are involved with space and time, from any starting point to the end point of the space. The conventional data mining process is extended to the mining knowledge of the spatiotemporal databases. The major knowledge is to mine the association rules in the spatiotemporal databases; the traditional approaches are not sufficient to do mining in the spatiotemporal databases. While mining the association rules, the privacy is the main concern. This paper proposed privacy preserved data mining technique for spatiotemporal databases based on the mining negative association rules and cryptography with low storage and communication cost. In the proposed approach first, the partial support for all the distributed sites is calculated, and then finally, the actual support was calculated to achieve privacy preserve data mining. The mathematical calculation was done and proved that this approach is best for mining association rules for spatiotemporal databases.

...read moreread less

Citations

PDF

Open Access

More filters

Journal Article•DOI•

Privacy Preserving Data Mining Framework for Negative Association Rules: An Application to Healthcare Informatics

[...]

Saad M. Darwish, Reham Moustafa Essa, Mohamed A. Osman, Ahmed A. Ismail

01 Jan 2022-IEEE Access

TL;DR: Experiments with benchmark healthcare datasets show that the suggested privacy preserving data mining (PPDM) method outperforms existing algorithms in terms of Hiding Failure (HF), Artificial Rule Generation (AR), and Lost Rules (LR).

...read moreread less

Abstract: Protecting the privacy of healthcare information is an important part of encouraging data custodians to give accurate records so that mining may proceed with confidence. The application of association rule mining in healthcare data has been widespread to this point in time. Most applications focus on positive association rules, ignoring the negative consequences of particular diagnostic techniques. When it comes to bridging divergent diseases and drugs, negative association rules may give more helpful information than positive ones. This is especially true when it comes to physicians and social organizations (e.g., a certain symptom will not arise when certain symptoms exist). Data mining in healthcare must be done in a way that protects the identity of patients, especially when dealing with sensitive information. However, revealing this information puts it at risk of attack. Healthcare data privacy protection has lately been addressed by technologies that disrupt data (data sanitization) and reconstruct aggregate distributions in the interest of doing research in data mining. In this study, metaheuristic-based data sanitization for healthcare data mining is investigated in order to keep patient privacy protected. It is hoped that by using the Tabu-genetic algorithm as an optimization tool, the suggested technique chooses item sets to be sanitized (modified) from transactions that satisfy sensitive negative criteria with the goal of minimizing changes to the original database. Experiments with benchmark healthcare datasets show that the suggested privacy preserving data mining (PPDM) method outperforms existing algorithms in terms of Hiding Failure (HF), Artificial Rule Generation (AR), and Lost Rules (LR).

...read moreread less

10 citations

Journal Article•DOI•

An Adaptive Privacy Preserving Framework for Distributed Association Rule Mining in Healthcare Databases

[...]

Hasanien K. Kuba, Mustafa A. Azzawi, Saad M. Darwish, Oday A. Hassen, Ansam A. Abdulhussein - Show less +1 more

01 Jan 2023-Cmc-computers Materials & Continua

TL;DR: In this paper , the Tabu-genetic optimization paradigm was used for negative association rule mining in vertically partitioned healthcare datasets that respects users' privacy, and the applied approach dynamically determines the transactions to be interrupted for information hiding, instead of predefining them.

...read moreread less

Abstract: It is crucial, while using healthcare data, to assess the advantages of data privacy against the possible drawbacks. Data from several sources must be combined for use in many data mining applications. The medical practitioner may use the results of association rule mining performed on this aggregated data to better personalize patient care and implement preventive measures. Historically, numerous heuristics (e.g., greedy search) and metaheuristics-based techniques (e.g., evolutionary algorithm) have been created for the positive association rule in privacy preserving data mining (PPDM). When it comes to connecting seemingly unrelated diseases and drugs, negative association rules may be more informative than their positive counterparts. It is well-known that during negative association rules mining, a large number of uninteresting rules are formed, making this a difficult problem to tackle. In this research, we offer an adaptive method for negative association rule mining in vertically partitioned healthcare datasets that respects users’ privacy. The applied approach dynamically determines the transactions to be interrupted for information hiding, as opposed to predefining them. This study introduces a novel method for addressing the problem of negative association rules in healthcare data mining, one that is based on the Tabu-genetic optimization paradigm. Tabu search is advantageous since it removes a huge number of unnecessary rules and item sets. Experiments using benchmark healthcare datasets prove that the discussed scheme outperforms state-of-the-art solutions in terms of decreasing side effects and data distortions, as measured by the indicator of hiding failure.

...read moreread less

References

PDF

Open Access

More filters

Proceedings Article•

Fast algorithms for mining association rules

[...]

Rakesh Agrawal, Ramakrishnan Srikant

01 Jul 1998

TL;DR: Two new algorithms for solving thii problem that are fundamentally different from the known algorithms are presented and empirical evaluation shows that these algorithms outperform theknown algorithms by factors ranging from three for small problems to more than an order of magnitude for large problems.

...read moreread less

Abstract: We consider the problem of discovering association rules between items in a large database of sales transactions. We present two new algorithms for solving thii problem that are fundamentally different from the known algorithms. Empirical evaluation shows that these algorithms outperform the known algorithms by factors ranging from three for small problems to more than an order of magnitude for large problems. We also show how the best features of the two proposed algorithms can be combined into a hybrid algorithm, called AprioriHybrid. Scale-up experiments show that AprioriHybrid scales linearly with the number of transactions. AprioriHybrid also has excellent scale-up properties with respect to the transaction size and the number of items in the database.

...read moreread less

10,863 citations

Journal Article•DOI•

Data mining: an overview from a database perspective

[...]

Ming-Syan Chen¹, Jiawei Han², Philip S. Yu³•Institutions (3)

National Taiwan University¹, Simon Fraser University², IBM³

01 Dec 1996-IEEE Transactions on Knowledge and Data Engineering

TL;DR: In this paper, a survey of the available data mining techniques is provided and a comparative study of such techniques is presented, based on a database researcher's point-of-view.

...read moreread less

Abstract: Mining information and knowledge from large databases has been recognized by many researchers as a key research topic in database systems and machine learning, and by many industrial companies as an important area with an opportunity of major revenues. Researchers in many different fields have shown great interest in data mining. Several emerging applications in information-providing services, such as data warehousing and online services over the Internet, also call for various data mining techniques to better understand user behavior, to improve the service provided and to increase business opportunities. In response to such a demand, this article provides a survey, from a database researcher's point of view, on the data mining techniques developed recently. A classification of the available data mining techniques is provided and a comparative study of such techniques is presented.

...read moreread less

2,327 citations

Journal Article•DOI•

State-of-the-art in privacy preserving data mining

[...]

Vassilios S. Verykios¹, Elisa Bertino², Igor Nai Fovino², Loredana Parasiliti Provenza², Yucel Saygin³, Yannis Theodoridis¹ - Show less +2 more•Institutions (3)

Research Academic Computer Technology Institute¹, University of Milan², Sabancı University³

01 Mar 2004

TL;DR: An overview of the new and rapidly emerging research area of privacy preserving data mining is provided, and a classification hierarchy that sets the basis for analyzing the work which has been performed in this context is proposed.

...read moreread less

Abstract: We provide here an overview of the new and rapidly emerging research area of privacy preserving data mining. We also propose a classification hierarchy that sets the basis for analyzing the work which has been performed in this context. A detailed review of the work accomplished in this area is also given, along with the coordinates of each work to the classification hierarchy. A brief evaluation is performed, and some initial conclusions are made.

...read moreread less

884 citations

Association Rules Mining: A Recent Overview

[...]

Sotiris Kotsiantis¹, Dimitris Kanellopoulos•Institutions (1)

University of Patras¹

01 Jan 2006

TL;DR: The preliminaries of basic concepts about association rule mining are provided and the list of existing association rulemining techniques are surveyed.

...read moreread less

Abstract: In this paper, we provide the preliminaries of basic concepts about association rule mining and survey the list of existing association rule mining techniques. Of course, a single article cannot be a complete review of all the al- gorithms, yet we hope that the references cited will cover the major theoretical issues, guiding the researcher in interesting research directions that have yet to be explored.

...read moreread less

485 citations

Proceedings Article•DOI•

A fast distributed algorithm for mining association rules

[...]

David W. Cheung¹, Jiawei Han, V.T. Ng, Ada Wai-Chee Fu, Yongjian Fu - Show less +1 more•Institutions (1)

University of Hong Kong¹

01 Dec 1996

TL;DR: In this article, a fast distributed mining of association rules (FDM) algorithm is proposed to generate a small number of candidate sets and substantially reduce the number of messages to be passed at mining association rules.

...read moreread less

Abstract: With the existence of many large transaction databases, the huge amounts of data, the high scalability of distributed systems, and the easy partitioning and distribution of a centralized database, it is important to investigate efficient methods for distributed mining of association rules. The study discloses some interesting relationships between locally large and globally large item sets and proposes an interesting distributed association rule mining algorithm, FDM (fast distributed mining of association rules), which generates a small number of candidate sets and substantially reduces the number of messages to be passed at mining association rules. A performance study shows that FDM has a superior performance over the direct application of a typical sequential algorithm. Further performance enhancement leads to a few variations of the algorithm.

...read moreread less

475 citations