scispace - formally typeset
Proceedings ArticleDOI

Knowledge discovery from massive healthcare claims data

Reads0
Chats0
TLDR
This paper translates the problem of analyzing healthcare data into some of the most well-known analysis problems in the data mining community, social network analysis, text mining, and temporal analysis and higher order feature construction, and describes how advances within each of these areas can be leveraged to understand the domain of healthcare.
Abstract
he role of big data in addressing the needs of the present healthcare system in US and rest of the world has been echoed by government, private, and academic sectors. There has been a growing emphasis to explore the promise of big data analytics in tapping the potential of the massive healthcare data emanating from private and government health insurance providers. While the domain implications of such collaboration are well known, this type of data has been explored to a limited extent in the data mining community. The objective of this paper is two fold: first, we introduce the emerging domain of "big" healthcare claims data to the KDD community, and second, we describe the success and challenges that we encountered in analyzing this data using state of art analytics for massive data. Specifically, we translate the problem of analyzing healthcare data into some of the most well-known analysis problems in the data mining community, social network analysis, text mining, and temporal analysis and higher order feature construction, and describe how advances within each of these areas can be leveraged to understand the domain of healthcare. Each case study illustrates a unique intersection of data mining and healthcare with a common objective of improving the cost-care ratio by mining for opportunities to improve healthcare operations and reducing what seems to fall under fraud, waste, and abuse.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Health-CPS: Healthcare Cyber-Physical System Assisted by Cloud and Big Data

TL;DR: The results of this study show that the technologies of cloud and big data can be used to enhance the performance of the healthcare system so that humans can then enjoy various smart healthcare applications and services.
Journal ArticleDOI

Fraud detection: A systematic literature review of graph-based anomaly detection approaches

TL;DR: This study develops a framework to synthesize the existing literature on the application of GBAD methods in fraud detection published between 2007 and 2018 to investigate the present trends and identify the key challenges.
Journal ArticleDOI

Big Data fraud detection using multiple medicare data sources

TL;DR: This paper focuses on the detection of Medicare fraud using the following CMS datasets and suggests using the Combined dataset for detecting fraudulent behavior when a physician has submitted payments through any or all Medicare parts evaluated in this study.
Journal ArticleDOI

Handling big data: research challenges and future directions

TL;DR: A classification of some of the most important challenges when handling big data is presented and solutions that could address the identified challenges are recommended.
Journal ArticleDOI

Big data analytics in healthcare: a systematic literature review

TL;DR: The findings from this study suggest that applications of BDA in healthcare can be observed from five perspectives, namely, health awareness among the general public, interactions among stakeholders in the healthcare ecosystem, hospital management practices, treatment of specific medical conditions, and technology in healthcare service delivery.
References
More filters
Journal ArticleDOI

Latent dirichlet allocation

TL;DR: This work proposes a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model.
Proceedings Article

Latent Dirichlet Allocation

TL;DR: This paper proposed a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hof-mann's aspect model, also known as probabilistic latent semantic indexing (pLSI).
Book

Social Network Analysis: Methods and Applications

TL;DR: This paper presents mathematical representation of social networks in the social and behavioral sciences through the lens of Dyadic and Triadic Interaction Models, which describes the relationships between actor and group measures and the structure of networks.
Proceedings Article

The PageRank Citation Ranking : Bringing Order to the Web

TL;DR: This paper describes PageRank, a mathod for rating Web pages objectively and mechanically, effectively measuring the human interest and attention devoted to them, and shows how to efficiently compute PageRank for large numbers of pages.
Journal ArticleDOI

Social Network Analysis: Methods and Applications.

TL;DR: This work characterizes networked structures in terms of nodes (individual actors, people, or things within the network) and the ties, edges, or links that connect them.
Related Papers (5)