scispace - formally typeset
Open AccessJournal ArticleDOI

Identifying and characterizing high-risk clusters in a heterogeneous ICU population with deep embedded clustering

TLDR
In this paper, a machine learning methodology was developed to identify and provide better characterization of patient clusters at high risk of mortality and kidney injury in the ICU of a Dutch hospital.
Abstract
Critically ill patients constitute a highly heterogeneous population, with seemingly distinct patients having similar outcomes, and patients with the same admission diagnosis having opposite clinical trajectories. We aimed to develop a machine learning methodology that identifies and provides better characterization of patient clusters at high risk of mortality and kidney injury. We analysed prospectively collected data including co-morbidities, clinical examination, and laboratory parameters from a minimally-selected population of 743 patients admitted to the ICU of a Dutch hospital between 2015 and 2017. We compared four clustering methodologies and trained a classifier to predict and validate cluster membership. The contribution of different variables to the predicted cluster membership was assessed using SHapley Additive exPlanations values. We found that deep embedded clustering yielded better results compared to the traditional clustering algorithms. The best cluster configuration was achieved for 6 clusters. All clusters were clinically recognizable, and differed in in-ICU, 30-day, and 90-day mortality, as well as incidence of acute kidney injury. We identified two high mortality risk clusters with at least 60%, 40%, and 30% increased. ICU, 30-day and 90-day mortality, and a low risk cluster with 25-56% lower mortality risk. This machine learning methodology combining deep embedded clustering and variable importance analysis, which we made publicly available, is a possible solution to challenges previously encountered by clustering analyses in heterogeneous patient populations and may help improve the characterization of risk groups in critical care.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Clustering analysis of geriatric and acute characteristics in a cohort of very old patients on admission to ICU

TL;DR: In this paper , the authors identify new and robust phenotypes based on the combination of these features to facilitate early outcome prediction in very old patients, which constitutes a major challenge to prognostication and patient management in ICUs.
Journal ArticleDOI

Cluster analysis integrating age and body temperature for mortality in patients with sepsis: a multicenter retrospective study

TL;DR: In this paper , the authors investigated the mortality rates in sepsis patients according to age and BT and identified the risk factors for mortality using a machine learning method based on a combination of age and body temperature.
Journal ArticleDOI

Subtyping Hyperchloremia among Hospitalized Patients by Machine Learning Consensus Clustering

TL;DR: In this article, the authors performed consensus cluster analysis based on demographic information, principal diagnoses, comorbidities, and laboratory data among 11,394 hospitalized adult patients with admission serum chloride of >108 mEq/L.
Journal ArticleDOI

Cluster analysis integrating age and body temperature for mortality in patients with sepsis: a multicenter retrospective study

TL;DR: In this article , the authors investigated the mortality rates in sepsis patients according to age and BT and identified the risk factors for mortality using a machine learning method based on a combination of age and body temperature.
Journal ArticleDOI

Exploring and Identifying Prognostic Phenotypes of Patients with Heart Failure Guided by Explainable Machine Learning

TL;DR: ML could effectively identify patient prognostic phenotypes, facilitating reasonable management and treatment considering prognostic condition, and age and creatinine clearance rate were identified as the top two most important predictors.
References
More filters
Proceedings ArticleDOI

XGBoost: A Scalable Tree Boosting System

TL;DR: XGBoost as discussed by the authors proposes a sparsity-aware algorithm for sparse data and weighted quantile sketch for approximate tree learning to achieve state-of-the-art results on many machine learning challenges.
Journal ArticleDOI

mice: Multivariate Imputation by Chained Equations in R

TL;DR: Mice adds new functionality for imputing multilevel data, automatic predictor selection, data handling, post-processing imputed values, specialized pooling routines, model selection tools, and diagnostic graphs.
Journal ArticleDOI

Data clustering: 50 years beyond K-means

TL;DR: A brief overview of clustering is provided, well known clustering methods are summarized, the major challenges and key issues in designing clustering algorithms are discussed, and some of the emerging and useful research directions are pointed out.
Journal ArticleDOI

On Clustering Validation Techniques

TL;DR: The fundamental concepts of clustering are introduced while it surveys the widely known clustering algorithms in a comparative way and the issues that are under-addressed by the recent algorithms are illustrated.
Proceedings Article

Unsupervised deep embedding for clustering analysis

TL;DR: Deep Embedded Clustering (DEC) as discussed by the authors learns a mapping from the data space to a lower-dimensional feature space in which it iteratively optimizes a clustering objective.
Related Papers (5)