scispace - formally typeset
Open Access

An Introduction to Classification and Regression Tree (CART) Analysis

Reads0
Chats0
TLDR
A common goal of many clinical research studies is the development of a reliable clinical decision rule, which can be used to classify new patients into clinically-important categories, and there are a number of reasons for these difficulties.
Abstract
Introduction A common goal of many clinical research studies is the development of a reliable clinical decision rule, which can be used to classify new patients into clinically-important categories. Examples of such clinical decision rules include triage rules, whether used in the out-of-hospital setting or in the emergency department, and rules used to classify patients into various risk categories so that appropriate decisions can be made regarding treatment or hospitalization. Traditional statistical methods are cumbersome to use, or of limited utility, in addressing these types of classification problems. There are a number of reasons for these difficulties. First, there are generally many possible " predictor " variables which makes the task of variable selection difficult. Traditional statistical methods are poorly suited for this sort of multiple comparison. Second, the predictor variables are rarely nicely distributed. Many clinical variables are not normally distributed and different groups of patients may have markedly different degrees of variation or variance. Third, complex interactions or patterns may exist in the data. For example, the value of one variable (e.g., age) may substantially affect the importance of another variable (e.g., weight). These types of interactions are generally difficult to model, and virtually impossible to model when the number of interactions and variables becomes substantial. Fourth, the results of traditional methods may be difficult to use. For example, a multivariate logistic regression model yields a probability of disease, which can be calculated using the regression coefficients and the characteristics of the patient, yet such models are rarely utilized in clinical practice. Clinicians generally do not think in terms of probability but, rather in terms of categories, such as " low risk " versus " high risk. " Regardless of the statistical methodology being used, the creation of a clinical decision rule requires a relatively large dataset. For each patient in the dataset, one variable (the dependent variable), records whether or not that patient had the condition which we hope to predic t accurately in future patients. Examples might include significant injury after trauma, myocardial infarction, or subarachnoid hemorrhage in the setting of headache. In addition, other variables record the values of patient characteristics which we believe might help us to predict the value of the dependent variable. For example, if one hopes to predict the presence of subarachnoid hemorrhage, a possible predictor variable might be whether or not the patient's headache was sudden in onset; another possible …

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Risk Stratification for In-Hospital Mortality in Acutely Decompensated Heart Failure: Classification and Regression Tree Analysis

TL;DR: The results suggest that ADHF patients at low, intermediate, and high risk for in-hospital mortality can be easily identified using vital sign and laboratory data obtained on hospital admission and provides clinicians with a validated, practical bedside tool for mortality risk stratification.
Journal ArticleDOI

Decreased beta-amyloid1-42 and increased tau levels in cerebrospinal fluid of patients with Alzheimer disease.

TL;DR: The findings suggest that the 2 measures, CSF beta-amyloid and tau, are biological markers of AD pathophysiology and may have a potential clinical utility as biomarkers of disease.
Journal ArticleDOI

Regional patterns of agricultural land use and deforestation in Colombia

TL;DR: In this article, the authors investigated the impact of ignoring the regional variability of model parameters, and identified biophysical and socioeconomic factors that best explain the current spatial pattern and inter-regional variation in forest cover.
Journal ArticleDOI

Serum Drug Concentrations Predictive of Pulmonary Tuberculosis Outcomes

TL;DR: Low drug AUCs are predictive of clinical outcomes in tuberculosis patients, and low rifampin and isoniazid peak and AUC concentrations preceded all cases of acquired drug resistance.
Journal ArticleDOI

Genetic influence on variability in human acute experimental pain sensitivity associated with gender, ethnicity and psychological temperament.

TL;DR: It is demonstrated that gender, ethnicity and temperament contribute to individual variation in thermal and cold pain sensitivity by interactions with TRPV1 and OPRD1 single nucleotide polymorphisms.
References
More filters
Book

Classification and regression trees

Leo Breiman
TL;DR: The methodology used to construct tree structured rules is the focus of a monograph as mentioned in this paper, covering the use of trees as a data analysis method, and in a more mathematical framework, proving some of their fundamental properties.
Journal ArticleDOI

Predictive Factors of Restenosis After Coronary Stent Placement

TL;DR: Multivariate analysis demonstrated that diabetes mellitus, placement of multiple stents and minimal lumen diameter (MLD) immediately after stenting were the strongest predictors of restenosis.
Journal ArticleDOI

A Classification Tree Approach to the Development of Actuarial Violence Risk Assessment Tools

TL;DR: This work proposes a classification tree rather than a main effects regression approach for actuarial violence risk assessment tools, and suggests that by employing two decision thresholds for identifying high- and low-risk cases, the use of actuarial tools to make dichotomous risk classification decisions may be further enhanced.
Journal Article

Classification and regression tree analysis of 1000 consecutive patients with unknown primary carcinoma.

TL;DR: These analyses demonstrated that important prognostic variables were consistently applied by the CART program and effectively segregated patients into groups with similar clinical features and survival.
Journal ArticleDOI

Predictive value of history and physical examination in patients with suspected ectopic pregnancy.

TL;DR: History and physical examination findings predictive of EP were identified, however, no constellation of findings could confirm or exclude this diagnosis with a high degree of reliability.