scispace - formally typeset
Journal ArticleDOI

Discrimination slope and integrated discrimination improvement - properties, relationships and impact of calibration.

Reads0
Chats0
TLDR
It is demonstrated that simple recalibration ascertaining calibration in-the-large and calibration slope equal to 1 are not sufficient to correct for some forms of mis-calibration, and it is concluded that R-squared metrics, including the discrimination slope, offer an attractive choice for quantifying model performance as long as one accounts for their sensitivity to model calibration.
Abstract
Discrimination slope, defined as the slope of a linear regression of predicted probabilities of event derived from a prognostic model on the binary event status, has recently gained popularity as a measure of model performance. It is as a building block for the integrated discrimination improvement that equals the difference in discrimination slopes between the two models being compared. Several authors have pointed out that it does not make sense to apply the integrated discrimination improvement and discrimination slope when working with mis-calibrated models, whereas others have raised concerns about the ability of improving discrimination slope without adding new information. In this paper, we show that under certain assumptions the discrimination slope is asymptotically related to two other R-squared measures, one of which is a rescaled version of the Brier score, known to be proper. Furthermore, we illustrate how a simple recalibration makes the slope equal to the rescaled Brier R-squared metric. We also show that the discrimination slope can be interpreted as a measure of reduction in expected regret for the Gini-Brier regret function. Using theoretical and practical examples, we illustrate how all of these metrics are affected by different levels of model mis-calibration. In particular, we demonstrate that simple recalibration ascertaining calibration in-the-large and calibration slope equal to 1 are not sufficient to correct for some forms of mis-calibration. We conclude that R-squared metrics, including the discrimination slope, offer an attractive choice for quantifying model performance as long as one accounts for their sensitivity to model calibration. Copyright © 2016 John Wiley & Sons, Ltd.

read more

Citations
More filters
Journal ArticleDOI

Quantifying the added value of new biomarkers: how and how not.

TL;DR: This commentary provides an overview of methods currently used to evaluate new biomarkers, describes their strengths and limitations, and offers some suggestions on their use.
Journal ArticleDOI

A comparison of statistical learning methods for deriving determining factors of accident occurrence from an imbalanced high resolution dataset

TL;DR: This analysis employs and compares a series of statistical learning techniques with respect to their predictive performance and discusses the importance of determining factors of accident occurrence from the ensemble of models, substantiate that a trade-off between accuracy and sensitivity is inherent to imbalanced classification problems.
Journal ArticleDOI

Use of Long-term Cumulative Blood Pressure in Cardiovascular Risk Prediction Models.

TL;DR: Using long-term measures of cumulative blood pressure, instead of single measurements, can modestly improve the ability of cardiovascular disease risk prediction models to correctly classify individuals in terms of their risk for cardiovascular disease.
Journal ArticleDOI

TRIPOD statement: a preliminary pre-post analysis of reporting and methods of prediction models

TL;DR: In this article, the authors assess the difference in completeness of reporting and methodological conduct of published prediction models before and after publication of the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) statement.
References
More filters
Journal ArticleDOI

Evaluating the added predictive ability of a new marker: From area under the ROC curve to reclassification and beyond

TL;DR: Two new measures, one based on integrated sensitivity and specificity and the other on reclassification tables, are introduced that offer incremental information over the AUC and are proposed to be considered in addition to the A UC when assessing the performance of newer biomarkers.
Journal ArticleDOI

Strictly Proper Scoring Rules, Prediction, and Estimation

TL;DR: The theory of proper scoring rules on general probability spaces is reviewed and developed, and the intuitively appealing interval score is proposed as a utility function in interval estimation that addresses width as well as coverage.
BookDOI

Regression Modeling Strategies

TL;DR: Regression models are frequently used to develop diagnostic, prognostic, and health resource utilization models in clinical, health services, outcomes, pharmacoeconomic, and epidemiologic research, and in a multitude of non-health-related areas.
Journal ArticleDOI

Coefficients of Determination in Logistic Regression Models—A New Proposal: The Coefficient of Discrimination

TL;DR: In this article, the authors propose the concept of the coefficient of discrimination (COC) as a measure of explanatory power for logistic regression models, which is an extension of R2.
Related Papers (5)