Discrimination slope and integrated discrimination improvement - properties, relationships and impact of calibration.

doi:10.1002/SIM.7139

Journal ArticleDOI

Discrimination slope and integrated discrimination improvement - properties, relationships and impact of calibration.

Michael J. Pencina, +2 more

- 10 Dec 2017 -

Statistics in Medicine

- Vol. 36, Iss: 28, pp 4482-4490

Chats0

TLDR

It is demonstrated that simple recalibration ascertaining calibration in-the-large and calibration slope equal to 1 are not sufficient to correct for some forms of mis-calibration, and it is concluded that R-squared metrics, including the discrimination slope, offer an attractive choice for quantifying model performance as long as one accounts for their sensitivity to model calibration.

Abstract:

Discrimination slope, defined as the slope of a linear regression of predicted probabilities of event derived from a prognostic model on the binary event status, has recently gained popularity as a measure of model performance. It is as a building block for the integrated discrimination improvement that equals the difference in discrimination slopes between the two models being compared. Several authors have pointed out that it does not make sense to apply the integrated discrimination improvement and discrimination slope when working with mis-calibrated models, whereas others have raised concerns about the ability of improving discrimination slope without adding new information. In this paper, we show that under certain assumptions the discrimination slope is asymptotically related to two other R-squared measures, one of which is a rescaled version of the Brier score, known to be proper. Furthermore, we illustrate how a simple recalibration makes the slope equal to the rescaled Brier R-squared metric. We also show that the discrimination slope can be interpreted as a measure of reduction in expected regret for the Gini-Brier regret function. Using theoretical and practical examples, we illustrate how all of these metrics are affected by different levels of model mis-calibration. In particular, we demonstrate that simple recalibration ascertaining calibration in-the-large and calibration slope equal to 1 are not sufficient to correct for some forms of mis-calibration. We conclude that R-squared metrics, including the discrimination slope, offer an attractive choice for quantifying model performance as long as one accounts for their sensitivity to model calibration. Copyright © 2016 John Wiley & Sons, Ltd.

Discrimination slope and integrated discrimination improvement - properties, relationships and impact of calibration.

Citations

Order Restricted Statistical Inference.

Quantifying the added value of new biomarkers: how and how not.

A comparison of statistical learning methods for deriving determining factors of accident occurrence from an imbalanced high resolution dataset

Use of Long-term Cumulative Blood Pressure in Cardiovascular Risk Prediction Models.

TRIPOD statement: a preliminary pre-post analysis of reporting and methods of prediction models

References

Evaluating the added predictive ability of a new marker: From area under the ROC curve to reclassification and beyond

Strictly Proper Scoring Rules, Prediction, and Estimation

Verification of forecasts expressed in terms of probability

Regression Modeling Strategies

Coefficients of Determination in Logistic Regression Models—A New Proposal: The Coefficient of Discrimination

Related Papers (5)

Evaluating the added predictive ability of a new marker: From area under the ROC curve to reclassification and beyond

Net reclassification index at event rate: properties and relationships

A note on the evaluation of novel biomarkers: do not rely on integrated discrimination improvement and net reclassification index.

Extensions of net reclassification improvement calculations to measure usefulness of new biomarkers

2013 ACC/AHA Guideline on the Treatment of Blood Cholesterol to Reduce Atherosclerotic Cardiovascular Risk in Adults A Report of the American College of Cardiology/American Heart Association Task Force on Practice Guidelines