Sample size considerations for the external validation of a multivariable prognostic model: a resampling study.
TLDR
This study suggests that externally validating a prognostic model requires a minimum of 100 events and ideally 200 (or more) events, and provides guidance on sample size for investigators designing an external validation study.
Abstract
After developing a prognostic model, it is essential to evaluate its performance in samples independent of those used to develop it, which is often referred to as external validation. However, despite its importance, very little is known about the sample size requirements for conducting an external validation. Using a large real data set and resampling methods, we investigate the impact of sample size on the performance of six published prognostic models. Focussing on unbiased and precise estimation of performance measures (e.g. the c-index, D statistic and calibration), we provide guidance on sample size for investigators designing an external validation study. Our study suggests that externally validating a prognostic model requires a minimum of 100 events and ideally 200 (or more) events.
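The resampling logic behind the abstract can be illustrated with a minimal sketch. This is not the study's actual code or data: it uses a hypothetical simulated "population" (a logistic model on a single linear predictor, ~30% prevalence) in place of the large real data set, draws repeated validation samples of different sizes, and shows how the spread of the c-index across replicates narrows as the number of events grows. All names (`c_index`, `resample_c`) and parameter values are illustrative assumptions.

```python
import math
import random

def c_index(scores, outcomes):
    """Harrell's c-statistic for a binary outcome: the fraction of
    (event, non-event) pairs in which the event has the higher score,
    counting ties as half."""
    concordant, ties, pairs = 0, 0, 0
    events = [s for s, y in zip(scores, outcomes) if y == 1]
    nonevents = [s for s, y in zip(scores, outcomes) if y == 0]
    for se in events:
        for sn in nonevents:
            pairs += 1
            if se > sn:
                concordant += 1
            elif se == sn:
                ties += 1
    return (concordant + 0.5 * ties) / pairs

random.seed(1)

# Hypothetical "population": a linear predictor and outcomes generated
# from a logistic model (roughly 30% prevalence), standing in for the
# study's large real dataset.
N = 5000
lp = [random.gauss(0, 1) for _ in range(N)]
y = [1 if random.random() < 1 / (1 + math.exp(-(x - 1))) else 0 for x in lp]

def resample_c(n, reps=100):
    """Draw `reps` bootstrap validation samples of size `n` and return
    the c-index estimated in each; the spread across replicates shows
    how precision depends on sample size (and hence event count)."""
    cs = []
    for _ in range(reps):
        idx = [random.randrange(N) for _ in range(n)]
        cs.append(c_index([lp[i] for i in idx], [y[i] for i in idx]))
    return cs

# Sample sizes chosen so that ~50, ~100, and ~200 events are expected.
for n in (170, 340, 680):
    cs = sorted(resample_c(n))
    mean = sum(cs) / len(cs)
    lo, hi = cs[int(0.025 * len(cs))], cs[int(0.975 * len(cs))]
    print(f"n={n:4d}  mean c={mean:.3f}  95% range [{lo:.3f}, {hi:.3f}]")
```

The empirical 95% range of the c-index shrinks as the expected event count rises, which is the kind of precision argument the paper uses to motivate its minimum of 100 and preferred 200 (or more) events.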
Citations
Journal Article
Prediction models for diagnosis and prognosis of covid-19: systematic review and critical appraisal
Laure Wynants, Ben Van Calster, Gary S. Collins, Richard D. Riley, Georg Heinze, Ewoud Schuit, Marc J. M. Bonten, Darren Dahly, Johanna A. A. G. Damen, Thomas P. A. Debray, Valentijn M. T. de Jong, Maarten De Vos, Paula Dhiman, Maria C. Haller, Michael O. Harhay, Liesbet Henckaerts, Pauline Heus, Michael Kammer, Nina Kreuzberger, Anna Lohmann, Kim Luijken, Jie Ma, Glen P. Martin, David J. McLernon, Constanza L. Andaur Navarro, Johannes B. Reitsma, Jamie C. Sergeant, Chunhu Shi, Nicole Skoetz, Luc J. M. Smits, Kym I. E. Snell, Matthew Sperrin, René Spijker, Ewout W. Steyerberg, Toshihiko Takada, Ioanna Tzoulaki, Sander M. J. van Kuijk, Bas C. T. van Bussel, Iwan C. C. van der Horst, Florien S. van Royen, Jan Y. Verbakel, Christine Wallisch, Jack Wilkinson, Robert Wolff, Lotty Hooft, Karel G. M. Moons, Maarten van Smeden +55 more
TL;DR: Proposed models for covid-19 are poorly reported, at high risk of bias, and their reported performance is probably optimistic, according to a review of published and preprint reports.
Journal Article
Calculating the sample size required for developing a clinical prediction model.
Richard D. Riley, Joie Ensor, Kym I. E. Snell, Frank E. Harrell, Glen P. Martin, Johannes B. Reitsma, Karel G. M. Moons, Gary S. Collins, Maarten van Smeden +10 more
TL;DR: In this article, the authors provide guidance on how to calculate the sample size required to develop a clinical prediction model.
Journal Article
PROBAST: A Tool to Assess Risk of Bias and Applicability of Prediction Model Studies: Explanation and Elaboration
Karel G. M. Moons, Robert Wolff, Richard D. Riley, Penny Whiting, Marie Westwood, Gary S. Collins, Johannes B. Reitsma, Jos Kleijnen, Susan Mallett +8 more
TL;DR: The rationale behind the domains and signaling questions, how to use them, and how to reach domain-level and overall judgments about ROB and applicability of primary studies to a review question are described.
Journal Article
A calibration hierarchy for risk models was defined: from utopia to empirical data.
Ben Van Calster, Daan Nieboer, Yvonne Vergouwe, Bavo De Cock, Michael J. Pencina, Ewout W. Steyerberg +6 more
TL;DR: Strong calibration is desirable for individualized decision support but unrealistic and counterproductive, as it stimulates the development of overly complex models; model development and external validation should instead focus on moderate calibration.
Journal Article
External validation of clinical prediction models using big datasets from e-health records or IPD meta-analysis: opportunities and challenges
Richard D. Riley, Joie Ensor, Kym I. E. Snell, Thomas P. A. Debray, Doug G. Altman, Karel G. M. Moons, Gary S. Collins +6 more
TL;DR: Novel opportunities for external validation in big, combined datasets in e-health records and individual participant data meta-analysis are illustrated, drawing attention to methodological challenges and reporting issues.
References
Book
An introduction to the bootstrap
Bradley Efron, Robert Tibshirani +1 more
TL;DR: This book presents bootstrap methods for estimation using simple arguments, together with Minitab macros implementing them and examples of their use.
Journal Article
Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors
TL;DR: In this article, an easily interpretable index of predictive discrimination and methods for assessing the calibration of predicted survival probabilities are discussed; these apply to all regression models but are particularly needed for binary, ordinal, and time-to-event outcomes.
Book
Regression modeling strategies: with applications to linear models, logistic regression, and survival analysis
TL;DR: In this book, the authors present a case study in least squares fitting and interpretation of a linear model, using nonparametric transformations of X and Y to fit the regression.
Journal Article
General Cardiovascular Risk Profile for Use in Primary Care: The Framingham Heart Study
Ralph B. D'Agostino, Ramachandran S. Vasan, Michael J. Pencina, Philip A. Wolf, Mark R. Cobain, Joseph M. Massaro, William B. Kannel +6 more
TL;DR: A sex-specific multivariable risk factor algorithm can be conveniently used to assess general CVD risk and the risk of individual CVD events (coronary, cerebrovascular, and peripheral arterial disease, and heart failure), and to quantify risk and guide preventive care.
Book Chapter
Prognostic/Clinical Prediction Models: Multivariable Prognostic Models: Issues in Developing Models, Evaluating Assumptions and Adequacy, and Measuring and Reducing Errors
TL;DR: An easily interpretable index of predictive discrimination as well as methods for assessing calibration of predicted survival probabilities are discussed, applicable to all regression models, but are particularly needed for binary, ordinal, and time-to-event outcomes.