External Validation
- Bibliography

External Validation

External validation of a prediction model is the process of assessing how well a previously developed model performs in an independent sample from the intended target setting. It estimates the model’s transportability by measuring calibration, discrimination, and overall prediction accuracy in data that were not used to develop the model. Resampling methods are used for ‘internal validation’ of the model for data originating under the same setting. Though internal validation can be thought of as an estimate of external validation, it is not sufficient evidence of external validation. An independent dataset suitable for external validation generally has one or more of the following properties (Moons et al. (2012)):

Temporal differences
- Data may be collected from the same locations, but over different periods of time.
Geographic differences
- Data was collected from different locations.
Institutional differences
- Data was collected from an organization not connected with the original source.

Along with a host of other factors such as differences in eligibility criteria, predictor and outcome definitions, and follow up time.

A principled approach to external validation follows these steps:

Collect a suitable independent sample of sufficient size.
Create a descriptive summary table that compares the characteristics of the original sample vs. the external sample.
Compare prediction performance estimates for the following scenarios
1. Original apparent: The original model applied to the original data.
2. Original internally validated: The internally validated performance estimate from the original model applied to the original data.
3. Original externally validated: The original model applied to the new data.
Compare model parameter estimates and prediction performance estimates from
1. The original model
2. The original model selection algorithm applied to the new data only
3. The original model selection algorithm applied to the combined data (original + new)
4. Potentially an updated model selection algorithm applied to the combined data (original + new)
Discuss differences for these outcomes, whether they might be due to population differences, overfitting, underfitting, differences in data capture, extrapolation, etc.

Royston and Altman (2013) provide specific suggestions for Cox models.

Bibliography

Moons, Karel G M, Andre Pascal Kengne, Diederick E Grobbee, Patrick Royston, Yvonne Vergouwe, Douglas G Altman, and Mark Woodward. 2012. “Risk Prediction Models: II. External Validation, Model Updating, and Impact Assessment.” Heart 98 (9): 691–98. https://doi.org/10.1136/heartjnl-2011-301247.

Royston, Patrick, and Douglas G Altman. 2013. “External Validation of a Cox Prognostic Model: Principles and Methods.” BMC Medical Research Methodology 13 (1): 33. https://doi.org/10.1186/1471-2288-13-33.

Published: 2022-02-20
Last Updated: 2026-04-29

Table of Contents

External Validation

Bibliography