Using regression models for prediction: shrinkage and regression to the mean

Stat Methods Med Res. 1997 Jun;6(2):167-83. doi: 10.1177/096228029700600206.

Abstract

The use of a fitted regression model in predicting future cases, either as a diagnostic tool or as an instrument for risk assessment is discussed. The regression to the mean effect implies that the future values of the response variable tend to be closer to the overall mean than might be expected from the predicted values. The extent of this shrinkage is studied for multiple and logistic regression models, and is found to be related to simple goodness-of-fit statistics of the original regression. Shrinkage is a particularly serious problem if the sample size is small and/or the number of covariates is large. Shrinkage of predictors is illustrated by two examples. A more general formulation is suggested.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biometry*
  • Body Height / genetics
  • Humans
  • Logistic Models
  • Male
  • Models, Statistical*
  • Regression Analysis*