Statistical Inference After Model Selection

Penn collection
Statistics Papers
Subject
model selection
statistical inference
mixtures of distributions
Criminology and Criminal Justice
Other Statistics and Probability
Statistics and Probability
Author
Berk, Richard A
Brown, Lawrence D
Zhao, Linda
Abstract

Conventional statistical inference requires that a model of how the data were generated be known before the data are analyzed. Yet in criminology, and in the social sciences more broadly, a variety of model selection procedures are routinely undertaken, followed by statistical tests and confidence intervals computed for a “final” model. In this paper, we examine such practices and show how they are typically misguided. The parameters being estimated are no longer well defined, and post-model-selection sampling distributions are mixtures with properties that are very different from what is conventionally assumed. As a result, confidence intervals and statistical tests do not perform as they should. We examine in some detail the specific mechanisms responsible. We also offer some suggestions for better practice and show, through a criminal justice example using real data, how proper statistical inference may in principle be obtained.
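
The mechanism described in the abstract can be illustrated with a small simulation. The sketch below is not taken from the paper; it is a minimal illustration under assumed settings. The two-predictor linear model, the values of `beta1`, `beta2`, and `rho`, the sample size, the rule of dropping `x2` when its t-statistic is below 1.96, and the function names `fit_and_cover` and `simulate_coverage` are all hypothetical choices made for this example. It compares how often a nominal 95% interval for the coefficient on `x1` covers the true value when the full model is always used versus when the interval is computed from whichever model the selection step picks.

```python
import math
import random


def fit_and_cover(x1, x2, y, beta1, z=1.96):
    """Return (covered_full, covered_post) for one simulated data set.

    Both flags refer to a nominal 95% interval for the coefficient on x1,
    using a normal critical value as an approximation (an assumption).
    """
    n = len(y)
    s11 = sum(a * a for a in x1)
    s22 = sum(b * b for b in x2)
    s12 = sum(a * b for a, b in zip(x1, x2))
    s1y = sum(a * c for a, c in zip(x1, y))
    s2y = sum(b * c for b, c in zip(x2, y))

    # Full model y ~ x1 + x2, no intercept (all variables have mean zero
    # by construction), solved from the 2x2 normal equations.
    det = s11 * s22 - s12 * s12
    b1 = (s22 * s1y - s12 * s2y) / det
    b2 = (s11 * s2y - s12 * s1y) / det
    rss = sum((c - b1 * a - b2 * b) ** 2 for a, b, c in zip(x1, x2, y))
    sigma2 = rss / (n - 2)
    se1 = math.sqrt(sigma2 * s22 / det)
    se2 = math.sqrt(sigma2 * s11 / det)
    covered_full = abs(b1 - beta1) <= z * se1

    # Selection step (hypothetical rule): keep x2 only if "significant".
    if abs(b2 / se2) > z:
        return covered_full, covered_full

    # Reduced model y ~ x1, with the interval computed as if this model
    # had been specified in advance.
    b1r = s1y / s11
    rss_r = sum((c - b1r * a) ** 2 for a, c in zip(x1, y))
    se1r = math.sqrt(rss_r / (n - 1) / s11)
    return covered_full, abs(b1r - beta1) <= z * se1r


def simulate_coverage(reps=2000, n=100, beta1=1.0, beta2=0.2, rho=0.7, seed=0):
    """Estimate coverage of the full-model and post-selection intervals."""
    rng = random.Random(seed)
    full_hits = 0
    post_hits = 0
    for _ in range(reps):
        x1 = [rng.gauss(0.0, 1.0) for _ in range(n)]
        # x2 is correlated with x1 (correlation rho in the population).
        x2 = [rho * a + math.sqrt(1.0 - rho * rho) * rng.gauss(0.0, 1.0) for a in x1]
        y = [beta1 * a + beta2 * b + rng.gauss(0.0, 1.0) for a, b in zip(x1, x2)]
        covered_full, covered_post = fit_and_cover(x1, x2, y, beta1)
        full_hits += covered_full
        post_hits += covered_post
    return full_hits / reps, post_hits / reps


if __name__ == "__main__":
    full_cov, post_cov = simulate_coverage()
    print(f"full-model coverage:     {full_cov:.3f}")
    print(f"post-selection coverage: {post_cov:.3f}")
```

Under these assumed settings, the estimate reported after selection is drawn from a mixture of two estimators. When `x2` is dropped, the coefficient on `x1` in the reduced model targets roughly `beta1 + rho * beta2` rather than `beta1`, which is one way to see why the parameter being estimated is no longer well defined and why the nominal interval can undercover.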

Publication date
2010-06-01
Journal title
Journal of Quantitative Criminology