Significance Tests Harm Progress in Forecasting

Armstrong, J. Scott

Files

StatSigIJF361.pdf (335.3 KB)

Penn collection

Marketing Papers

Subject

accuracy measures
combining forecasts
confidence intervals
effect size
M-competition
meta-analysis
null hypothesis
practical significance
replications

Permalink

https://repository.upenn.edu/handle/20.500.14332/39677

Abstract

Based on a summary of prior literature, I conclude that tests of statistical significance harm scientific progress. Efforts to find exceptions to this conclusion have, to date, turned up none. Even when done correctly, significance tests are dangerous. I show that summaries of scientific research do not require tests of statistical significance. I illustrate the dangers of significance tests by examining an application to the M3-Competition. Although the authors of that reanalysis conducted a proper series of statistical tests, they suggest that the original M3 was not justified in concluding that combined forecasts reduce errors and that the selection of the best method is dependent upon the selection of a proper error measure. I show that the original conclusions were justified and that they are correct. Authors should try to avoid tests of statistical significance, journals should discourage them, and readers should ignore them. Instead, to analyze and communicate findings from empirical studies, one should use effect sizes, confidence intervals, replications/extensions, and meta-analyses.

Publication date

2007-04-01

Comments

Postprint version. Published in International Journal of Forecasting, Volume 23, Issue 2, April 2007, pages 321-327. Publisher URL: http://dx.doi.org/10.1016/j.ijforecast.2007.03.004

Collection

Articles