VIF Regression: A Fast Regression Algorithm for Large Data
Loading...
Penn collection
Statistics Papers
Degree type
Discipline
Subject
marginal false discovery rate
model selection
stepwise regression
variable selection
Applied Statistics
Statistics and Probability
model selection
stepwise regression
variable selection
Applied Statistics
Statistics and Probability
Funder
Grant number
License
Copyright date
Distributor
Related resources
Author
Lin, Dongyu
Foster, Dean P
Ungar, Lyle H
Contributor
Abstract
We propose a fast and accurate algorithm, VIF regression, for doing feature selection in large regression problems. VIF regression is extremely fast: it uses a one-pass search over the predictors, and a computationally efficient method of testing each potential predictor for addition to the model. VIF regression provably avoids model over-fitting, controlling marginal False Discovery Rate (mFDR). Numerical results show that it is much faster than any other published algorithm for regression with feature selection, and is as accurate as the best of the slower algorithms.
Advisor
Date Range for Data Collection (Start Date)
Date Range for Data Collection (End Date)
Digital Object Identifier
Series name and number
Publication date
2011-01-01
Journal title
Journal of the American Statistical Association