Bayesian Variable Selection in Structured High-Dimensional Covariate Spaces With Applications in Genomics

Penn collection
Statistics Papers
Discipline
Subject
Ising model
Markov chain Monte Carlo
motif analysis
phase transition
undirected graph
Statistics and Probability
Author
Li, Fan
Zhang, Nancy R
Abstract

We consider the problem of variable selection in regression modeling in high-dimensional spaces where there is known structure among the covariates. This is an unconventional variable selection problem for two reasons: (1) the dimension of the covariate space is comparable to, and often much larger than, the number of subjects in the study, and (2) the covariate space is highly structured, and in some cases it is desirable to incorporate this structural information into the model building process. We approach this problem through the Bayesian variable selection framework, where we assume that the covariates lie on an undirected graph and formulate an Ising prior on the model space for incorporating structural information. Certain computational and statistical problems arise that are unique to such high-dimensional, structured settings, the most interesting being the phenomenon of phase transitions. We propose theoretical and computational schemes to mitigate these problems. We illustrate our methods on two different graph structures: the linear chain and the regular graph of degree k. Finally, we use our methods to study a specific application in genomics: the modeling of transcription factor binding sites in DNA sequences.
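
To make the idea of an Ising prior on the model space concrete, the following is a minimal sketch and not the authors' implementation. It assumes binary inclusion indicators gamma_i in {0, 1} placed on the nodes of an undirected graph, and an unnormalized log prior of the form a * sum_i gamma_i + b * sum over edges (i, j) of gamma_i * gamma_j. This parameterization, the parameter names a and b, and all function names are illustrative assumptions; the paper's exact formulation may differ. The normalizing constant is computed by brute-force enumeration, which is feasible only for a handful of covariates.

```python
import itertools
import math


def chain_edges(p):
    """Edges of a linear chain on p nodes: (0,1), (1,2), ..., (p-2,p-1)."""
    return [(i, i + 1) for i in range(p - 1)]


def ising_log_prior(gamma, edges, a, b):
    """Unnormalized log prior of an inclusion vector gamma (entries 0 or 1).

    Assumed form (hypothetical parameterization):
        a * sum_i gamma_i + b * sum_{(i,j) in edges} gamma_i * gamma_j
    Negative a favors sparse models; positive b favors selecting
    neighboring covariates together.
    """
    sparsity = a * sum(gamma)
    smoothness = b * sum(gamma[i] * gamma[j] for i, j in edges)
    return sparsity + smoothness


def exact_prior(p, edges, a, b):
    """Normalized prior over all 2**p models by enumeration (small p only)."""
    configs = list(itertools.product((0, 1), repeat=p))
    log_weights = [ising_log_prior(g, edges, a, b) for g in configs]
    shift = max(log_weights)  # subtract the max for numerical stability
    weights = [math.exp(lw - shift) for lw in log_weights]
    total = sum(weights)
    return {g: w / total for g, w in zip(configs, weights)}


def expected_model_size(prior):
    """Prior expected number of selected covariates."""
    return sum(sum(g) * prob for g, prob in prior.items())


if __name__ == "__main__":
    p = 8
    edges = chain_edges(p)
    for b in (0.0, 1.0, 2.0, 3.0):
        prior = exact_prior(p, edges, a=-2.0, b=b)
        print(f"b={b:.1f}  expected model size={expected_model_size(prior):.3f}")
```

Running the script prints the prior expected model size on a short chain for several interaction strengths. Under this assumed parameterization the expected size does not decrease as b grows with a held fixed, which gives a small-scale feel for why the interaction strength must be chosen with care in structured priors of this kind.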

Publication date
2010-01-01
Journal title
Journal of the American Statistical Association
Comments
At the time of publication, author Nancy R. Zhang was affiliated with Stanford University. Currently, she is a faculty member in the Statistics Department at the University of Pennsylvania.