Computational and Statistical Boundaries for Submatrix Localization in a Large Noisy Matrix

Loading...
Thumbnail Image
Penn collection
Statistics Papers
Degree type
Discipline
Subject
computational boundary
computational complexity
detection
planted clique
lower bounds
minimax
signal to noise ratio
statistical boundary
submatrix localization
Physical Sciences and Mathematics
Funder
Grant number
License
Copyright date
Distributor
Related resources
Author
Cai, Tony
Liang, Tengyuan
Rakhlin, Alexander
Contributor
Abstract

We study in this paper computational and statistical boundaries for submatrix localization. Given one observation of (one or multiple nonoverlapping) signal submatrix (of magnitude λ and size km×kn) embedded in a large noise matrix (of size m × n), the goal is to optimal identify the support of the signal submatrix computationally and statistically. Two transition thresholds for the signal-to-noise ratio λ/σ are established in terms of m, n, km and kn. The first threshold, SNRc, corresponds to the computational boundary. We introduce a new linear time spectral algorithm that identifies the submatrix with high probability when the signal strength is above the threshold SNRc. Below this threshold, it is shown that no polynomial time algorithm can succeed in identifying the submatrix, under the hidden clique hypothesis. The second threshold, SNRs, captures the statistical boundary, below which no method can succeed in localization with probability going to one in the minimax sense. The exhaustive search method successfully finds the submatrix above this threshold. In marked contrast to submatrix detection and sparse PCA, the results show an interesting phenomenon that SNRc is always significantly larger than SNRs, which implies an essential gap between statistical optimality and computational efficiency for submatrix localization.

Advisor
Date Range for Data Collection (Start Date)
Date Range for Data Collection (End Date)
Digital Object Identifier
Series name and number
Publication date
2017-01-01
Journal title
The Annals of Statistics
Volume number
Issue number
Publisher
Publisher DOI
Journal Issue
Comments
Recommended citation
Collection