Inference And Learning: Computational Difficulty And Efficiency

Liang, Tengyuan

Inference And Learning: Computational Difficulty And Efficiency

Files

Liang_upenngdas_0175C_12695.pdf (4.07 MB)

Degree type

Doctor of Philosophy (PhD)

Graduate group

Statistics

Subject

Computer Sciences
Statistics and Probability

Copyright date

2018-02-23T20:17:00-08:00

Permalink

https://repository.upenn.edu/handle/20.500.14332/29338

View all metadata

Author

Liang, Tengyuan

Abstract

In this thesis, we mainly investigate two collections of problems: statistical network inference and model selection in regression. The common feature shared by these two types of problems is that they typically exhibit an interesting phenomenon in terms of computational difficulty and efficiency. For statistical network inference, our goal is to infer the network structure based on a noisy observation of the network. Statistically, we model the network as generated from the structural information with the presence of noise, for example, planted submatrix model (for bipartite weighted graph), stochastic block model, and Watts-Strogatz model. As the relative amount of signal-to-noise'' varies, the problems exhibit different stages of computational difficulty. On the theoretical side, we investigate these stages through characterizing the transition thresholds on the signal-to-noise'' ratio, for the aforementioned models. On the methodological side, we provide new computationally efficient procedures to reconstruct the network structure for each model. For model selection in regression, our goal is to learn a ``good'' model based on a certain model class from the observed data sequences (feature and response pairs), when the model can be misspecified. More concretely, we study two model selection problems: to learn from general classes of functions based on i.i.d. data with minimal assumptions, and to select from the sparse linear model class based on possibly adversarially chosen data in a sequential fashion. We develop new theoretical and algorithmic tools beyond empirical risk minimization to study these problems from a learning theory point of view.

Advisor

Tony T. Cai
Alexander Rakhlin

Date of degree

2017-01-01

Collection

Dissertations and Theses