Statistics Papers

The aim of statistical modeling is to empower effective decision making, and the unique contribution of the field is its ability to incorporate multiple levels of uncertainty in the framing of wise decisions. Over the last few years, the development of new computational tools and the unprecedented evolution of “big data” have propelled statistical modeling to new levels. Today statistical modeling and machine learning have reached a level of impact that no large organization can afford to ignore. The information landscape is changing as it has never changed before.

At Wharton, the Department of Statistics is proud to have had a leadership role in this development. It participates in a wide range of university consortia that span the fields of computer science, neuroscience, medicine, public policy, and finance. Moreover, our faculty members have won singular international recognition for their contributions to many parts of statistical science, including observational studies, statistical algorithms, game theory, high dimensional inference, information theory, nonparametric function estimation, model selection, time series analysis, machine learning, and probability theory.


Papers from 2017

PDF

Weighted False Discovery Rate Control in Large-Scale Multiple Testing, Pallavi Basu, Tony Cai, Kiranmoy Das, and Wenguang Sun

PDF

Universal Limit Theorems in Graph Coloring Problems With Connections to Extremal Combinatorics, Bhaswar B. Bhattacharya, Persi Diaconis, and Sumit Mukherjee

PDF

Degree Sequence of Random Permutation Graphs, Bhaswar B. Bhattacharya and Sumit Mukherjee

PDF

Adaptive Estimation of Planar Convex Sets, Tony Cai, Adityanand Guntuboyina, and Yuting Wei

PDF

Confidence Intervals for High-Dimensional Linear Regression: Minimax Rates and Adaptivity, Tony Cai and Zijian Guo

PDF

Computational and Statistical Boundaries for Submatrix Localization in a Large Noisy Matrix, Tony Cai, Tengyuan Liang, and Alexander Rakhlin

PDF

Optimal Screening and Discovery of Sparse Signals with Applications to Multistage High-throughput Studies, Tony Cai and Wenguang Sun

PDF

Customer Acquisition via Display Advertising Using Multi-Armed Bandit Experiments, Eric M. Schwartz, Eric T. Bradlow, and Peter S. Fader

PDF

Explaining Normal Quantile-Quantile Plots Through Animation: The Water-Filling Analogy, Robert A. Stine

Papers from 2016

PDF

A Central Limit Theorem for Temporally Non-Homogenous Markov Chains with Applications to Dynamic Programming, Alessandro Arlotto and J. Michael Steele

PDF

Beardwood-Halton-Hammersley Theorem for Stationary Ergodic Sequences: A Counterexample, Alessandro Arlotto and J. Michael Steele

PDF

Collision Times in Multicolor Urn Models and Sequential Graph Coloring With Applications to Discrete Logarithms, Bhaswar B. Bhattacharya

PDF

Global Testing Against Sparse Alternatives in Time-Frequency Analysis, Tony Cai, Yonina C. Eldar, and Xiaodong Li

PDF

Accuracy Assessment for High-Dimensional Linear Regression, Tony Cai and Zijian Guo

PDF

Optimal Large-Scale Quantum State Tomography With Pauli Measurements, Tony Cai, Donggyu Kim, Yazhen Wang, Ming Yuan, and Harrison H. Zhou

PDF

Geometric Inference for General High-Dimensional Linear Inverse Problems, Tony Cai, Tengyuan Liang, and Alexander Rakhlin

PDF

Estimating Sparse Precision Matrix: Optimal Rates of Convergence and Adaptive Estimation, Tony Cai, Weidong Liu, and Harrison H. Zhou

PDF

Optimal Rates of Convergence for Noisy Sparse Phase Retrieval via Thresholded Wirtinger Flow, Tony Cai, Xiaodong Li, and Zongming Ma

PDF

Estimating Structured High-Dimensional Covariance and Precision Matrices: Optimal Rates and Adaptive Estimation, Tony Cai, Zhao Ren, and Harrison H. Zhou

PDF

Matrix Completion via Max-Norm Constrained Optimization, Tony Cai and Wen-Xin Zhou

PDF

Estimating an NBA Player’s Impact on His Team’s Chances of Winning, Sameer K. Deshpande and Shane T. Jensen

PDF

Sparse CCA: Adaptive Estimation and Computational Barriers, Chao Gao, Zongming Ma, and Harrison Zhou

PDF

Familywise Error Rate Control via Knockoffs, Lucas Janson and Weijie Su

PDF

Impartial Predictive Modeling: Ensuring Fairness in Arbitrary Models, Kory D. Johnson, Dean P. Foster, and Robert A. Stine

PDF

Instrumental Variables Estimation With Some Invalid Instruments and its Application to Mendelian Randomization, Hyunseung Kang, Anru Zhang, Tony Cai, and Dylan Small

PDF

Power Weighted Densities for Time Series Data, Daniel McCarthy and Shane T. Jensen

PDF

Efficient Empirical Bayes Prediction Under Check Loss Using Asymptotic Risk Estimates, Gourab Mukherjee, Lawrence D. Brown, and Paat Rusmevichientong

PDF

Comparison of the Value of Nursing Work Environments in Hospitals Across Different Levels of Patient Risk, Jeffrey H. Silber, Paul R. Rosenbaum, Matthew D. McHugh, Justin M. Ludwig, Herbert L. Smith, Bijan A. Niknam, Orit Even-Shoshan, Lee A. Fleisher, Rachel R. Kelz, and Linda H. Aiken

PDF

The Bruss-Robertson Inequality: Elaborations, Extensions, and Applications, J. Michael Steele

PDF

False Discoveries Occur Early on the Lasso Path, Weijie Su, Malgorzata Bogdan, and Emmanuel Candès

PDF

SLOPE is Adaptive to Unknown Sparsity and Asymptotically Minimax, Weijie Su and Emmanuel Candès

PDF

Nonparametric Multi-Level Clustering of Human Epilepsy Seizures, Drausin F. Wulsin, Shane T. Jensen, and Brian Litt

PDF

Optimal Shrinkage Estimation of Mean Parameters in Family of Distributions With Quadratic Variance, Xianchao Xie, Samuel C. Kou, and Lawrence D. Brown

PDF

Scanning a Poisson Random Field for Local Signals, Nancy R. Zhang, Benjamin Yakir, Charlie L. Xia, and David O. Siegmund

Papers from 2015

PDF

Potential Mechanisms for Cancer Resistance in Elephants and Comparative Cellular Response to DNA Damage in Humans, Lisa M. Abegglen, Aleah Fox Caulin, Ashley Chan, Kristy Lee, Rosann Robinson, Michael S. Campbell, Wendy K. Kiso, Dennis L. Schmitt, Peter J. Waddell, Srividya Bhaskara, Shane T. Jensen, Carlo C. Maley, and Joshua D. Schiffman

PDF

A Spectral Algorithm for Latent Dirichlet Allocation, Anima Anandkumar, Dean P. Foster, Daniel Hsu, Sham Kakade, and Yi-Kai Liu

PDF

OpenWAR: An Open Source System for Evaluating Overall Player Performance in Major League Baseball, Benjamin S. Baumer, Shane T. Jensen, and Gregory J. Matthews

PDF

Twitter Event Networks and the Superstar Model, Shankar Bhamidi, J. Michael Steele, and Tauhid Zaman

PDF

SLOPE – Adaptive Variable Selection via Convex Optimization, Malgorzata Bogdan, Ewout Van Den Berg, Chiara Sabatti, Weijie Su, and Emmanuel Candès

PDF

Models as Approximations - A Conspiracy of Random Regressors and Model Deviations Against Classical Inference in Regression, Andreas Buja, Richard A. Berk, Lawrence D. Brown, Edward I. George, Emil Pitkin, Mikhail Traskin, Linda Zhao, and Kai Zhang

PDF

Robust and Computationally Feasible Community Detection in the Presence of Arbitrary Outlier Nodes, Tony Cai and Xiaodong Li

PDF

Optimal Estimation and Rank Detection for Sparse Spiked Covariance Matrices, T. Tony Cai, Zongming Ma, and Yihong Wu

PDF

Graph-Based Change-Point Detection, Hao Chen and Nancy Zhang

PDF

Disease Diagnosis From Immunoassays With Plate to Plate Variability: A Hierarchical Bayesian Approach, Oliver Entine, Dylan S. Small, Shane T. Jensen, Gerardo Sanchez Garcia, Milagros Bastos Mazuelos, Manuela R. Verastegui Pimentel, and Michael Z. Levy

PDF

Supplement to "Minimax Estimation in Sparse Canonical Correlation Analysis", Chao Gao, Zongming Ma, Zhao Ren, and Harrison H. Zhou

PDF

Medical Students in the Emergency Department and Patient Length of Stay, Kimon Ioannides, Mira Mamtani, Frances S. Shofer, Dylan S. Small, Sean Hennessey, Benjamin Abella, and Kevin Scott

PDF

Discussion of "Frequentist Coverage of Adaptive Nonparametric Bayesian Credible Sets", Mark G. Low and Zongming Ma

PDF

Computational Barriers in Minimax Submatrix Detection, Zongming Ma and Yihong Wu

PDF

Robust Dimension Free Isoperimetry in Gaussian Space, Elchanan Mossel and Joe Neeman

PDF

Robust Optimality of Gaussian Noise Stability, Elchanan Mossel and Joe Neeman

PDF

Empirical Bayes Prediction for the Multivariate Newsvendor Loss Function, Gourab Mukherjee, Lawrence D. Brown, and Paat Rusmevichientong

PDF

Sequential Complexities and Uniform Martingale Laws of Large Numbers, Alexander Rakhlin, Karthik Sridharan, and Ambuj Tewari

PDF

Some Counterclaims Undermine Themselves in Observational Studies, Paul R. Rosenbaum

Papers from 2014

PDF

Misspecified Mean Function Regression: Making Good Use of Regression Models That Are Wrong, Richard A. Berk, Lawrence D. Brown, Andreas Buja, Edward I. George, Emil Pitkin, Kai Zhang, and Linda Zhao

PDF

Minimum-Weight Edge Discriminators in Hypergraphs, Bhaswar B. Bhattacharya, Sayantan Das, and Shirshendu Ganguly

PDF

Variable Selection for BART: An Application to Gene Regulation, Justin Bleich, Adam Kapelner, Edward I. George, and Shane T. Jensen

PDF

Adaptive Confidence Bands for Nonparametric Regression Functions, T. Tony Cai, Mark G. Low, and Zongming Ma

PDF

Discussion: “A Significance Test for the Lasso”, T. Tony Cai and Ming Yuan

PDF

Sparse Representation of a Polytope and Recovery of Sparse Signals and Low-Rank Matrices, T. Tony Cai and Anru Zhang

PDF

Using an Instrumental Variable to Test for Unmeasured Confounding, Jing Cheng, Scott A. Lorch, Dylan S. Small, and Zijian Guo

PDF

Spectral Learning of Latent-Variable PCFGs: Algorithms and Sample Complexity, Shay B. Cohen, Karl Stratos, Michael Collins, Dean P. Foster, and Lyle H. Ungar

PDF

Equivalence Testing for Functional Data With an Application to Comparing Pulmonary Function Devices, Colin B. Fogarty and Dylan S. Small

PDF

Improving Money’s Worth Ratio Calculations: The Case of Singapore’s Pension Annuities, Joelle H. Fong, Jean Lemaire, and Yiu K. Tse

PDF

Large Sample Bounds on the Survivor Average Causal Effect in the Presence of a Binary Covariate with Conditionally Ignorable Treatment Assignment, Michael H. Freiman and Dylan S. Small

PDF

Hunting for Significance: Bayesian Classifiers Under a Mixture Loss Function, Igar Fuki, Lawrence D. Brown, Xu Han, and Linda Zhao

PDF

Boundary Value Problems for a Family of Domains in the Sierpinski Gasket, Zijian Guo, Rachel Kogan, Hua Qiu, and Robert S. Strichartz

PDF

Clustered Treatment Assignments and Sensitivity to Unmeasured Biases in Observational Studies, Ben B. Hansen, Paul R. Rosenbaum, and Dylan S. Small

PDF

A Level-Set Hit-And-Run Sampler for Quasi-Concave Distributions, Shane T. Jensen and Dean P. Foster

PDF

Geometric Influences II: Correlation Inequalities and Noise Sensitivity, Nathan Keller, Elchanan Mossel, and Arnab Sen

PDF

High-Dimensional Learning of Linear Causal Networks via Inverse Covariance Estimation, Po-Ling Loh and Peter Bühlmann

PDF

Majority Dynamics and Aggregation of Information in Social Networks, Elchanan Mossel, Joe Neeman, and Omer Tamuz

PDF

Asymptotic Learning on Bayesian Social Networks, Elchanan Mossel, Allan Sly, and Omer Tamuz

PDF

Predicting the Draft and Career Success of Tight Ends in the National Football League, Jason Mulholland and Shane T. Jensen

PDF

Metastatic Tumor Evolution and Organoid Modeling Implicate TGFBR2 as a Cancer Driver in Diffuse Gastric Cancer, Lincoln Nadauld, Sarah Garcia, Georges Natsoulis, John M. Bell, Laura Miotke, Erik S. Hopmans, Hua Xu, Reetesh K. Pai, Curt Palm, John F. Regan, Hao Chen, Patrick Flaherty, Akifumi Ootani, Nancy R. Zhang, James M. Ford, Calvin J. Kuo, and Hanlee P. Ji

PDF

Anesthesia Technique, Mortality, and Length of Stay After Hip Fracture Surgery, Mark Neuman, Paul R. Rosenbaum, Justin Ludwig, Jose Zubizarreta, and Jeffrey H. Silber

PDF

On Extracting Common Random Bits From Correlated Sources on Large Alphabets, Siu On Chan, Elchanan Mossel, and Joe Neeman

PDF

Online Nonparametric Regression, Alexander Rakhlin and Karthik Sridharan

PDF

Probability Aggregation in Time-Series: Dynamic Hierarchical Modeling of Sparse Expert Beliefs, Ville Satopää, Shane T. Jensen, Barbara Mellers, Philip E. Tetlock, and Lyle H. Ungar

PDF

Estimation of Causal Effects Using Instrumental Variables With Nonignorable Missing Covariates: Application to Effect of Type of Delivery NICU on Premature Infants, Fan Yang, Scott A. Lorch, and Dylan S. Small

PDF

Uniform Correlation Mixture of Bivariate Normal Distributions and Hypercubically-Contoured Densities That Are Marginally Normal, Kai Zhang, Lawrence D. Brown, Edward I. George, and Linda Zhao

PDF

Matching for Balance, Pairing for Heterogeneity in an Observational Study of the Effectiveness of For-Profit and Not-For-Profit High Schools in Chile, José R. Zubizarreta, Ricardo D. Paredes, and Paul R. Rosenbaum

PDF

Isolation in the Construction of Natural Experiments, José R. Zubizarreta, Dylan S. Small, and Paul R. Rosenbaum

Papers from 2013

PDF

Stochastic Convex Optimization With Bandit Feedback, Alekh Agarwal, Dean P. Foster, Daniel Hsu, Sham M. Kakade, and Alexander Rakhlin

PDF

The Power to See: A New Graphical Test of Normality, Sivan Aldor-Noiman, Lawrence D. Brown, Robert A. Stine, Andreas Buja, and Wolfgang Rolke

PDF

Noise Correlation Bounds for Uniform Low Degree Functions, Per Austrin and Elchanan Mossel

PDF

The Effects of City Streets on an Urban Disease Vector, Corentin M. Barbu, Andrew Hong, Jennifer M. Manne, Dylan S. Small, Javier E. Quintanilla Calderón, Karthik Sethuraman, Víctor Quispe-Machaca, Jenny Ancca-Juárez, Juan G. Cornejo del Carpio, Fernando S. Málaga Chavez, César Náquira, and Michael Z. Levy

PDF

Valid Post-Selection Inference, Richard A. Berk, Lawrence D. Brown, Andreas Buja, Kai Zhang, and Linda Zhao

PDF

Scaling Limits for Width Two Partially Ordered Sets: The Incomparability Window, Nayantara Bhatnagar, Nick Crawford, Elchanan Mossel, and Arnab Sen

PDF

The Poisson Compound Decision Problem Revisited, Lawrence D. Brown, Eitan Greenshtein, and Ya'acov Ritov

PDF

Two-Sample Covariance Matrix Testing and Support Recovery, Tony Cai, Weidong Liu, and Yin Xia

PDF

Optimal Hypothesis Testing for High Dimensional Covariance Matrices, Tony Cai and Zongming Ma

PDF

Distributions of Angles in Random Packing on Spheres, T. Tony Cai, Jianqing Fan, and Tiefeng Jiang

PDF

Adaptive Confidence Intervals for Regression Functions Under Shape Constraints, T. Tony Cai, Mark G. Low, and Yin Xia

PDF

Sparse PCA: Optimal Rates and Adaptive Estimation, T. Tony Cai, Zongming Ma, and Yihong Wu

PDF

Optimal Rates of Convergence for Estimating Toeplitz Covariance Matrices, T. Tony Cai, Zhao Ren, and Harrison H. Zhou

PDF

Sharp RIP Bound for Sparse Signal and Low-Rank Matrix Recovery, T. Tony Cai and Anru Zhang

PDF

Compressed Sensing and Affine Rank Minimization Under Restricted Isometry, T. Tony Cai and Anru Zhang

PDF

A Max-Norm Constrained Minimization Approach to 1-Bit Matrix Completion, T. Tony Cai and Wen-Xin Zhou

PDF

Stress Functions for Nonlinear Dimension Reduction, Proximity Analysis, and Graph Drawing, Lisha Chen and Andreas Buja

PDF

Experiments With Spectral Learning of Latent-Variable PCFGs, Shay B. Cohen, Karl Stratos, Michael Collins, Dean P. Foster, and Lyle H. Ungar