Topics in selective inference

Tian, Xiaoying; Stanford University, Department of Statistics.

Topics in selective inference

<a href="https://embed.stanford.edu/iframe/?url=https%3A%2F%2Fpurl.stanford.edu%2Fkk317cm9935" class="su-underline">Show Content</a>

Abstract/Contents

Abstract: This thesis addresses problems of inference and estimation after using the data to select a reasonable statistical model. In particular, it uses the conditional approach espoused in selective inference. This approach allows us to compute exact level-$\alpha$ tests and 1-$\alpha$ confidence intervals through the study of the conditional distributions that conditions on the selection event. Chapter 2 -- based on Tian and Taylor [2015b] -- introduces the concept of randomized selection. The selectively valid tests after randomized selection are more powerful. It allows consistent estimation and weak convergence of selective inference procedures, which extends selective inference to nonparametric settings. Finally, we propose a framework of inference after combining multiple randomized selection procedures, providing the framework for adaptive data analysis. Chapter 3 -- based on Tian et al. [2016a, b] -- proposes a novel approach of computing the selective tests through MCMC sampling. The novelty lies in the reparametrization of the conditional distribution, which makes sampling feasible for selection after solving a general penalized regression problem. This, together with the result in Tian and Taylor [2015b] allows selective inference with arbitrary data generating distribution and general model selection through penalized regression. Applications include LASSO, forward stepwise, stagewise algorithms, marginal screening and generalized LASSO, where selective tests are hard to compute otherwise. Chapter 4 -- based on Tian [2016] -- takes a different perspective at the impact of model selection. In addition to statistical inference, the effect of model selection is also manifested in the estimation of prediction error. Although the prediction error of a linear estimator can be computed through the Cp formula, it is not valid when variables used to construct it are also selected using the data. In this work, I propose a method for estimating prediction error of linear estimators after arbitrary model selection procedures. Direct applications include, estimating degrees of freedom of arbitrary selection procedures and heritability estimation in genetics.

Description

Type of resource	text
Form	electronic; electronic resource; remote
Extent	1 online resource.
Publication date	2017
Issuance	monographic
Language	English

Creators/Contributors

Associated with	Tian, Xiaoying
Associated with	Stanford University, Department of Statistics.
Primary advisor	Taylor, Jonathan
Thesis advisor	Taylor, Jonathan
Thesis advisor	Hastie, Trevor
Thesis advisor	Tibshirani, Robert
Advisor	Hastie, Trevor
Advisor	Tibshirani, Robert

Subjects

Genre	Theses

Bibliographic information

Statement of responsibility	Xiaoying Tian.
Note	Submitted to the Department of Statistics.
Thesis	Thesis (Ph.D.)--Stanford University, 2017.
Location	electronic resource

Access conditions

License: This work is licensed under a Creative Commons Attribution Non Commercial 3.0 Unported license (CC BY-NC).

Also listed in

View in SearchWorks

Loading usage metrics...