Classification and testing under robust model assumptions

<a href="https://embed.stanford.edu/iframe/?url=https%3A%2F%2Fpurl.stanford.edu%2Ftk432pn5208" class="su-underline">Show Content</a>

Abstract/Contents

Abstract: In this thesis, we develop methodologies for two well-known problems under more realistic and robust model assumptions. First, we consider the target gene identification problem using gene perturbation data. We propose a simple data-adaptive model by incorporating information across the genome as an alternative to the traditional two-group model testing for differential gene expression. The data-adaptive model is more robust than the traditional two-group model without assuming zero-effect for the null model. Second, we consider a multi-class classification problem where the training and the out-of-sample test data may have different distributions. We propose a method that simultaneously makes good predictions for samples similar to the training data and makes rejections otherwise, as well as methods for performance evaluation in this mismatched setting. The proposed method is robust to distributional changes in the data distribution and has good performance in high dimension.

Type of resource	text
Form	electronic resource; remote; computer; online resource
Extent	1 online resource.
Place	California
Place	[Stanford, California]
Publisher	[Stanford University]
Copyright date	2019; ©2019
Publication date	2019; 2019
Issuance	monographic
Language	English

Author	Guan, Leying
Degree supervisor	Tibshirani, Robert
Degree supervisor	Wong, Wing Ho
Thesis advisor	Tibshirani, Robert
Thesis advisor	Wong, Wing Ho
Thesis advisor	Efron, Bradley
Degree committee member	Efron, Bradley
Associated with	Stanford University, Department of Statistics.

Genre	Theses
Genre	Text

Statement of responsibility	Leying Guan.
Note	Submitted to the Department of Statistics.
Thesis	Thesis Ph.D. Stanford University 2019.
Location	electronic resource

License: This work is licensed under a Creative Commons Attribution Non Commercial 3.0 Unported license (CC BY-NC).

View in SearchWorks

Loading usage metrics...