Classification and testing under robust model assumptions

Placeholder Show Content

Abstract/Contents

Abstract
In this thesis, we develop methodologies for two well-known problems under more realistic and robust model assumptions. First, we consider the target gene identification problem using gene perturbation data. We propose a simple data-adaptive model by incorporating information across the genome as an alternative to the traditional two-group model testing for differential gene expression. The data-adaptive model is more robust than the traditional two-group model without assuming zero-effect for the null model. Second, we consider a multi-class classification problem where the training and the out-of-sample test data may have different distributions. We propose a method that simultaneously makes good predictions for samples similar to the training data and makes rejections otherwise, as well as methods for performance evaluation in this mismatched setting. The proposed method is robust to distributional changes in the data distribution and has good performance in high dimension.

Description

Type of resource text
Form electronic resource; remote; computer; online resource
Extent 1 online resource.
Place California
Place [Stanford, California]
Publisher [Stanford University]
Copyright date 2019; ©2019
Publication date 2019; 2019
Issuance monographic
Language English

Creators/Contributors

Author Guan, Leying
Degree supervisor Tibshirani, Robert
Degree supervisor Wong, Wing Ho
Thesis advisor Tibshirani, Robert
Thesis advisor Wong, Wing Ho
Thesis advisor Efron, Bradley
Degree committee member Efron, Bradley
Associated with Stanford University, Department of Statistics.

Subjects

Genre Theses
Genre Text

Bibliographic information

Statement of responsibility Leying Guan.
Note Submitted to the Department of Statistics.
Thesis Thesis Ph.D. Stanford University 2019.
Location electronic resource

Access conditions

Copyright
© 2019 by Leying Guan
License
This work is licensed under a Creative Commons Attribution Non Commercial 3.0 Unported license (CC BY-NC).

Also listed in

Loading usage metrics...