Coupling optional pólya trees and the two sample problem

Placeholder Show Content

Abstract/Contents

Abstract
Testing and characterizing the difference between two data samples is of fundamental interest in statistics. Existing methods such as Kolmogorov-Smirnov and Cramer- von-Mises tests do not scale well as the dimensionality increases and provide no easy way to characterize the difference should it exist. In this work, we propose a theoretical framework for inference that addresses these challenges in the form of a prior for Bayesian nonparametric analysis. The new prior is constructed based on a random-partition-and-assignment procedure similar to the one that defines the standard optional Pólya tree distribution, but has the ability to generate multiple random distributions jointly. These random probability distributions are allowed to "couple", that is to have the same conditional distribution, on subsets of the sample space. We show that this "coupling optional Pólya tree" prior provides a convenient and effective way for both the testing of two sample difference and the learning of the underlying structure of the difference. We provide both simulated and real data examples to illustrate the work of this method and compare its performance to several existing methods, both frequentist and Bayesian. We demonstrate through an analysis of the Wellcome Trust Case-control Consortium data how the method can be applied to conduct multiple marker association tests. We also discuss some practical issues in the computational implementation of this prior and explore techniques that can be utilized to improve the computational efficiency in the appendices.

Description

Type of resource text
Form electronic; electronic resource; remote
Extent 1 online resource.
Publication date 2011
Issuance monographic
Language English

Creators/Contributors

Associated with Ma, Li
Associated with Stanford University, Department of Statistics
Primary advisor Wong, Wing Hung
Thesis advisor Wong, Wing Hung
Thesis advisor Siegmund, David, 1941-
Thesis advisor Tibshirani, Robert
Advisor Siegmund, David, 1941-
Advisor Tibshirani, Robert

Subjects

Genre Theses

Bibliographic information

Statement of responsibility Li Ma.
Note Submitted to the Department of Statistics.
Thesis Thesis (Ph.D.)--Stanford University, 2011.
Location electronic resource

Access conditions

Copyright
© 2011 by Li Ma
License
This work is licensed under a Creative Commons Attribution Non Commercial 3.0 Unported license (CC BY-NC).

Also listed in

Loading usage metrics...