Measuring sample quality with Stein's method

Placeholder Show Content

Abstract/Contents

Abstract
As the size of datasets has grown, classical methods like Markov chain Monte Carlo have become increasingly burdensome from a computational perspective. Practitioners have been turning to biased Markov chain Monte Carlo procedures that are able to trade off asymptotic exactness for computational speed. Unfortunately, previously used diagnostics to aid with these methods are insufficient for assessing the asymptotic bias incurred. We will first introduce a new computable quality measure based on Stein's method that quantifies the maximum discrepancy between sample and target expectations over a large class of test functions. Our first main theoretical contribution will be showing that our measure converges to zero only if a sample converges to its target distribution. Empirically we will show this discrepancy avoids the problems faced by previous diagnostics, e.g., effective sample size. Our next step will be to generalize these ideas to cover a larger class of target distributions. By studying Ito diffusions with fast-mixing rates, we will be able to extend the purview of acceptable target distributions from distributions with strongly log-concave densities with bounded third and fourth derivatives to a much larger class that includes multimodal and heavy-tailed distributions. Finally, in the last chapter, we will study a variation of our previous methods that is computationally feasible to obtain for much larger samples. This variation will combine our ideas with those from reproducing kernel Hilbert spaces to define a closed-form expression for our measure. While other authors have also recommended this faster variation, we show that most common choices for the kernel function are insufficient for controlling convergence in the multivariate setting. A special class of kernel functions that do indeed control convergence will be proposed and shown to dominate the other traditional kernel functions empirically.

Description

Type of resource text
Form electronic; electronic resource; remote
Extent 1 online resource.
Publication date 2017
Issuance monographic
Language English

Creators/Contributors

Associated with Gorham, Jackson
Associated with Stanford University, Department of Statistics.
Primary advisor Candès, Emmanuel J. (Emmanuel Jean)
Primary advisor Mackey, Lester
Thesis advisor Candès, Emmanuel J. (Emmanuel Jean)
Thesis advisor Mackey, Lester
Thesis advisor Holmes, Susan
Advisor Holmes, Susan

Subjects

Genre Theses

Bibliographic information

Statement of responsibility Jackson Gorham.
Note Submitted to the Department of Statistics.
Thesis Thesis (Ph.D.)--Stanford University, 2017.
Location electronic resource

Access conditions

Copyright
© 2017 by Jack Christopher Gorham
License
This work is licensed under a Creative Commons Attribution Non Commercial 3.0 Unported license (CC BY-NC).

Also listed in

Loading usage metrics...