Laplacian regularized stratified model fitting

Placeholder Show Content

Abstract/Contents

Abstract
A large number of practical problems in optimization involve the fitting of models with categorical features. These problems arise in a variety of fields, such as machine learning, statistics, finance, medicine, and data analysis. A common and simple paradigm parameterizes the model by a parameter vector, and uses convex optimization to minimize an empirical loss on a training data set plus a regularization term that (hopefully) skews the model parameter toward one for which the model generalizes to out of sample data. In this dissertation, we discuss a framework of model fitting methods that depend on a number of categorical features. We build a model of some data that depends in an arbitrary way on these categorical features, by building a separate model for the data that take on each of the possible values of these categorical features. We call these models stratified models. Furthermore, we propose augmenting a basic stratified model fitting problem with an additional regularization term, Laplacian regularization, which encourages the parameters found for each value to be close to their neighbors on some specified weighted graph on the categorical values. We begin this dissertation with an early work on solving Laplacian regularized problems. We then define these models as Laplacian regularized stratified models, and propose an efficient distributed solution method based on the alternating direction method of multipliers (ADMM), which allows us to solve the Laplacian regularized stratified model fitting problem in a distributed fashion and at large scale, in spite of the model parameters being coupled due to the Laplacian regularization term. We then provide an extension of Laplacian regularized stratified models, where we show that if the model parameters vary smoothly over the graph Laplacian, then it can be well approximated by a linear combination of a small number of the Laplacian eigenvectors, which can drastically reduce the number of variables in the problem. Finally, we provide two examples: one example of fitting Laplacian regularized Gaussian models, and another, much larger example which brings the ideas of this paper together by fitting Laplacian regularized stratified risk and return models to used in a realistic quantitative finance setting.

Description

Type of resource text
Form electronic resource; remote; computer; online resource
Extent 1 online resource.
Place California
Place [Stanford, California]
Publisher [Stanford University]
Copyright date 2021; ©2021
Publication date 2021; 2021
Issuance monographic
Language English

Creators/Contributors

Author Tuck, Jonathan Eric
Degree supervisor Boyd, Stephen P
Thesis advisor Boyd, Stephen P
Thesis advisor Osgood, Brad
Thesis advisor Srivastava, Ashok
Degree committee member Osgood, Brad
Degree committee member Srivastava, Ashok
Associated with Stanford University, Department of Electrical Engineering

Subjects

Genre Theses
Genre Text

Bibliographic information

Statement of responsibility Jonathan Eric Tuck.
Note Submitted to the Department of Electrical Engineering.
Thesis Thesis Ph.D. Stanford University 2021.
Location https://purl.stanford.edu/zm017yr2171

Access conditions

Copyright
© 2021 by Jonathan Eric Tuck

Also listed in

Loading usage metrics...