Laplacian regularized stratified model fitting

Tuck, Jonathan Eric

Laplacian regularized stratified model fitting

<a href="https://embed.stanford.edu/iframe/?url=https%3A%2F%2Fpurl.stanford.edu%2Fzm017yr2171" class="su-underline">Show Content</a>

Abstract/Contents

Abstract: A large number of practical problems in optimization involve the fitting of models with categorical features. These problems arise in a variety of fields, such as machine learning, statistics, finance, medicine, and data analysis. A common and simple paradigm parameterizes the model by a parameter vector, and uses convex optimization to minimize an empirical loss on a training data set plus a regularization term that (hopefully) skews the model parameter toward one for which the model generalizes to out of sample data. In this dissertation, we discuss a framework of model fitting methods that depend on a number of categorical features. We build a model of some data that depends in an arbitrary way on these categorical features, by building a separate model for the data that take on each of the possible values of these categorical features. We call these models stratified models. Furthermore, we propose augmenting a basic stratified model fitting problem with an additional regularization term, Laplacian regularization, which encourages the parameters found for each value to be close to their neighbors on some specified weighted graph on the categorical values. We begin this dissertation with an early work on solving Laplacian regularized problems. We then define these models as Laplacian regularized stratified models, and propose an efficient distributed solution method based on the alternating direction method of multipliers (ADMM), which allows us to solve the Laplacian regularized stratified model fitting problem in a distributed fashion and at large scale, in spite of the model parameters being coupled due to the Laplacian regularization term. We then provide an extension of Laplacian regularized stratified models, where we show that if the model parameters vary smoothly over the graph Laplacian, then it can be well approximated by a linear combination of a small number of the Laplacian eigenvectors, which can drastically reduce the number of variables in the problem. Finally, we provide two examples: one example of fitting Laplacian regularized Gaussian models, and another, much larger example which brings the ideas of this paper together by fitting Laplacian regularized stratified risk and return models to used in a realistic quantitative finance setting.

Description

Type of resource	text
Form	electronic resource; remote; computer; online resource
Extent	1 online resource.
Place	California
Place	[Stanford, California]
Publisher	[Stanford University]
Copyright date	2021; ©2021
Publication date	2021; 2021
Issuance	monographic
Language	English

Creators/Contributors

Author	Tuck, Jonathan Eric
Degree supervisor	Boyd, Stephen P
Thesis advisor	Boyd, Stephen P
Thesis advisor	Osgood, Brad
Thesis advisor	Srivastava, Ashok
Degree committee member	Osgood, Brad
Degree committee member	Srivastava, Ashok
Associated with	Stanford University, Department of Electrical Engineering

Subjects

Genre	Theses
Genre	Text

Bibliographic information

Statement of responsibility	Jonathan Eric Tuck.
Note	Submitted to the Department of Electrical Engineering.
Thesis	Thesis Ph.D. Stanford University 2021.
Location	https://purl.stanford.edu/zm017yr2171

Access conditions

Also listed in

View in SearchWorks

Loading usage metrics...