Submodular optimization in massive datasets

Placeholder Show Content

Abstract/Contents

Abstract
In this thesis, we study the problem of submodular maximization in various ``massive data" models, namely the MapReduce, streaming, and adaptive models. For the MapReduce model, we give a simple $O(1/\varepsilon)$-round algorithm for maximizing a monotone submodular function with respect to a cardinality constraint. This algorithm achieves an optimal $1-1/e$ approximation ratio, is simple to implement in practice, and removes the data duplication issue of prior works obtaining the same approximation ratio. For the streaming model, we study both the random order and the adversarial order settings. In both settings, we give the first multi-pass approximation algorithm for maximizing monotone submodular functions with respect to a matroid constraint. Our algorithms achieve $1-1/e-\eps$ approximations in $poly(1/\eps)$ passes. For the adversarial case, it is known that our algorithm is tight -- it is impossible to achieve better than a $1/2$ approximation in a single pass, and anything better than $1-1/e$ requires linearly many passes. Our techniques also allows us to develop single pass and multi-pass algorithms for the $p$-matchoid constraint, which generalizes the popular matroid ($p=1$) and intersection-of-matroids constraint ($p=2$). In the adaptive model, we study the problem of maximizing an unconstrained non-monotone submodular function. We show the following: for any problem instance, and every $\delta > 0$, either we obtain a $(1/2-\delta)$-approximation in 1 round, or a $(1/2 + \Omega(\delta^2))$-approximation in $\Omega(1/\delta^2)$ rounds. In particular (and in contrast to known lower bounds for the monotone cardinality constrained case), there cannot be an instance where (i) it is impossible to achieve an approximation factor better than $1/2$ regardless of the number of rounds, and (ii) it takes $r$ rounds to achieve a factor of $1/2-O(1/r)$.

Description

Type of resource text
Form electronic resource; remote; computer; online resource
Extent 1 online resource.
Place California
Place [Stanford, California]
Publisher [Stanford University]
Copyright date 2022; ©2022
Publication date 2022; 2022
Issuance monographic
Language English

Creators/Contributors

Author Liu, Paul
Degree supervisor Charikar, Moses
Degree supervisor Vondrak, Ján, (Mathematician)
Thesis advisor Charikar, Moses
Thesis advisor Vondrak, Ján, (Mathematician)
Thesis advisor Rubinstein, Aviad
Degree committee member Rubinstein, Aviad
Associated with Stanford University, Computer Science Department

Subjects

Genre Theses
Genre Text

Bibliographic information

Statement of responsibility Paul Liu.
Note Submitted to the Computer Science Department.
Thesis Thesis Ph.D. Stanford University 2022.
Location https://purl.stanford.edu/vw164nb4193

Access conditions

Copyright
© 2022 by Paul Liu
License
This work is licensed under a Creative Commons Attribution 3.0 Unported license (CC BY).

Also listed in

Loading usage metrics...