Submodular optimization in massive datasets

Liu, Paul

Submodular optimization in massive datasets

<a href="https://embed.stanford.edu/iframe/?url=https%3A%2F%2Fpurl.stanford.edu%2Fvw164nb4193" class="su-underline">Show Content</a>

Abstract/Contents

Abstract: In this thesis, we study the problem of submodular maximization in various ``massive data" models, namely the MapReduce, streaming, and adaptive models. For the MapReduce model, we give a simple $O(1/\varepsilon)$-round algorithm for maximizing a monotone submodular function with respect to a cardinality constraint. This algorithm achieves an optimal $1-1/e$ approximation ratio, is simple to implement in practice, and removes the data duplication issue of prior works obtaining the same approximation ratio. For the streaming model, we study both the random order and the adversarial order settings. In both settings, we give the first multi-pass approximation algorithm for maximizing monotone submodular functions with respect to a matroid constraint. Our algorithms achieve $1-1/e-\eps$ approximations in $poly(1/\eps)$ passes. For the adversarial case, it is known that our algorithm is tight -- it is impossible to achieve better than a $1/2$ approximation in a single pass, and anything better than $1-1/e$ requires linearly many passes. Our techniques also allows us to develop single pass and multi-pass algorithms for the $p$-matchoid constraint, which generalizes the popular matroid ($p=1$) and intersection-of-matroids constraint ($p=2$). In the adaptive model, we study the problem of maximizing an unconstrained non-monotone submodular function. We show the following: for any problem instance, and every $\delta > 0$, either we obtain a $(1/2-\delta)$-approximation in 1 round, or a $(1/2 + \Omega(\delta^2))$-approximation in $\Omega(1/\delta^2)$ rounds. In particular (and in contrast to known lower bounds for the monotone cardinality constrained case), there cannot be an instance where (i) it is impossible to achieve an approximation factor better than $1/2$ regardless of the number of rounds, and (ii) it takes $r$ rounds to achieve a factor of $1/2-O(1/r)$.

Description

Type of resource	text
Form	electronic resource; remote; computer; online resource
Extent	1 online resource.
Place	California
Place	[Stanford, California]
Publisher	[Stanford University]
Copyright date	2022; ©2022
Publication date	2022; 2022
Issuance	monographic
Language	English

Creators/Contributors

Author	Liu, Paul
Degree supervisor	Charikar, Moses
Degree supervisor	Vondrak, Ján, (Mathematician)
Thesis advisor	Charikar, Moses
Thesis advisor	Vondrak, Ján, (Mathematician)
Thesis advisor	Rubinstein, Aviad
Degree committee member	Rubinstein, Aviad
Associated with	Stanford University, Computer Science Department

Subjects

Genre	Theses
Genre	Text

Bibliographic information

Statement of responsibility	Paul Liu.
Note	Submitted to the Computer Science Department.
Thesis	Thesis Ph.D. Stanford University 2022.
Location	https://purl.stanford.edu/vw164nb4193

Access conditions

License: This work is licensed under a Creative Commons Attribution 3.0 Unported license (CC BY).

Also listed in

View in SearchWorks

Loading usage metrics...